I'm trying to create a Shared library (*.so) in ASM and I'm not sure that i do it correct... My code is: <pre class="prettyprint"><code> .section .data .globl var1 var1: .quad 0x012345 .section .text .globl func1 func1: xor %rax, %rax # mov var1, %rcx # this is commented ret </code></pre> To compile it i run <pre class="prettyprint"><code>gcc ker.s -g -fPIC -m64 -o ker.o gcc ker.o -shared -fPIC -m64 -o libker.so </code></pre> I can access variable var1 and call func1 with dlopen() and dlsym() from a program in C. The problem is in variable var1. When i try to access it from func1, i.e. uncomment that line, the compiler generates an error: <pre class="prettyprint"><code>/usr/bin/ld: ker.o: relocation R_X86_64_32S against `var1' can not be used when making a shared object; recompile with -fPIC ker.o: could not read symbols: Bad value collect2: ld returned 1 exit status </code></pre> I don't understand. I've already compiled with -fPIC, so what's wrong?

<blockquote> I've already compiled with -fPIC, so what's wrong? </blockquote> That part of the error message is for people who are linking compiler-generated code. You're writing asm by hand, so as datenwolf correctly wrote, when writing a shared library in assembly, you have to take care for yourself that the code is position independent. This means file must not contain any 32-bit absolute addresses (because relocation to an arbitrary 64-bit base is impossible). 64-bit absolute relocations are supported, but normally you should only use that for jump tables. <hr> <code>mov var1, %rcx</code> uses a 32-bit absolute addressing mode. You should normally never do this, even in position-dependent x86-64 code. The normal use-cases for 32-bit absolute addresses are: putting an address into a 64-bit register with<code>mov $var1, %edi</code> (zero-extends into RDI) and indexing static arrays: <code>mov arr(,%rdx,4), %edx</code> <code>mov var1(%rip), %rcx</code> uses a RIP-relative 32-bit offset. It's the efficient way to address static data, and compilers always use this even without <code>-fPIE</code> or <code>-fPIC</code> for static/global variables. You have basically two possibilities: <ul> <li> Normal library-private static data, like C compilers will make for <code>__attribute__((visibility("hidden"))) long var1;</code>, same as for <code>-fno-PIC</code>. <pre class="prettyprint"><code>.data .globl var1 # linkable from other .o files in the same shared object / library .hidden var1 # not visible for *dynamic* linking outside the library var1: .quad 0x012345 .text .globl func1 func1: xor %eax, %eax # return 0 mov var1(%rip), %rcx ret </code></pre> </li> <li> full symbol-interposition-aware code like compilers generate for <code>-fPIC</code>. You have to use the Global Offset Table. This is how a compiler does it, if you tell him to produce code for a shared library. Note that this comes with a performance hit because of the additional indirection. See Sorry state of dynamic libraries on Linux for more about symbol-interposition and the overheads it imposes on code-gen for shared libraries if you're not careful about restricting symbol visibility to allow inlining. <code>var1@GOTPCREL</code> is the address of a pointer to your <code>var1</code>, the pointer itself is reachable with rip-relative addressing, while the content (the address of <code>var1</code>) is filled by the linker during loading of the library. This supports the case where the program using your library defined <code>var1</code>, so <code>var1</code> in your library should resolve to that memory location instead of the one in the <code>.data</code> or <code>.bss</code> (or <code>.text</code>) of your <code>.so</code>. <pre class="prettyprint"><code> .section .data .globl var1 # without .hidden var1: .quad 0x012345 .section .text .globl func1 func1: xor %eax, %eax mov var1@GOTPCREL(%rip), %rcx mov (%rcx), %rcx ret </code></pre> </li> </ul> See some additional information at http://www.bottomupcs.com/global_offset_tables.html An example on the Godbolt compiler explorer of <code>-fPIC</code> vs. <code>-fPIE</code> shows the difference that symbol-interposition makes for getting the address of non-hidden global variables: <ul> <li> <code>movl $x, %eax</code> 5 bytes, <code>-fno-pie</code> </li> <li> <code>leaq x(%rip), %rax</code> 7 bytes, <code>-fPIE</code> and hidden globals or <code>static</code> with <code>-fPIC</code> </li> <li> <code>y@GOTPCREL(%rip), %rax</code> 7 bytes and a load instead of just ALU, <code>-fPIC</code> with non-hidden globals.</li> </ul> Actually loading always uses <code>x(%rip)</code>, except for non-hidden / non-<code>static</code> vars with <code>-fPIC</code> where it has to get the runtime address from the GOT first, because it's not a link-time constant offset relative to the code. Related: 32-bit absolute addresses no longer allowed in x86-64 Linux? (PIE executables). <hr> A previous version of this answer stated that the DATA and BSS segments could move relative to TEXT when loading a dynamic library. This is incorrect, only the library base address is relocatable. RIP-relative access to other segments within the same library is guaranteed to be ok, and compilers emit code that does this. The ELF headers specify how the segments (which contain the sections) need to be loaded/mapped into memory.

ELF Shared Object in x86-64 Assembly language

Tags:

gcc

assembly

x86-64

linker

shared-libraries

I'm trying to create a Shared library (*.so) in ASM and I'm not sure that i do it correct...

My code is:

    .section .data
    .globl var1
var1:
    .quad     0x012345

    .section .text
    .globl func1
func1:
    xor %rax, %rax
  # mov var1, %rcx       # this is commented
    ret

To compile it i run

gcc ker.s -g -fPIC -m64 -o ker.o
gcc ker.o -shared -fPIC -m64 -o libker.so

I can access variable var1 and call func1 with dlopen() and dlsym() from a program in C.

The problem is in variable var1. When i try to access it from func1, i.e. uncomment that line, the compiler generates an error:

/usr/bin/ld: ker.o: relocation R_X86_64_32S against `var1' can not be used when making a shared object; recompile with -fPIC
ker.o: could not read symbols: Bad value
collect2: ld returned 1 exit status

I don't understand. I've already compiled with -fPIC, so what's wrong?

394

asked Feb 18 '12 12:02

zorgit

1 Answers

I've already compiled with -fPIC, so what's wrong?

That part of the error message is for people who are linking compiler-generated code.

You're writing asm by hand, so as datenwolf correctly wrote, when writing a shared library in assembly, you have to take care for yourself that the code is position independent.

This means file must not contain any 32-bit absolute addresses (because relocation to an arbitrary 64-bit base is impossible). 64-bit absolute relocations are supported, but normally you should only use that for jump tables.

mov var1, %rcx uses a 32-bit absolute addressing mode. You should normally never do this, even in position-dependent x86-64 code. The normal use-cases for 32-bit absolute addresses are: putting an address into a 64-bit register withmov $var1, %edi (zero-extends into RDI)
and indexing static arrays: mov arr(,%rdx,4), %edx

mov var1(%rip), %rcx uses a RIP-relative 32-bit offset. It's the efficient way to address static data, and compilers always use this even without -fPIE or -fPIC for static/global variables.

You have basically two possibilities:

Normal library-private static data, like C compilers will make for __attribute__((visibility("hidden"))) long var1;, same as for -fno-PIC.

.data
    .globl var1       # linkable from other .o files in the same shared object / library
    .hidden var1      # not visible for *dynamic* linking outside the library
var1:
    .quad     0x012345

.text
    .globl func1
func1:
    xor  %eax, %eax             # return 0
    mov  var1(%rip), %rcx   
    ret

full symbol-interposition-aware code like compilers generate for -fPIC.

You have to use the Global Offset Table. This is how a compiler does it, if you tell him to produce code for a shared library. Note that this comes with a performance hit because of the additional indirection.

See Sorry state of dynamic libraries on Linux for more about symbol-interposition and the overheads it imposes on code-gen for shared libraries if you're not careful about restricting symbol visibility to allow inlining.

var1@GOTPCREL is the address of a pointer to your var1, the pointer itself is reachable with rip-relative addressing, while the content (the address of var1) is filled by the linker during loading of the library. This supports the case where the program using your library defined var1, so var1 in your library should resolve to that memory location instead of the one in the .data or .bss (or .text) of your .so.
```
    .section .data
    .globl var1
    # without .hidden
var1:
    .quad     0x012345

    .section .text
    .globl func1
func1:
    xor %eax, %eax
    mov var1@GOTPCREL(%rip), %rcx
    mov (%rcx), %rcx
    ret
```

See some additional information at http://www.bottomupcs.com/global_offset_tables.html

An example on the Godbolt compiler explorer of -fPIC vs. -fPIE shows the difference that symbol-interposition makes for getting the address of non-hidden global variables:

movl $x, %eax 5 bytes, -fno-pie
leaq x(%rip), %rax 7 bytes, -fPIE and hidden globals or static with -fPIC
y@GOTPCREL(%rip), %rax 7 bytes and a load instead of just ALU, -fPIC with non-hidden globals.

Actually loading always uses x(%rip), except for non-hidden / non-static vars with -fPIC where it has to get the runtime address from the GOT first, because it's not a link-time constant offset relative to the code.

Related: 32-bit absolute addresses no longer allowed in x86-64 Linux? (PIE executables).

A previous version of this answer stated that the DATA and BSS segments could move relative to TEXT when loading a dynamic library. This is incorrect, only the library base address is relocatable. RIP-relative access to other segments within the same library is guaranteed to be ok, and compilers emit code that does this. The ELF headers specify how the segments (which contain the sections) need to be loaded/mapped into memory.

125

answered Sep 16 '22 21:09

Gunther Piez

Related questions
                            
                                how to implement AES128 encryption/decryption using AES-NI instructions and GCC
                            
                                How to add compiler flags on codeblocks
                            
                                Meaning of yywrap() in flex
                            
                                How do I find my program's main(...) function?
                            
                                How do I tell if a C integer variable is signed?
                            
                                Porting windows code, what to use instead of __int64 _tmain and _TCHAR*?
                            
                                Is there any way to do 128-bit shifts on gcc <4.4?
                            
                                ... with constructor not allowed in union problem
                            
                                Structure alignment in GCC (should alignment be specified in typedef?)
                            
                                Is there any way to get gcc or clang to warn on explicit casts?
                            
                                Object file to binary code
                            
                                Building OpenCV as static libraries
                            
                                gcc shared library failed linking to glibc
                            
                                how to iterate all regex matches in a std::string with their starting positions in c++11 std::regex?
                            
                                Wrong gcc generated assembly ordering, results in performance hit
                            
                                In class static const ODR
                            
                                Boost build fails C++11 feature checks when using (custom) GCC 4.x or 5.x
                            
                                gcc gives error while using fmod()
                            
                                How to use the __attribute__ keyword in GCC C?
                            
                                Debugging Segmentation Faults on a Mac?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With