How does C++ linking work in practice? What I am looking for is a detailed explanation about how the linking happens, and not what commands do the linking. There's already a similar question about compilation which doesn't go into too much detail: How does the compilation/linking process work?

EDIT: I have moved this answer to the duplicate: https://stackoverflow.com/a/33690144/895245 This answer focuses on address relocation, which is one of the crucial functions of linking. A minimal example will be used to clarify the concept. <h3>0) Introduction</h3> Summary: relocation edits the <code>.text</code> section of object files to translate: <ul> <li>object file address</li> <li>into the final address of the executable</li> </ul> This must be done by the linker because the compiler only sees one input file at a time, but we must know about all object files at once to decide how to: <ul> <li>resolve undefined symbols like declared undefined functions</li> <li>not clash multiple <code>.text</code> and <code>.data</code> sections of multiple object files</li> </ul> Prerequisites: minimal understanding of: <ul> <li>x86-64 or IA-32 assembly</li> <li>global structure of an ELF file. I have made a tutorial for that </li> </ul> Linking has nothing to do with C or C++ specifically: compilers just generate the object files. The linker then takes them as input without ever knowing what language compiled them. It might as well be Fortran. So to reduce the crust, let's study a NASM x86-64 ELF Linux hello world: <pre class="prettyprint"><code>section .data hello_world db "Hello world!", 10 section .text global _start _start: ; sys_write mov rax, 1 mov rdi, 1 mov rsi, hello_world mov rdx, 13 syscall ; sys_exit mov rax, 60 mov rdi, 0 syscall </code></pre> compiled and assembled with: <pre class="prettyprint"><code>nasm -felf64 hello_world.asm # creates hello_world.o ld -o hello_world.out hello_world.o # static ELF executable with no libraries </code></pre> with NASM 2.10.09. <h3>1) .text of .o</h3> First we decompile the <code>.text</code> section of the object file: <pre class="prettyprint"><code>objdump -d hello_world.o </code></pre> which gives: <pre class="prettyprint"><code>0000000000000000 <_start>: 0: b8 01 00 00 00 mov $0x1,%eax 5: bf 01 00 00 00 mov $0x1,%edi a: 48 be 00 00 00 00 00 movabs $0x0,%rsi 11: 00 00 00 14: ba 0d 00 00 00 mov $0xd,%edx 19: 0f 05 syscall 1b: b8 3c 00 00 00 mov $0x3c,%eax 20: bf 00 00 00 00 mov $0x0,%edi 25: 0f 05 syscall </code></pre> the crucial lines are: <pre class="prettyprint"><code> a: 48 be 00 00 00 00 00 movabs $0x0,%rsi 11: 00 00 00 </code></pre> which should move the address of the hello world string into the <code>rsi</code> register, which is passed to the write system call. But wait! How can the compiler possibly know where <code>"Hello world!"</code> will end up in memory when the program is loaded? Well, it can't, specially after we link a bunch of <code>.o</code> files together with multiple <code>.data</code> sections. Only the linker can do that since only he will have all those object files. So the compiler just: <ul> <li>puts a placeholder value <code>0x0</code> on the compiled output</li> <li>gives some extra information to the linker of how to modify the compiled code with the good addresses</li> </ul> This "extra information" is contained in the <code>.rela.text</code> section of the object file <h3>2) .rela.text</h3> <code>.rela.text</code> stands for "relocation of the .text section". The word relocation is used because the linker will have to relocate the address from the object into the executable. We can disassemble the <code>.rela.text</code> section with: <pre class="prettyprint"><code>readelf -r hello_world.o </code></pre> which contains; <pre class="prettyprint"><code>Relocation section '.rela.text' at offset 0x340 contains 1 entries: Offset Info Type Sym. Value Sym. Name + Addend 00000000000c 000200000001 R_X86_64_64 0000000000000000 .data + 0 </code></pre> The format of this section is fixed documented at: http://www.sco.com/developers/gabi/2003-12-17/ch4.reloc.html Each entry tells the linker about one address which needs to be relocated, here we have only one for the string. Simplifying a bit, for this particular line we have the following information: <ul> <li> <code>Offset = C</code>: what is the first byte of the <code>.text</code> that this entry changes. If we look back at the decompiled text, it is exactly inside the critical <code>movabs $0x0,%rsi</code>, and those that know x86-64 instruction encoding will notice that this encodes the 64-bit address part of the instruction. </li> <li> <code>Name = .data</code>: the address points to the <code>.data</code> section </li> <li> <code>Type = R_X86_64_64</code>, which specifies what exactly what calculation has to be done to translate the address. This field is actually processor dependent, and thus documented on the AMD64 System V ABI extension section 4.4 "Relocation". That document says that <code>R_X86_64_64</code> does: <ul> <li> <code>Field = word64</code>: 8 bytes, thus the <code>00 00 00 00 00 00 00 00</code> at address <code>0xC</code> </li> <li> <code>Calculation = S + A</code> <ul> <li> <code>S</code> is value at the address being relocated, thus <code>00 00 00 00 00 00 00 00</code> </li> <li> <code>A</code> is the addend which is <code>0</code> here. This is a field of the relocation entry.</li> </ul> So <code>S + A == 0</code> and we will get relocated to the very first address of the <code>.data</code> section. </li> </ul> </li> </ul> <h3>3) .text of .out</h3> Now lets look at the text area of the executable <code>ld</code> generated for us: <pre class="prettyprint"><code>objdump -d hello_world.out </code></pre> gives: <pre class="prettyprint"><code>00000000004000b0 <_start>: 4000b0: b8 01 00 00 00 mov $0x1,%eax 4000b5: bf 01 00 00 00 mov $0x1,%edi 4000ba: 48 be d8 00 60 00 00 movabs $0x6000d8,%rsi 4000c1: 00 00 00 4000c4: ba 0d 00 00 00 mov $0xd,%edx 4000c9: 0f 05 syscall 4000cb: b8 3c 00 00 00 mov $0x3c,%eax 4000d0: bf 00 00 00 00 mov $0x0,%edi 4000d5: 0f 05 syscall </code></pre> So the only thing that changed from the object file are the critical lines: <pre class="prettyprint"><code> 4000ba: 48 be d8 00 60 00 00 movabs $0x6000d8,%rsi 4000c1: 00 00 00 </code></pre> which now point to the address <code>0x6000d8</code> (<code>d8 00 60 00 00 00 00 00</code> in little-endian) instead of <code>0x0</code>. Is this the right location for the <code>hello_world</code> string? To decide we have to check the program headers, which tell Linux where to load each section. We disassemble them with: <pre class="prettyprint"><code>readelf -l hello_world.out </code></pre> which gives: <pre class="prettyprint"><code>Program Headers: Type Offset VirtAddr PhysAddr FileSiz MemSiz Flags Align LOAD 0x0000000000000000 0x0000000000400000 0x0000000000400000 0x00000000000000d7 0x00000000000000d7 R E 200000 LOAD 0x00000000000000d8 0x00000000006000d8 0x00000000006000d8 0x000000000000000d 0x000000000000000d RW 200000 Section to Segment mapping: Segment Sections... 00 .text 01 .data </code></pre> This tells us that the <code>.data</code> section, which is the second one, starts at <code>VirtAddr</code> = <code>0x06000d8</code>. And the only thing on the data section is our hello world string.

How does C++ linking work in practice? [duplicate]

1 Answers

EDIT: I have moved this answer to the duplicate: https://stackoverflow.com/a/33690144/895245

This answer focuses on address relocation, which is one of the crucial functions of linking.

A minimal example will be used to clarify the concept.

0) Introduction

Summary: relocation edits the .text section of object files to translate:

object file address
into the final address of the executable

This must be done by the linker because the compiler only sees one input file at a time, but we must know about all object files at once to decide how to:

resolve undefined symbols like declared undefined functions
not clash multiple .text and .data sections of multiple object files

Prerequisites: minimal understanding of:

x86-64 or IA-32 assembly
global structure of an ELF file. I have made a tutorial for that

Linking has nothing to do with C or C++ specifically: compilers just generate the object files. The linker then takes them as input without ever knowing what language compiled them. It might as well be Fortran.

So to reduce the crust, let's study a NASM x86-64 ELF Linux hello world:

section .data     hello_world db "Hello world!", 10 section .text     global _start     _start:          ; sys_write         mov rax, 1         mov rdi, 1         mov rsi, hello_world         mov rdx, 13         syscall          ; sys_exit         mov rax, 60         mov rdi, 0         syscall

compiled and assembled with:

nasm -felf64 hello_world.asm            # creates hello_world.o ld -o hello_world.out hello_world.o     # static ELF executable with no libraries

with NASM 2.10.09.

1) .text of .o

First we decompile the .text section of the object file:

objdump -d hello_world.o

which gives:

0000000000000000 <_start>:    0:   b8 01 00 00 00          mov    $0x1,%eax    5:   bf 01 00 00 00          mov    $0x1,%edi    a:   48 be 00 00 00 00 00    movabs $0x0,%rsi   11:   00 00 00   14:   ba 0d 00 00 00          mov    $0xd,%edx   19:   0f 05                   syscall   1b:   b8 3c 00 00 00          mov    $0x3c,%eax   20:   bf 00 00 00 00          mov    $0x0,%edi   25:   0f 05                   syscall

the crucial lines are:

   a:   48 be 00 00 00 00 00    movabs $0x0,%rsi   11:   00 00 00

which should move the address of the hello world string into the rsi register, which is passed to the write system call.

But wait! How can the compiler possibly know where "Hello world!" will end up in memory when the program is loaded?

Well, it can't, specially after we link a bunch of .o files together with multiple .data sections.

Only the linker can do that since only he will have all those object files.

So the compiler just:

puts a placeholder value 0x0 on the compiled output
gives some extra information to the linker of how to modify the compiled code with the good addresses

This "extra information" is contained in the .rela.text section of the object file

2) .rela.text

.rela.text stands for "relocation of the .text section".

The word relocation is used because the linker will have to relocate the address from the object into the executable.

We can disassemble the .rela.text section with:

readelf -r hello_world.o

which contains;

Relocation section '.rela.text' at offset 0x340 contains 1 entries:   Offset          Info           Type           Sym. Value    Sym. Name + Addend 00000000000c  000200000001 R_X86_64_64       0000000000000000 .data + 0

The format of this section is fixed documented at: http://www.sco.com/developers/gabi/2003-12-17/ch4.reloc.html

Each entry tells the linker about one address which needs to be relocated, here we have only one for the string.

Simplifying a bit, for this particular line we have the following information:

Offset = C: what is the first byte of the .text that this entry changes.

If we look back at the decompiled text, it is exactly inside the critical movabs $0x0,%rsi, and those that know x86-64 instruction encoding will notice that this encodes the 64-bit address part of the instruction.
Name = .data: the address points to the .data section
Type = R_X86_64_64, which specifies what exactly what calculation has to be done to translate the address.

This field is actually processor dependent, and thus documented on the AMD64 System V ABI extension section 4.4 "Relocation".

That document says that R_X86_64_64 does:
- Field = word64: 8 bytes, thus the 00 00 00 00 00 00 00 00 at address 0xC
- Calculation = S + A
  - S is value at the address being relocated, thus 00 00 00 00 00 00 00 00
  - A is the addend which is 0 here. This is a field of the relocation entry.
  So S + A == 0 and we will get relocated to the very first address of the .data section.

3) .text of .out

Now lets look at the text area of the executable ld generated for us:

objdump -d hello_world.out

gives:

00000000004000b0 <_start>:   4000b0:   b8 01 00 00 00          mov    $0x1,%eax   4000b5:   bf 01 00 00 00          mov    $0x1,%edi   4000ba:   48 be d8 00 60 00 00    movabs $0x6000d8,%rsi   4000c1:   00 00 00   4000c4:   ba 0d 00 00 00          mov    $0xd,%edx   4000c9:   0f 05                   syscall   4000cb:   b8 3c 00 00 00          mov    $0x3c,%eax   4000d0:   bf 00 00 00 00          mov    $0x0,%edi   4000d5:   0f 05                   syscall

So the only thing that changed from the object file are the critical lines:

  4000ba:   48 be d8 00 60 00 00    movabs $0x6000d8,%rsi   4000c1:   00 00 00

which now point to the address 0x6000d8 (d8 00 60 00 00 00 00 00 in little-endian) instead of 0x0.

Is this the right location for the hello_world string?

To decide we have to check the program headers, which tell Linux where to load each section.

We disassemble them with:

readelf -l hello_world.out

which gives:

Program Headers:   Type           Offset             VirtAddr           PhysAddr                  FileSiz            MemSiz              Flags  Align   LOAD           0x0000000000000000 0x0000000000400000 0x0000000000400000                  0x00000000000000d7 0x00000000000000d7  R E    200000   LOAD           0x00000000000000d8 0x00000000006000d8 0x00000000006000d8                  0x000000000000000d 0x000000000000000d  RW     200000   Section to Segment mapping:   Segment Sections...    00     .text    01     .data

This tells us that the .data section, which is the second one, starts at VirtAddr = 0x06000d8.

And the only thing on the data section is our hello world string.

189

answered Oct 07 '22 13:10

Ciro Santilli 新疆再教育营六四事件法轮功郝海东

Related questions
                            
                                Why is C++'s NULL typically an integer literal rather than a pointer like in C?
                            
                                Calling overloaded operator () from object pointer
                            
                                How to set a timeout on blocking sockets in boost asio?
                            
                                Why is the sign different after subtracting unsigned and signed?
                            
                                To what extent is it acceptable to think of C++ pointers as memory addresses?
                            
                                Checking for a null object in C++
                            
                                Convert wchar_t to char
                            
                                C++ fastest way to clear or erase a vector
                            
                                Casting int to bool in C/C++
                            
                                std::unique_ptr usage
                            
                                Remove First and Last Character C++
                            
                                Does C/C++ offer any guarantee on minimal execution time?
                            
                                Getters and Setters. Is there performance overhead?
                            
                                Using default in a switch statement when switching over an enum
                            
                                How to enable gdb pretty printing for C++ STL objects in Eclipse CDT?
                            
                                Is there a C++ equivalent to getcwd?
                            
                                Boost PropertyTree: check if child exists
                            
                                std::chrono and cout
                            
                                Emacs C++-mode incorrect indentation?
                            
                                Forcing GCC to compile .cpp file as C

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How does C++ linking work in practice? [duplicate]

Tags:

c++

linker

Klaufir

People also ask