I've been reading ELF standard here. From what I understand, each ELF contains ELF header, program headers (why more than one?) and section headers. Can anyone please explain: <ol> <li>How are ELF files generated? is it the compiler responsibility?</li> <li>What are sections and why do we need them?</li> <li>What are program headers and why do we need them?</li> <li>Inside program headers, what's the meaning of the fields p_vaddr and p_paddr? </li> <li>Does each section have it's own section header? </li> </ol> Alternatively, does any one have a link to a more friendly documenation of ELF?

<ol> <li> How are ELF files generated? is it the compiler responsibility? They can be generated by a compiler, an assembler, or any other tool that can generate them. Even your own program you wrote for generating ELF files ;) They're just streams of bytes after all, so they can be generated by just writing bytes into a file in binary mode. You can do that too. </li> <li> What are sections and why do we need them? ELF files are subdivided into sections. Sections are the smallest continuous regions in the file. You can think of them as pages in an organizer, each with its own name and type that describes what does it contain inside. Linkers use this information to combine different parts of the program coming from different modules into one executable file or a library, by merging sections of the same type (gluing pages together, if you will). In executable files, sections are optional, but they're usually there to describe what's in the file and where does it begin, and how much bytes does it take. </li> <li> What are program headers and why do we need them? They're mostly for making executable files. In order to run a program, sections aren't enough, because you have to specify not only what's there in the file, but also where should it be loaded into memory in the running process. Program headers are just for that purpose: they describe segments, which are regions of memory in the running process, with different access privileges & stuff. Each program header describes one segment. It tells the loader where should it load a certain region in the file into memory and what permissions should it set for that region (e.g. should it be allowed to execute code from it? should it be writable or just for reading?) Segments can be further subdivided into sections. For example, if you have to specify that your code segment is further subdivided into code and static read-only strings for the messages the program displays. Or that your data segment is subdivided into funky data and hardcore data :J It's for you to decide. In executable files, sections are optional, but it's nice to have them, because they describe what's in the file and allow for dumping selected parts of it (e.g. with the <code>objdump</code> tool). Sometimes they are needed, though, for storing dynamic linking information, symbol tables, debugging information, stuff like that. </li> <li> Inside program headers, what's the meaning of the fields <code>p_vaddr</code> and <code>p_paddr</code>? Those are the addresses at which the data in the file will be loaded. They map the contents of the file into their corresponding memory locations. The first one is a virtual address, the second one is physical address. Physical addresses are the "raw" memory addresses. On modern operating systems, those are no longer used in the userland. Instead, userland programs use virtual addresses. The operating system deceives the userland program that it is alone in memory, and that the entire address space is available for it. Under the hood, the operating system maps those virtual addresses to physical ones in the actual memory, and it does it transparently to the program. Of course, not every address in the virtual address space is available at the same time. There are limitations imposed by the actual physical memory available. So the operating system just maps the memory for the segments the program actually uses (here's where the "segments" part from the ELF file's program headers comes into play). If the process tries to access some unmapped memory, the operating system steps in and says, "sorry, chap, this memory doesn't belong to you". (The program can address it, but it cannot access it.) </li> <li> Does each section have it's own section header? Yes. If it doesn't have an entry in the Section Headers Table, it's not a section :q Because they only way to tell if some part of the file is a section, is by looking in to the Section Headers Table which tells you what sections are defined in the file and where you can find them. You can think of the Section Headers Table as a table of contents in a book. Without the table of contents, there aren't any chapters after all, because they're not listed anywhere. The book may have headings, but the content is not subdivided into logical chapters that can be found through the table of contents. Same goes with sections in ELF files: there can be some regions of data, but you can't tell without the "table of contents" which is the SHT. </li> </ol>

ELF files - What is a section and why do we need it?

1 Answers

How are ELF files generated? is it the compiler responsibility?

They can be generated by a compiler, an assembler, or any other tool that can generate them. Even your own program you wrote for generating ELF files ;) They're just streams of bytes after all, so they can be generated by just writing bytes into a file in binary mode. You can do that too.
What are sections and why do we need them?

ELF files are subdivided into sections. Sections are the smallest continuous regions in the file. You can think of them as pages in an organizer, each with its own name and type that describes what does it contain inside. Linkers use this information to combine different parts of the program coming from different modules into one executable file or a library, by merging sections of the same type (gluing pages together, if you will).

In executable files, sections are optional, but they're usually there to describe what's in the file and where does it begin, and how much bytes does it take.
What are program headers and why do we need them?

They're mostly for making executable files. In order to run a program, sections aren't enough, because you have to specify not only what's there in the file, but also where should it be loaded into memory in the running process. Program headers are just for that purpose: they describe segments, which are regions of memory in the running process, with different access privileges & stuff.

Each program header describes one segment. It tells the loader where should it load a certain region in the file into memory and what permissions should it set for that region (e.g. should it be allowed to execute code from it? should it be writable or just for reading?)

Segments can be further subdivided into sections. For example, if you have to specify that your code segment is further subdivided into code and static read-only strings for the messages the program displays. Or that your data segment is subdivided into funky data and hardcore data :J It's for you to decide.

In executable files, sections are optional, but it's nice to have them, because they describe what's in the file and allow for dumping selected parts of it (e.g. with the objdump tool). Sometimes they are needed, though, for storing dynamic linking information, symbol tables, debugging information, stuff like that.
Inside program headers, what's the meaning of the fields p_vaddr and p_paddr?

Those are the addresses at which the data in the file will be loaded. They map the contents of the file into their corresponding memory locations. The first one is a virtual address, the second one is physical address.

Physical addresses are the "raw" memory addresses. On modern operating systems, those are no longer used in the userland. Instead, userland programs use virtual addresses. The operating system deceives the userland program that it is alone in memory, and that the entire address space is available for it. Under the hood, the operating system maps those virtual addresses to physical ones in the actual memory, and it does it transparently to the program.

Of course, not every address in the virtual address space is available at the same time. There are limitations imposed by the actual physical memory available. So the operating system just maps the memory for the segments the program actually uses (here's where the "segments" part from the ELF file's program headers comes into play). If the process tries to access some unmapped memory, the operating system steps in and says, "sorry, chap, this memory doesn't belong to you". (The program can address it, but it cannot access it.)
Does each section have it's own section header?

Yes. If it doesn't have an entry in the Section Headers Table, it's not a section :q Because they only way to tell if some part of the file is a section, is by looking in to the Section Headers Table which tells you what sections are defined in the file and where you can find them.

You can think of the Section Headers Table as a table of contents in a book. Without the table of contents, there aren't any chapters after all, because they're not listed anywhere. The book may have headings, but the content is not subdivided into logical chapters that can be found through the table of contents. Same goes with sections in ELF files: there can be some regions of data, but you can't tell without the "table of contents" which is the SHT.

190

answered Oct 14 '22 10:10

BarbaraKwarc

Related questions
                            
                                Why is my simple `main` program's ELF header say it's a `DYN (Shared object file)` instead of an executable? [duplicate]
                            
                                Why does Go use its own Code generator? [closed]
                            
                                How does ELF file format defines the stack?
                            
                                gcc / ld: overlapping sections (.tbss, .init_array) in statically-linked ELF binary
                            
                                What is the use of the SHT_NULL section in ELF?
                            
                                Trace32 command to read symbol contents from ELF file
                            
                                How to find load relocation for a PIE binary?
                            
                                Is there a reliable way to know what libraries could be dlopen()ed in an elf binary?
                            
                                .plt .plt.got what is different?
                            
                                How to build the elf interpreter (ld-linux.so.2/ld-2.17.so) as static library?
                            
                                Why ELF executables have a fixed load address?
                            
                                Is it possible to convert a bash script into an executable?
                            
                                How to convert Linux kernel Bin into ELF format
                            
                                libxml-ruby failed to load at x86_64
                            
                                Obtain source using debugging symbols
                            
                                What is the difference in byte code like Java bytecode and files and machine code executables like ELF?
                            
                                Canonical way to sign (and verify) an ELF file?
                            
                                Forcing a symbol to the top of a ELF file
                            
                                What is <.got> section in ELF?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

ELF files - What is a section and why do we need it?

Tags:

elf

Shmoopy

People also ask

1 Answers

BarbaraKwarc

Recent Activity

Donate For Us