Position independent code, shared libraries and code veneers - getting them to work together

I'm developing for an embedded platform and I'm having a hard time working out how to link shared libraries dynamically. I'm using the bFLT file format and I don't have control over where the executable and shared library are loaded.

My loader correctly loads the shared library and executable into memory and modifies the executable's GOT at run time to link to the shared library.

I can successfully take the address of the function and I know it's correct from disassembling the code at that location. However, if I try to call the function, the whole thing crashes.

It turns out that GCC adds a 'code veneer' when calling shared library functions: the call takes a detour through the veneer instead of branching directly to the address of the function. The address that the code veneer branches to isn't relocated properly because it doesn't show up in the list of relocations in the executable binary.
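To illustrate why a missing relocation entry is fatal, here is a minimal C sketch of what a loader's fixup pass might look like. The function name `apply_relocs`, the flat array of offsets, and the "add the load base to each listed word" rule are assumptions for illustration only, not the actual bFLT loader code.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <string.h>

/* Hypothetical fixup pass: add load_base to every 32-bit word whose
 * offset appears in relocs[]. Words that are not listed are untouched. */
static void apply_relocs(uint8_t *image, size_t image_size,
                         const uint32_t *relocs, size_t nrelocs,
                         uint32_t load_base)
{
    assert(image != NULL && (relocs != NULL || nrelocs == 0));

    for (size_t i = 0; i < nrelocs; i++) {
        uint32_t off = relocs[i];
        uint32_t word;

        /* Skip entries that would read or write past the image. */
        if (image_size < sizeof word || off > image_size - sizeof word)
            continue;

        /* memcpy avoids unaligned access and aliasing problems. */
        memcpy(&word, image + off, sizeof word);
        word += load_base;
        memcpy(image + off, &word, sizeof word);
    }
}
```

Under this model, an address word whose offset is absent from the list, such as the literal that follows the veneer, keeps its link-time value, and a branch through it lands in the wrong place.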

The disassembly of the veneer looks like this:

000008d0 <__library_call_veneer>:
 8d0:   e51ff004    ldr pc, [pc, #-4]   ; 8d4 <__library_call_veneer+0x4>
 8d4:   03000320    .word   0x03000320  ; This address isn't correctly relocated!
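
The veneer's load is PC-relative, which is why the literal sits right after the instruction. As a small sketch of the arithmetic (the helper name `arm_ldr_literal_addr` is made up for illustration), in ARM state the PC reads as the instruction's address plus 8, so `ldr pc, [pc, #-4]` at 0x8d0 loads from 0x8d0 + 8 - 4 = 0x8d4:

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical helper: compute the address read by an ARM-state
 * "ldr Rd, [pc, #+/-imm12]" instruction located at insn_addr. */
static uint32_t arm_ldr_literal_addr(uint32_t insn_addr, uint32_t insn)
{
    /* Only meaningful when the base register (bits 19:16) is the PC. */
    assert(((insn >> 16) & 0xFu) == 15u);

    uint32_t imm12 = insn & 0xFFFu;        /* 12-bit immediate offset */
    uint32_t add   = (insn >> 23) & 1u;    /* U bit: 1 = add, 0 = subtract */
    uint32_t pc    = insn_addr + 8u;       /* PC reads two instructions ahead */

    return add ? pc + imm12 : pc - imm12;
}
```

Calling `arm_ldr_literal_addr(0x8d0, 0xe51ff004)` gives 0x8d4, the word that a loader would need to relocate.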

If I take the address of the function and put it into a function pointer (thereby bypassing the code veneer) and call through it, the shared library works perfectly.

So for example:

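/* Call x through a volatile function pointer so the call is indirect
   and never goes through the veneer. */
#define DIRECT_LIB_CALL(x, args...) do { \
        typeof(x) * volatile tmp = x; \
        tmp(args); \
    } while (0)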

DIRECT_LIB_CALL(library_call); /* works */
library_call(); /* crashes */

Is there a way to either tell GCC not to produce a code veneer and to branch directly to the address located in the GOT, or to make the address that the code veneer branches to show up in the list of relocations to perform?

asked Apr 03 '12 04:04

tangrs

1 Answer

I found a workaround to this problem. It's not the best or cleanest method but it does the job in my case.

I took advantage of my linker's --wrap option, which redirects references to a symbol to __wrap_symbol. With this, I set up an awk script that automatically generates ASM files, each of which loads a properly relocated address into the pc. Any library call is redirected to this code. Basically, I made my own code veneers. Since the generated code veneer was no longer referenced, it simply got optimized away.

Additionally, I had to place my veneers in the .data section, since anything in the .text section was not relocated correctly. Since the platform I'm working on doesn't differentiate much between code and data, this hacky workaround works.
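As a rough sketch of the idea (not the actual awk script from the project), a generator for such a hand-rolled veneer might look like the C function below. The function name `emit_veneer`, the exact directives, and the use of `__real_<symbol>` for the literal are assumptions. GNU ld resolves `__real_symbol` to the original `symbol` when `--wrap=symbol` is given, but whether the resulting word gets a usable relocation depends on the toolchain and loader.

```c
#include <assert.h>
#include <stdio.h>
#include <string.h>

/* Hypothetical generator: write an assembly veneer for `sym` into buf.
 * Returns snprintf's result (the length needed, excluding the NUL). */
static int emit_veneer(char *buf, size_t size, const char *sym)
{
    assert(buf != NULL && sym != NULL && strlen(sym) > 0);

    return snprintf(buf, size,
        "\t.section .data\n"          /* .data, because .text was not relocated */
        "\t.arm\n"
        "\t.align 2\n"
        "\t.global __wrap_%s\n"
        "__wrap_%s:\n"
        "\tldr pc, [pc, #-4]\n"       /* load the word that follows into pc */
        "\t.word __real_%s\n",        /* address the loader must relocate */
        sym, sym, sym);
}
```

The output for each library function would then be assembled and linked with something like `-Wl,--wrap=library_call`, so that calls to `library_call` land on `__wrap_library_call`.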

Here's a link to the project I'm working on where you can look up the specifics.

135

answered Sep 28 '22 06:09

tangrs

Tags: c, shared-libraries, arm