I'm expermenting with function pointers on Linux and trying to execute this C program: <pre class="prettyprint"><code>#include <stdio.h> #include <string.h> int myfun() { return 42; } int main() { char data[500]; memcpy(data, myfun, sizeof(data)); int (*fun_pointer)() = (void*)data; printf("%d\n", fun_pointer()); return 0; } </code></pre> Unfortunately it segfaults on <code>fun_pointer()</code> call. I suspect that it is connected with some memory flags, but I don't found information about it. Could you explain why this code segfaults? Don't see to the fixed <code>data</code> array size, it is ok and copying without calling the function is successfull. UPD: Finally I've found that the memory segment should be marked as executable using mprotect system call called with <code>PROT_EXEC</code> flag. Moreover the memory segment should be returned by mmap function as stated in the POSIX specification. There is the same code that uses allocated by <code>mmap</code> memory with <code>PROT_EXEC</code> flag (and works): <pre class="prettyprint"><code>#include <stdio.h> #include <string.h> #include <sys/mman.h> int myfun() { return 42; } int main() { size_t size = (char*)main - (char*)myfun; char *data = mmap(NULL, size, PROT_EXEC | PROT_READ | PROT_WRITE, MAP_PRIVATE | MAP_ANONYMOUS, 0, 0); memcpy(data, myfun, size); int (*fun_pointer)() = (void*)data; printf("%d\n", fun_pointer()); munmap(data, size); return 0; } </code></pre> This example should be complied with <code>-fPIC</code> gcc option to ensure that the code in functions is position-independent.

Several problems there: <ul> <li>Your <code>data</code> array stays in data segment, not in code segment.</li> <li>The address relocation is not handled.</li> <li>The code size is not known, just guessed.</li> </ul>

In addition to Diask's answer you probably want to use some JIT compilation techniques (to generate executable code in memory), and you should be sure that the memory zone containing the code is executable (see mprotect(2) and the NX bit; often the call stack is not executable for security reasons). You could use GNU lightning (quickly emitting slow machine code), asmjit, libjit, LLVM, GCCJIT (able to slowly emit fast optimized machine code). You could also emit some C code in some temporary file <code>/tmp/emittedcode.c</code>, fork a compilation command <code>gcc -Wall -O -fPIC -shared /tmp/emittedcode.c -o /tmp/emittedcode.so</code> then dlopen(3) that shared object <code>/tmp/emittedcode.so</code> and use dlsym(3) to find function pointers by their name there. See also this, this, this, this and that answers. Read about trampoline code, closures, and continuations & CPS. Of course, copying code from one zone to another usually don't work (it has to be position independent code to make that work, or you need your own relocation machinery, a bit like a linker does).

Linux: executing code that is loaded to memory manually

Tags:

c

linux

I'm expermenting with function pointers on Linux and trying to execute this C program:

#include <stdio.h>
#include <string.h>

int myfun() 
{
    return 42;
}

int main()
{
    char data[500];
    memcpy(data, myfun, sizeof(data));
    int (*fun_pointer)() = (void*)data;
    printf("%d\n", fun_pointer());

    return 0;
}

Unfortunately it segfaults on fun_pointer() call. I suspect that it is connected with some memory flags, but I don't found information about it.

Could you explain why this code segfaults? Don't see to the fixed data array size, it is ok and copying without calling the function is successfull.

UPD: Finally I've found that the memory segment should be marked as executable using mprotect system call called with PROT_EXEC flag. Moreover the memory segment should be returned by mmap function as stated in the POSIX specification. There is the same code that uses allocated by mmap memory with PROT_EXEC flag (and works):

#include <stdio.h>
#include <string.h>
#include <sys/mman.h>

int myfun() 
{
    return 42;
}

int main()
{
    size_t size = (char*)main - (char*)myfun;
    char *data = mmap(NULL, size, PROT_EXEC | PROT_READ | PROT_WRITE,
        MAP_PRIVATE | MAP_ANONYMOUS, 0, 0);
    memcpy(data, myfun, size);

    int (*fun_pointer)() = (void*)data;
    printf("%d\n", fun_pointer());

    munmap(data, size);
    return 0;
}

This example should be complied with -fPIC gcc option to ensure that the code in functions is position-independent.

278

asked Dec 28 '15 08:12

Alexander Rodin

2 Answers

Several problems there:

Your data array stays in data segment, not in code segment.
The address relocation is not handled.
The code size is not known, just guessed.

141

answered Oct 15 '22 16:10

dlask

In addition to Diask's answer you probably want to use some JIT compilation techniques (to generate executable code in memory), and you should be sure that the memory zone containing the code is executable (see mprotect(2) and the NX bit; often the call stack is not executable for security reasons). You could use GNU lightning (quickly emitting slow machine code), asmjit, libjit, LLVM, GCCJIT (able to slowly emit fast optimized machine code). You could also emit some C code in some temporary file /tmp/emittedcode.c, fork a compilation command gcc -Wall -O -fPIC -shared /tmp/emittedcode.c -o /tmp/emittedcode.so then dlopen(3) that shared object /tmp/emittedcode.so and use dlsym(3) to find function pointers by their name there.

See also this, this, this, this and that answers. Read about trampoline code, closures, and continuations & CPS.

Of course, copying code from one zone to another usually don't work (it has to be position independent code to make that work, or you need your own relocation machinery, a bit like a linker does).

answered Oct 15 '22 17:10

Basile Starynkevitch

Related questions
                            
                                Static vs. Malloc
                            
                                Check if two "simple" 'if statements' in C are equivalent
                            
                                How do I unpack and extract data properly using msgpack-c?
                            
                                what is "hanging reference" & "general protection fault"?
                            
                                Is there a way to redirect syslog messages to stdout?
                            
                                what is __dirstream, where can we find the definition
                            
                                "...redeclared as different kind of symbol"?
                            
                                How to efficiently write a large sequence of NULL bytes in a file?
                            
                                How to re-use C structs in ARM assembly in a maintainable and readable way?
                            
                                Optimize code generated by sympy
                            
                                How to get rid of awful useless macros
                            
                                Is a goto in alloca's function scope valid?
                            
                                How to only accept a certain precision (so many decimals places) in scanf?
                            
                                Calls that precede a function's definition cannot be inlined?
                            
                                How to change interpreter path and pass command line arguments to an "executable" shared library on Linux?
                            
                                Division of very big numbers using arrays in C
                            
                                SIZE command in UNIX
                            
                                Creating a DER formatted ECDSA signature from raw r and s
                            
                                Using increment in ternary operator in C
                            
                                Processes resources not limited by setrlimit

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With