I have some example code here which I'm using to understand some C behaviour for a beginner's CTF: <pre class="prettyprint"><code>// example.c #include <stdio.h> void main() { void (*print)(); print = getenv("EGG"); print(); } </code></pre> Compile: <code>gcc -z execstack -g -m32 -o example example.c</code> Usage: <code>EGG=$(echo -ne '\x90\xc3) ./example</code> If I compile the code with the <code>execstack</code> flag, the program will execute the opcodes I've injected above. Without the flag, the program will crash due to a segmentation fault. Why exactly is this? Is it because <code>getenv</code> is storing the actual opcodes on the stack, and the execstack flag allows jumps to the stack? Or does <code>getenv</code> push a pointer onto the stack, and there are some other rules about what sections of memory are executable? I read the manpage, but I couldn't work out exactly what the rules are and how they're enforced. Another issue is I think I'm also really lacking a good tool to visualise memory whilst debugging, so its hard to figure this out. Any advice would be really appreciated.

<code>getenv</code> doesn't store the env var's value on the stack. It's already on the stack from process startup, and <code>getenv</code> obtains a pointer to it. See the i386 System V ABI's description of where argv[] and envp[] are located at process startup: above <code>[esp]</code>. <code>_start</code> doesn't copy them before calling <code>main</code>, just calculates pointers to them to pass as args to <code>main</code>. (Links to the latest version at https://github.com/hjl-tools/x86-psABI/wiki/X86-psABI, where the official current version is maintained.) <hr> Your code is casting a pointer to stack memory (containing the value of an env var) into a function pointer and calling through it. Look at the compiler-generated asm (e.g. on https://godbolt.org/): it'll be something like <code>call getenv</code> / <code>call eax</code>. <code>-zexecstack</code> in your kernel version1 makes all your pages executable, not just the stack. It also applies to <code>.data</code>, <code>.bss</code>, and <code>.rodata</code> sections, and memory allocated with <code>malloc</code> / <code>new</code>. The exact mechanism on GNU/Linux was a "read-implies-exec" process-wide flag that affects all future allocations, including manual use of <code>mmap</code>. See Unexpected exec permission from mmap when assembly files included in the project for more about the <code>GNU_STACK</code> ELF header stuff. Footnote 1: Linux after 5.4 or so only makes the stack itself executable, not READ_IMPLIES_EXEC: Linux default behavior of executable .data section changed between 5.4 and 5.9? Fun fact: taking the address of a nested function that accesses its parents local variables gets gcc to enable <code>-zexecstack</code>. It stores code for an executable "trampoline" onto the stack that passes a "static chain" pointer to the actual nested function, allowing it to reference its parent's stack-frame. <hr> If you wanted to exec data as code without <code>-zexecstack</code>, you'd use <code>mprotect(PROT_EXEC|PROT_READ|PROT_WRITE)</code> on the page containing that env var. (It's part of your stack so you shouldn't remove write permission; it could be in the same page as main's stack frame for example.) <hr> Related: With GNU/Linux <code>ld</code> from binutils before late 2018 or so, the <code>.rodata</code> section is linked into the same ELF segment as the <code>.text</code> section, and thus <code>const char code[] = {0xc3}</code> or string literals are executable. Current <code>ld</code> gives <code>.rodata</code> its own segment that's mapped read without exec, so finding ROP / Spectre "gadgets" in read-only data is no longer possible, unless you use <code>-zexecstack</code>. And even that doesn't work on current kernels; <code>char code[] = ...;</code> as a local inside a function will put data on the stack where it's actually executable. See How to get c code to execute hex machine code? for details.

Exactly what cases does the gcc execstack flag allow and how does it enforce it?

Tags:

c

linux

x86

gcc

shellcode

I have some example code here which I'm using to understand some C behaviour for a beginner's CTF:

// example.c

#include <stdio.h>


void main() {
        void (*print)();

        print = getenv("EGG");
        print();
}

Compile: gcc -z execstack -g -m32 -o example example.c

Usage: EGG=$(echo -ne '\x90\xc3) ./example

If I compile the code with the execstack flag, the program will execute the opcodes I've injected above. Without the flag, the program will crash due to a segmentation fault.

Why exactly is this? Is it because getenv is storing the actual opcodes on the stack, and the execstack flag allows jumps to the stack? Or does getenv push a pointer onto the stack, and there are some other rules about what sections of memory are executable? I read the manpage, but I couldn't work out exactly what the rules are and how they're enforced.

Another issue is I think I'm also really lacking a good tool to visualise memory whilst debugging, so its hard to figure this out. Any advice would be really appreciated.

801

asked Nov 16 '18 22:11

Isaac

1 Answers

getenv doesn't store the env var's value on the stack. It's already on the stack from process startup, and getenv obtains a pointer to it.

See the i386 System V ABI's description of where argv[] and envp[] are located at process startup: above [esp].

_start doesn't copy them before calling main, just calculates pointers to them to pass as args to main. (Links to the latest version at https://github.com/hjl-tools/x86-psABI/wiki/X86-psABI, where the official current version is maintained.)

Your code is casting a pointer to stack memory (containing the value of an env var) into a function pointer and calling through it. Look at the compiler-generated asm (e.g. on https://godbolt.org/): it'll be something like call getenv / call eax.

-zexecstack in your kernel version¹ makes all your pages executable, not just the stack. It also applies to .data, .bss, and .rodata sections, and memory allocated with malloc / new.

The exact mechanism on GNU/Linux was a "read-implies-exec" process-wide flag that affects all future allocations, including manual use of mmap. See Unexpected exec permission from mmap when assembly files included in the project for more about the GNU_STACK ELF header stuff.

Footnote 1: Linux after 5.4 or so only makes the stack itself executable, not READ_IMPLIES_EXEC: Linux default behavior of executable .data section changed between 5.4 and 5.9?

Fun fact: taking the address of a nested function that accesses its parents local variables gets gcc to enable -zexecstack. It stores code for an executable "trampoline" onto the stack that passes a "static chain" pointer to the actual nested function, allowing it to reference its parent's stack-frame.

If you wanted to exec data as code without -zexecstack, you'd use mprotect(PROT_EXEC|PROT_READ|PROT_WRITE) on the page containing that env var. (It's part of your stack so you shouldn't remove write permission; it could be in the same page as main's stack frame for example.)

With GNU/Linux ld from binutils before late 2018 or so, the .rodata section is linked into the same ELF segment as the .text section, and thus const char code[] = {0xc3} or string literals are executable.

Current ld gives .rodata its own segment that's mapped read without exec, so finding ROP / Spectre "gadgets" in read-only data is no longer possible, unless you use -zexecstack. And even that doesn't work on current kernels; char code[] = ...; as a local inside a function will put data on the stack where it's actually executable. See How to get c code to execute hex machine code? for details.

189

answered Oct 13 '22 01:10

Peter Cordes

Related questions
                            
                                Can I use a function typedef in function definitions?
                            
                                Why is Python's weekday() different from tm_wday in C?
                            
                                Assign to array in struct in c
                            
                                Why is flattening a multidimensional array in C illegal? [duplicate]
                            
                                Cryptic struct definition in C
                            
                                how to turn on icc/icpc warnings?
                            
                                In x86, why do I have the same instruction two times, with reversed operands?
                            
                                Any Faster RMS Value Calculation in C?
                            
                                How to pass an array of Swift strings to a C function taking a char ** parameter
                            
                                HOST_NAME_MAX undefined after include <limits.h>
                            
                                Assembler debug of undefined expression
                            
                                What is "allocation context"?
                            
                                Compiler optimizations and temporary assignments in C and C++
                            
                                What exactly does the C Structure Dot Operator Do (Lower Level Perspective)?
                            
                                What is the purpose of 61 in tm_sec field from the tm structure
                            
                                Passing a Rust variable to a C function that expects to be able to modify it
                            
                                Why is this nested macro replacement failing?
                            
                                Why does popen() invoke a shell to execute a process?
                            
                                C function call without bracket
                            
                                Handle C typedef on different platform using NativeCall

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With