Interpreting this line in Assembly language?

Below are the first few lines of a disassembled C program that I am trying to reverse engineer back into its C code, for the purpose of learning assembly language better. At the beginning of this code I see it makes room on the stack and immediately executes

0x000000000040054e <+8>:    mov    %fs:0x28,%rax

I am confused about what this line does, and what in the corresponding C program might cause it. The only time I have seen this line so far is when a different function within a C program is called, but this time it is not followed by any callq instructions, so I am not so sure... Any ideas what else in this C program could produce it?

0x0000000000400546 <+0>:    push   %rbp
0x0000000000400547 <+1>:    mov    %rsp,%rbp   
0x000000000040054a <+4>:    sub    $0x40,%rsp
0x000000000040054e <+8>:    mov    %fs:0x28,%rax
0x0000000000400557 <+17>:   mov    %rax,-0x8(%rbp)
0x000000000040055b <+21>:   xor    %eax,%eax
0x000000000040055d <+23>:   movl   $0x17,-0x30(%rbp)
...

I know this is to provide some form of stack protection against buffer overflow attacks. I just need to know what C code would prompt this protection, if not a call to a separate function.

314

asked May 10 '18 17:05

Anon.

1 Answer

As you say, this is code used to defend against buffer overflows. The compiler generates this "stack canary check" for functions that have local variables that might be buffers that could be overflowed. Note the instructions immediately above and below the line you are asking about:

sub  $0x40, %rsp
mov  %fs:0x28, %rax
mov  %rax, -0x8(%rbp)
xor  %eax, %eax

The sub allocates 64 bytes of space on the stack, which is enough room for at least one small array. Then a secret value is copied from %fs:0x28 to the top of that space, just below the previous frame pointer and the return address, and then it is erased from the register: xor %eax,%eax zeroes all of %rax, because writing a 32-bit register clears the upper half of the 64-bit one.
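As a rough illustration of the kind of source that gets this prologue, here is a hypothetical function with a local array whose first element is 23 (0x17), which would be consistent with the movl $0x17,-0x30(%rbp) in your listing. The exact source cannot be recovered from a handful of instructions, so the name sum_table, the array length, and every value other than 23 are guesses on my part.

```c
#include <stddef.h>

/* Hypothetical reconstruction: the name, the array length, and all
 * values except 23 (0x17) are assumptions, not taken from the
 * disassembly. */
int sum_table(void)
{
    /* A local array is the sort of object that can make the compiler
     * emit the canary store in the prologue. */
    int table[8] = {23, 1, 2, 3, 4, 5, 6, 7};
    int total = 0;

    for (size_t i = 0; i < sizeof table / sizeof table[0]; i++)
        total += table[i];

    return total;
}
```

If you compile something like this without optimization and with stack protection enabled, then disassemble it, you should see a prologue of the same shape, although the exact offsets depend on the compiler version and options.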

The body of the function does something with arrays; if it writes sufficiently far past the end of an array, it will overwrite the secret value. At the end of the function, there will be code along the lines of

    mov    -0x8(%rbp), %rax
    xor    %fs:0x28, %rax
    jne    1f
    mov    %rbp, %rsp
    pop    %rbp
    ret
1:
    call    __stack_chk_fail   # does not return

This verifies that the secret value is unchanged, and crashes the program if it has changed. The idea is that someone trying to exploit a simple buffer overflow vulnerability, like you have when you use gets, won't be able to change the return address without also modifying the secret value.

The compiler has several different heuristics, selectable with command line options, for deciding when it is necessary to generate stack-canary protection code.
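To make those heuristics a bit more concrete, here is a small sketch with three hypothetical functions (add, third_square, and copied_length are names I made up). The comments describe which GCC options I would expect to instrument each one, based on the GCC documentation for -fstack-protector, -fstack-protector-strong, and -fstack-protector-all. Treat them as expectations for an unoptimized build rather than guarantees, since the optimizer can remove or inline things.

```c
#include <stdio.h>
#include <string.h>

/* No arrays and no address-taken locals: expected to get a canary
 * only under -fstack-protector-all. */
int add(int a, int b)
{
    return a + b;
}

/* A local array that is not a character buffer: expected to get a
 * canary under -fstack-protector-strong and -fstack-protector-all. */
int third_square(void)
{
    int squares[4];

    for (int i = 0; i < 4; i++)
        squares[i] = i * i;

    return squares[3];
}

/* A local char buffer of 8 bytes or more: expected to get a canary
 * under -fstack-protector, -fstack-protector-strong, and
 * -fstack-protector-all. */
size_t copied_length(const char *src)
{
    char buf[16];

    /* snprintf truncates to fit and always NUL-terminates. */
    snprintf(buf, sizeof buf, "%s", src);
    return strlen(buf);
}
```

Compiling the same file with -fno-stack-protector should remove the %fs:0x28 accesses from all three, which is a quick way to confirm that the line you are asking about comes from the compiler and not from anything written in the source.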

You can't write ordinary C code that produces this instruction yourself, because it uses the unusual %fs:nnnn segment-relative addressing mode. On x86-64 Linux, %fs points at per-thread data maintained by the C library, and the canary is kept there rather than in memory that the program's own variables occupy, to make it as difficult as possible for the adversary to learn the secret value.

166

answered Oct 16 '22 16:10

zwol


Tags: c, x86, gcc, assembly, reverse-engineering, intel
