I decided yesterday to learn assembly (NASM syntax) after years of C++ and Python and I'm already confused about the way to exit a program. It's mostly about ret because it's the suggested instruction on SASM IDE.
I'm speaking for main obviously. I don't care about x86 backward compatibility. Only the x64 Linux best way. I'm curious.
If you use printf
or other libc functions, it's best to ret
from main or call exit
. (Which are equivalent; main's caller will call the libc exit
function.)
If not, if you were only making other raw system calls like write
with syscall
, it's also appropriate and consistent to exit that way, but either way, or call exit
are 100% fine in main.
If you want to work without libc at all, e.g. put your code under _start:
instead of main:
and link with ld
or gcc -static -nostdlib
, then you can't use ret
. Use mov eax, 231
(__NR_exit_group) / syscall
.
main
is a real & normal function like any other (called with a valid return address), but _start
(the process entry point) isn't. On entry to _start
, the stack holds argc
and argv
, so trying to ret
would set RIP=argc, and then code-fetch would segfault on that unmapped address. Nasm segmentation fault on RET in _start
Exiting via a system call is like calling _exit()
in C - skip atexit()
and libc cleanup, notably not flushing any buffered stdout output (line buffered on a terminal, full-buffered otherwise).
This leads to symptoms such as Using printf in assembly leads to empty output when piping, but works on the terminal (or if your output doesn't end with \n
, even on a terminal.)
main
is a function, called (indirectly) from CRT startup code. (Assuming you link your program normally, like you would a C program.) Your hand-written main works exactly like a compiler-generate C main
function would. Its caller (__libc_start_main
) really does do something like int result = main(argc, argv); exit(result);
,
e.g. call rax
(pointer passed by _start
) / mov edi, eax
/ call exit
.
So returning from main is exactly1 like calling exit
.
Syscall implementation of exit() for a comparison of the relevant C functions, exit
vs. _exit
vs. exit_group
and the underlying asm system calls.
C question: What is the difference between exit and return? is primarily about exit()
vs. return
, although there is mention of calling _exit()
directly, i.e. just making a system call. It's applicable because C main compiles to an asm main just like you'd write by hand.
Footnote 1: You can invent a hypothetical intentionally weird case where it's different. e.g. you used stack space in main
as your stdio buffer with sub rsp, 1024
/ mov rsi, rsp
/ ... / call setvbuf
. Then returning from main would involve putting RSP above that buffer, and __libc_start_main's call to exit could overwrite some of that buffer with return addresses and locals before execution reached the fflush cleanup. This mistake is more obvious in asm than C because you need leave
or mov rsp, rbp
or add rsp, 1024
or something to point RSP at your return address.
In C++, return from main runs destructors for its locals (before global/static exit stuff), exit
doesn't. But that just means the compiler makes asm that does more stuff before actually running the ret
, so it's all manual in asm, like in C.
The other difference is of course the asm / calling-convention details: exit status in EAX (return value) or EDI (first arg), and of course to ret
you have to have RSP pointing at your return address, like it was on function entry. With call exit
you don't, and you can even do a conditional tailcall of exit like jne exit
. Since it's a noreturn function, you don't really need RSP pointing at a valid return address. (RSP should be aligned by 16 before a call, though, or RSP%16 = 8 before a tailcall, matching the alignment after call pushes a return address. It's unlikely that exit / fflush cleanup will do any alignment-required stores/loads to the stack, but it's a good habit to get this right.)
(This whole footnote is about ret
vs. call exit
, not syscall
, so it's a bit of a tangent from the rest of the answer. You can also run syscall
without caring where the stack-pointer points.)
SYS_exit
vs. SYS_exit_group
raw system callsThe raw SYS_exit
system call is for exiting the current thread, like pthread_exit()
.
(eax=60 / syscall
, or eax=1 / int 0x80
).
SYS_exit_group
is for exiting the whole program, like _exit
.
(eax=231 / syscall
, or eax=252 / int 0x80
).
In a single-threaded program you can use either, but conceptually exit_group makes more sense to me if you're going to use raw system calls. glibc's _exit()
wrapper function actually uses the exit_group
system call (since glibc 2.3). See Syscall implementation of exit() for more details.
However, nearly all the hand-written asm you'll ever see uses SYS_exit
1. It's not "wrong", and SYS_exit
is perfectly acceptable for a program that didn't start more threads. Especially if you're trying to save code size with xor eax,eax
/ inc eax
(3 bytes in 32-bit mode) or push 60
/ pop rax
(3 bytes in 64-bit mode), while push 231
/pop rax
would be even larger than mov eax,231
because it doesn't fit in a signed imm8.
Note 1: (Usually actually hard-coding the number, not using __NR_
... constants from asm/unistd.h
or their SYS_
... names from sys/syscall.h
)
And historically, it's all there was. Note that in unistd_32.h, __NR_exit
has call number 1, but __NR_exit_group
= 252 wasn't added until years later when the kernel gained support for tasks that share virtual address space with their parent, aka threads started by clone(2)
. This is when SYS_exit
conceptually became "exit current thread". (But one could easily and convincingly argue that in a single-threaded program, SYS_exit
does still mean exit the whole program, because it only differs from exit_group
if there are multiple threads.)
To be honest, I've never used eax=252 / int 0x80 in anything, only ever eax=1. It's only in 64-bit code where I often use mov eax,231
instead of mov eax,60
because neither number is "simple" or memorable the way 1 is, so might as well be a cool guy and use the "modern" exit_group
way in my single-threaded toy program / experiment / microbenchmark / SO answer. :P (If I didn't enjoy tilting at windmills, I wouldn't spend so much time on assembly, especially on SO.)
And BTW, I usually use NASM for one-off experiments so it's inconvenient to use pre-defined symbolic constants for call numbers; with GCC to preprocess a .S
before running GAS you can make your code self-documenting with #include <sys/syscall.h>
so you can use mov $SYS_exit_group, %eax
(or $__NR_exit_group
), or mov eax, __NR_exit_group
with .intel_syntax noprefix
.
int 0x80
ABI in 64-bit code:What happens if you use the 32-bit int 0x80 Linux ABI in 64-bit code? explains what happens if you use the COMPAT_IA32_EMULATION int 0x80
ABI in 64-bit code.
It's totally fine for just exiting, as long as your kernel has that support compiled in, otherwise it will segfault just like any other random int number like int 0x7f
. (e.g. on WSL1, or people that built custom kernels and disabled that support.)
But the only reason you'd do it that way in asm would be so you could build the same source file with nasm -felf32
or nasm -felf64
. (You can't use syscall
in 32-bit code, except on some AMD CPUs which have a 32-bit version of syscall
. And the 32-bit ABI uses different call numbers anyway so this wouldn't let the same source be useful for both modes.)
Related:
ret
from _start
call exit
vs. mov eax,60
/syscall
(_exit) vs. mov eax,231
/syscall
(exit_group).call exit
or call puts
won't link with nasm -felf64 foo.asm
&& gcc foo.o
.starti
(stop at the process entry point, e.g. in the dynamic linker's _start
) and stepi
until you get to your own _start
or main
.If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!
Donate Us With