I have just browsed the Linux kernel source tree and read the file tools/include/nolibc/nolibc.h. I saw the <code>syscall</code> in this file uses <code>%r8</code>, <code>%r9</code> and <code>%r10</code> in the clobber list. Also there is a comment that says: <blockquote> rcx and r8..r11 may be clobbered, others are preserved. </blockquote> As far as I know, <code>syscall</code> only clobbers <code>%rax</code>, <code>%rcx</code> and <code>%r11</code> (and memory). Is there a real example of <code>syscall</code> that clobbers <code>%r8</code>, <code>%r9</code> and <code>%r10</code>?

According to x86-64 ABI about syscall section A.2 AMD64 Linux Kernel Conventions, A.2.1 Calling Conventions [1]: <blockquote> <ol> <li> User-level applications use as integer registers for passing the sequence <code>%rdi</code>, <code>%rsi</code>, <code>%rdx</code>, <code>%rcx</code>, <code>%r8</code> and <code>%r9</code>. The kernel interface uses <code>%rdi</code>, <code>%rsi</code>, <code>%rdx</code>, <code>%r10</code>, <code>%r8</code> and <code>%r9</code>. </li> <li> A system-call is done via the <code>syscall</code> instruction. The kernel destroys registers <code>%rcx</code> and <code>%r11</code>. </li> <li> The number of the <code>syscall</code> has to be passed in register <code>%rax</code>. </li> <li> System-calls are limited to six arguments, no argument is passed directly on the stack. </li> <li> Returning from the <code>syscall</code>, register <code>%rax</code> contains the result of the system-call. A value in the range between -4095 and -1 indicates an error, it is -errno. </li> <li> Only values of class INTEGER or class MEMORY are passed to the kernel. </li> </ol> </blockquote> From (2), (5) and (6), we can conclude that Linux x86-64 syscall clobbers <code>%rax</code>, <code>%rcx</code> and <code>%r11</code> (and <code>"memory"</code>). Link: https://gitlab.com/x86-psABIs/x86-64-ABI/-/wikis/x86-64-psABI [1]

When does Linux x86-64 syscall clobber %r8, %r9 and %r10?

2 Answers

Only 32-bit system calls (e.g. via int 0x80) in 64-bit mode step on those registers, along with R11. (What happens if you use the 32-bit int 0x80 Linux ABI in 64-bit code?).

syscall properly saves/restores all regs including R8, R9, and R10, so user-space using it can assume they keep their values, except the RAX return value. (The kernel's syscall entry point even saves RCX and R11, but at that point they've already been overwritten by the syscall instruction itself with the original RIP and before-masking RFLAGS value.)

Those, with R11, are the non-legacy registers that are call-clobbered in the function-calling convention, so compiler-generated code for C functions inside the kernel naturally preserves R12-R15, even if an asm entry point didn't save them.

Currently the 64-bit int 0x80 entry point just pushes 0 for the call-clobbered R8-R11 registers in the process-state struct that it will restore from before returning to user space, instead of the original register values.

Historically, the int 0x80 entry point from 32-bit user-space didn't save/restore those registers at all. So their values were whatever compiler-generated kernel code left sitting around. This was thought to be innocent because 32-bit mode can't read those registers, until it was realized that user-space can far-jump to 64-bit mode, using the same CS value that the kernel uses for normal 64-bit user-space processes, selecting that system-wide GDT entry. So there was an actual info leak of kernel data, which was fixed by zeroing those registers.

IDK whether there used to be or still is a separate entry point from 64-bit user-space vs. 32-bit, or how they differ in struct pt_regs layout. The historical situation where int 0x80 leaked r8..r11 wouldn't have made sense for 64-bit user-space; that leak would have been obvious. So if they're unified now, they must not have been in the past.

141

answered Sep 28 '22 03:09

Peter Cordes

According to x86-64 ABI about syscall section A.2 AMD64 Linux Kernel Conventions, A.2.1 Calling Conventions [1]:

User-level applications use as integer registers for passing the sequence %rdi, %rsi, %rdx, %rcx, %r8 and %r9. The kernel interface uses %rdi, %rsi, %rdx, %r10, %r8 and %r9.

A system-call is done via the syscall instruction. The kernel destroys registers %rcx and %r11.

The number of the syscall has to be passed in register %rax.

System-calls are limited to six arguments, no argument is passed directly on the stack.

Returning from the syscall, register %rax contains the result of the system-call. A value in the range between -4095 and -1 indicates an error, it is -errno.

Only values of class INTEGER or class MEMORY are passed to the kernel.

From (2), (5) and (6), we can conclude that Linux x86-64 syscall clobbers %rax, %rcx and %r11 (and "memory").

Link: https://gitlab.com/x86-psABIs/x86-64-ABI/-/wikis/x86-64-psABI [1]

answered Sep 28 '22 05:09

Ammar Faizi

Related questions
                            
                                ASLR Entropy Bits for Stack on Linux
                            
                                OpenJDK 64-Bit Server VM warning: ignoring option MaxPermSize=350m;
                            
                                Howto debug when nginx gives 502 bad gateway?
                            
                                Is sched_getcpu() reliable on Linux?
                            
                                why questionmark comes in the end of filename when i create .txt file through shell script? [duplicate]
                            
                                What are the consequences of changing a symbol from .globl to .weak?
                            
                                What does spawn ,expect and send command in linux/unix
                            
                                How to avoid spaces in echo when it is split into multiple lines
                            
                                Error: 'GL/glfw3.h: No such file or directory' when compiling C++ programs using OpenGL on Linux
                            
                                How to remove single quotes from file names
                            
                                C - using fork() and exec() twice
                            
                                Unable to use docker due to ZScaler and certificate issues
                            
                                How to move images of docker in aufs directory to overlay2?
                            
                                Grep resource usage
                            
                                ffmpeg img to video = Could find no file with path
                            
                                How do I change the permissions in openshift container platform?
                            
                                Cleaning up after QApplication
                            
                                Why is SDL so much slower on Mac than Linux?
                            
                                What does maximum resident set size mean?
                            
                                The command, pip install --upgrade pip, install all version of pip

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

When does Linux x86-64 syscall clobber %r8, %r9 and %r10?

Tags:

linux

assembly

x86-64

system-calls

Ammar Faizi

People also ask

2 Answers

Peter Cordes

Ammar Faizi

Recent Activity

Donate For Us