Why are SIMD instructions not used in kernel?

1 Answers

Saving/restoring FPU (including SIMD vector registers) state is more expensive than just integer GP register state. It's simply not worth the cost in most cases.

In Linux kernel code, all you have to do is call kernel_fpu_begin() / kernel_fpu_end() around your code. This is what the RAID drivers do. See http://yarchive.net/comp/linux/kernel_fp.html.

x86 doesn't have any future-proof way to save/restore one or a couple vector registers. (Other than manual save/restore of an xmm register using legacy SSE instructions, potentially causing SSE/AVX transition stalls on Intel CPUs if user-space had the upper halves of any ymm/zmm registers dirty).

The reason legacy SSE works is that some Windows drivers were already doing this when Intel wanted to introduce AVX, so they invented that transition-penalty stuff instead of having legacy SSE instructions zero the upper 128b of ymm registers. (See this for more detail on that design decision.) So basically we can blame Windows binary-only drivers for the SSE/AVX transition-penalty mess.

IDK about non-x86 architectures, and whether the existing SIMD instruction sets have a future-proof way to save/restore a register that will continue to work for longer vectors. ARM32 might, if extensions continue the pattern of using multiple 32-bit FP registers as single wider register. (e.g. q2 is composed of s8 through s11.) So saving/restoring a couple q registers should be future-proof, if a 256b NEON extension simply lets you use 2 q registers as one 256b register. Or if the new wider vectors are separate, and don't extend the existing registers.

101

answered Sep 22 '22 12:09

Peter Cordes

Related questions
                            
                                Why printk doesn't print message in kernel log(dmesg)
                            
                                Transition from real to protected mode in the Linux kernel
                            
                                Why does Linux favor 0x7f mappings?
                            
                                disable_local_irq and kernel timers
                            
                                Linux: How to assign USB driver to device [closed]
                            
                                Syntax to get the value of environment variable in Kconfig file
                            
                                From the kernel to the user space (DMA)
                            
                                How to use a spin lock if copy_to_user needs to be called?
                            
                                Trying to port GCC specific asm goto to Clang
                            
                                How to compile Linux kernel code on Windows?
                            
                                Linux: boot arguments with U-Boot and Flat Image Tree (FIT)
                            
                                Netfilter kernel module to intercept packets and log them
                            
                                Unusually slow TCP-connection in Linux
                            
                                Linux Kernel Invalidating TLB Entries
                            
                                Intel MSR frequency scaling per - thread
                            
                                divide by zero exception handling in Linux
                            
                                Does anyone know where to define the hardware, revision and serial no. fields of /proc/cpuinfo?
                            
                                Shutdown (embedded) linux from kernel-space
                            
                                May i know in Linux kernel what is the purpose of GFP_HARDWALL flag?
                            
                                what is the use of Flattened device tree - Linux Kernel

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Why are SIMD instructions not used in kernel?

Tags:

operating-system

linux-kernel

simd

linux-device-driver

ispc

Saksham Jain

People also ask

1 Answers

Peter Cordes

Recent Activity

Donate For Us