How expensive are kernel context switches compared to userspace context switches?

Tags:

According to C10k and this paper, throughput of 1-thread-per-connection servers degrade as more and more clients connect and more and more threads are created. According to those two sources, this is because the more threads exist, the more time is spent on context switching compared to actual work done by those threads. Evented servers don't seem to suffer as much from performance degredation at high connection counts.

However, evented servers also do context switches between clients, they just do it in userspace.

Why are these userspace context switches faster than kernel thread context switches?
What exactly does a kernel context switch do that's so much more expensive?
How expensive is a kernel context switch exactly? How much time does it take?
Does kernel context switching time depend on the number of threads?

I'm mostly interested in how the Linux kernel handles context switching but information about other OSes is welcome too.

304

asked Aug 07 '11 12:08

Hongli

1 Answers

Why are these userspace context switches faster than kernel thread context switches?

Because the CPU does not need to switch to kernel mode and back to user mode.

What exactly does a kernel context switch do that's so much more expensive?

Mostly the switch to kernel mode. IIRC, the page tables are the same in kernel mode and user mode in Linux, so at least there is no TLB invalidation penalty.

How expensive is a kernel context switch exactly? How much time does it take?

Needs to be measured and can vary from machine to machine. I guess that a typical desktop/server machine these days can do a few hundred thousands of context switches per second, probably a few million.

Does kernel context switching time depend on the number of threads?

Depends on how the kernel scheduler handles this. AFAIK, in Linux it is pretty efficient, even with large thread counts, but more threads means more memory usage means more cache pressure and thus likely lower performance. I also expect some overhead involved in the handling of thousands of sockets.

104

answered Sep 27 '22 17:09

Ringding

Related questions
                            
                                Howto multithreaded jython scripts running from java?
                            
                                Java interrupt thread when reading socket [duplicate]
                            
                                Clipboard monitoring on Mac OS X | Java
                            
                                Alternative way to threads under Android
                            
                                How to Keep Listener Thread Alive
                            
                                How do I detect multi-threaded use?
                            
                                Thread as a GC root
                            
                                stdin, stdout and stderr are shared between?
                            
                                Android right approach : where JSON response should be parsed - in UI thread, or in another one?
                            
                                C++11 std::thread and virtual function binding
                            
                                Use of std::memory_order_consume in the Folly's lock free SPSC queue
                            
                                C++: Terminate called without an active exception (GCC)
                            
                                Web Api Controller and Thread Pool
                            
                                HttpClient and stream issues
                            
                                Modify server's variable from client's thread (threading, python)
                            
                                ExecutorService: how to prevent thread starvation when synchronization barriers are done in the threads
                            
                                Why does multiprocessing.Lock() not lock shared resource in Python?
                            
                                Interleaved parallel file read slower than sequential read?
                            
                                .Net Timeouts: WaitForSingleObject vs Timer
                            
                                How to use HttpClient with multithreaded operation?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How expensive are kernel context switches compared to userspace context switches?

Tags:

multithreading

kernel

Hongli

People also ask

1 Answers

Ringding

Recent Activity

Donate For Us