Linux Process States

2 Answers

When a process needs to fetch data from a disk, it effectively stops running on the CPU to let other processes run, because the operation might take a long time to complete. A seek time of at least 5 ms is common for a disk, and 5 ms is 10 million CPU cycles at 2 GHz, an eternity from the point of view of the program!
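The arithmetic behind that figure is easy to check. This is only a back-of-the-envelope sketch: the 2 GHz clock rate is an assumption chosen to match the "10 million" figure, and other clock rates scale the result linearly.

```python
# Back-of-the-envelope: how many CPU cycles fit into one disk seek?
SEEK_TIME_US = 5_000          # 5 ms seek time, expressed in microseconds
CLOCK_HZ = 2_000_000_000      # ASSUMPTION: a 2 GHz CPU clock

# Integer arithmetic keeps the result exact.
cycles_per_seek = CLOCK_HZ * SEEK_TIME_US // 1_000_000
print(f"{cycles_per_seek:,} cycles")   # prints "10,000,000 cycles"
```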

From the programmer's point of view (also called "in userspace"), this is called a blocking system call. If you call read(2) (which is a thin libc wrapper around the system call of the same name), your process does not exactly stop at that boundary; it continues, in the kernel, running the system call code. Most of the time it goes all the way down to a specific disk controller driver (filename → filesystem/VFS → block device → device driver), where a command to fetch a block on disk is submitted to the proper hardware, which is a very fast operation most of the time.

THEN the process is put in the sleep state (in kernel space, blocking is called sleeping; nothing is ever 'blocked' from the kernel's point of view). Once the hardware has finally fetched the proper data, the process is awakened, that is, marked as runnable. Eventually, the scheduler will run it again.

Finally, in userspace, the blocking system call returns with proper status and data, and the program flow goes on.
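The sequence above (call, sleep in the kernel, wake-up, return) can be observed from userspace with a small sketch. It uses a pipe and a writer thread as a stand-in for slow hardware, which is an assumption for illustration: real disk reads are often served from the page cache and return too quickly to see. The helper names `blocking_read_demo` and `slow_device` are hypothetical.

```python
import os
import threading
import time

def blocking_read_demo(delay_s=0.2):
    """Block in read() on an empty pipe until a writer thread supplies data."""
    read_fd, write_fd = os.pipe()

    def slow_device():
        # Stand-in for slow hardware: the data only arrives after delay_s.
        time.sleep(delay_s)
        os.write(write_fd, b"block of data")

    writer = threading.Thread(target=slow_device)
    start = time.monotonic()
    writer.start()
    # The calling thread sleeps inside the kernel here. It uses no CPU
    # until the pipe becomes readable and the scheduler runs it again.
    data = os.read(read_fd, 4096)
    elapsed = time.monotonic() - start
    writer.join()
    os.close(read_fd)
    os.close(write_fd)
    return data, elapsed

data, elapsed = blocking_read_demo()
print(data, f"returned after {elapsed:.2f}s")
```

Note that a read on a pipe sleeps in the ordinary interruptible state, so this illustrates blocking in general rather than the D state specifically.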

It is possible to invoke most I/O system calls in non-blocking mode (see O_NONBLOCK in open(2) and fcntl(2)). In this case, the system calls return immediately and only report that the disk operation was submitted. The programmer then has to explicitly check at a later time whether the operation completed, successfully or not, and fetch its result (e.g., with select(2)). This is called asynchronous or event-based programming.
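A minimal sketch of the non-blocking pattern follows. It uses a pipe rather than a disk file, because on Linux O_NONBLOCK has no effect on regular files; pipes, sockets and terminals are where it matters. `os.set_blocking(fd, False)` is Python's shortcut for setting O_NONBLOCK with fcntl(2).

```python
import os
import select

read_fd, write_fd = os.pipe()
os.set_blocking(read_fd, False)   # sets O_NONBLOCK on the descriptor

# Nothing has been written yet, so a non-blocking read fails at once
# with EAGAIN, which Python surfaces as BlockingIOError.
try:
    os.read(read_fd, 4096)
    would_block = False
except BlockingIOError:
    would_block = True

# select(2) with a zero timeout polls readiness without sleeping.
ready_before, _, _ = select.select([read_fd], [], [], 0)

os.write(write_fd, b"ready")

# Wait (up to 1 s) until the descriptor is readable, then fetch the result.
ready_after, _, _ = select.select([read_fd], [], [], 1.0)
result = os.read(read_fd, 4096) if ready_after else None

os.close(read_fd)
os.close(write_fd)
print(would_block, ready_before, result)
```

In an event-based program, the `select` call would sit in a loop watching many descriptors at once instead of a single pipe.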

Most answers here mentioning the D state (which is called TASK_UNINTERRUPTIBLE in the Linux state names) are incorrect. The D state is a special sleep mode which is only triggered in a kernel space code path, when that code path can't be interrupted (because it would be too complex to program), with the expectation that it would block only for a very short time. I believe that most "D states" are actually invisible; they are very short lived and can't be observed by sampling tools such as 'top'.
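To look for the D state yourself rather than rely on 'top', you can sample the state letter that procfs exposes. The sketch below assumes a Linux system with /proc mounted; `parse_state` and `process_state` are hypothetical helper names. Because it is a point-in-time sample, it will usually miss the short-lived D states described above.

```python
import os

def parse_state(stat_line):
    """Return the one-letter state from a /proc/<pid>/stat line.

    The command name (field 2) is wrapped in parentheses and may itself
    contain spaces or ')', so split after the LAST closing parenthesis.
    """
    after_comm = stat_line[stat_line.rindex(")") + 1:]
    return after_comm.split()[0]

def process_state(pid="self"):
    """Read the current state of a process from procfs (Linux only)."""
    with open(f"/proc/{pid}/stat") as f:
        return parse_state(f.read())

if os.path.exists("/proc/self/stat"):
    # A process inspecting itself is on the CPU, so this normally shows R.
    print(process_state())
```

The letters match those shown by ps(1): R for running or runnable, S for interruptible sleep, D for uninterruptible sleep, Z for zombie.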

You can encounter unkillable processes in the D state in a few situations. NFS is famous for that, and I've encountered it many times. I think there's a semantic clash between some VFS code paths, which assume they always reach local disks with fast error detection (on SATA, an error timeout would be around a few hundred ms), and NFS, which actually fetches data over the network, which is more resilient but slow to recover (a TCP timeout of 300 seconds is common). Read this article for the cool solution introduced in Linux 2.6.25 with the TASK_KILLABLE state. Before this era there was a hack where you could actually send signals to NFS client processes by sending a SIGKILL to the kernel thread rpciod, but forget about that ugly trick.…

answered Sep 22 '22 22:09

zerodeux

While waiting for read() or write() on a file descriptor to return, the process will be put in a special kind of sleep, known as "D" or "Disk Sleep". This is special because the process cannot be killed or interrupted while in such a state. A process waiting for a return from ioctl() would also be put to sleep in this manner.

An exception to this is when a file (such as a terminal or other character device) is opened in O_NONBLOCK mode, which is passed when it's assumed that a device (such as a modem) will need time to initialize. However, you indicated block devices in your question. Also, I have never tried an ioctl() that is likely to block on an fd opened in non-blocking mode (at least not knowingly).

How another process is chosen depends entirely on the scheduler you are using, as well as what other processes might have done to modify their weights within that scheduler.
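As one illustration of a process modifying its own weight, here is a small sketch using the nice value. It assumes the default time-sharing policy (SCHED_OTHER), where niceness determines the scheduling weight; other policies and schedulers behave differently.

```python
import os

# os.nice(0) adds 0 to the niceness, so it just reports the current value.
current_nice = os.nice(0)

# Raising niceness lowers this process's weight. No privileges are needed
# in this direction; lowering it again usually requires root.
new_nice = os.nice(1)
print(current_nice, "->", new_nice)
```

The kernel caps niceness at 19, so the second call leaves the value unchanged if the process is already at the maximum.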

Some user space programs under certain circumstances have been known to remain in this state forever, until rebooted. These are typically grouped in with other "zombies", but the term would not be correct as they are not technically defunct.

answered Sep 25 '22 22:09

Tim Post
