Can please summarize the events/steps that happen when I try to execute a read()/write() system call. How does the kernel know which file system to issue these commands. Lets say a process calls write(). Then It will call sys_write(). Now probably, since sys_write() is executed on behalf of the current process, it can access the struct task_struct and hence it can access the struct files_struct and struct fs_struct which contains file system information. But after that I am not seeing, how this fs_struct is helping to identify the file system. Edit: Now that Alex has described the flow...I have still doubt how the read/write are getting routed to a FS, since the VFS does not do it, then it must be happening somewhere else, Also how is the underlying block device and then finally the hardware protocol PCI/USB getting attached. A simple flow chart involving actual data structures would be helpful Please help.

This answer is based on kernel version 4.0. I traced out some of the code which handles a <code>read</code> syscall. I recommend you clone the Linux source repo and follow along in the source code. <ol> <li>Syscall handler for <code>read</code>, at <code>fs/read_write.c:620</code> is called. It receives a file descriptor (integer) as an argument, and calls <code>fdget_pos</code> to convert it to a <code>struct fd</code>.</li> <li> <code>fdget_pos</code> calls <code>__fdget_pos</code> calls <code>__fdget</code> calls <code>__fget_light</code>. <code>__fget_light</code> uses <code>current->files</code>, the file descriptor table for the current process, to look up the <code>struct file</code> which corresponds to the passed file descriptor number.</li> <li>Back in the syscall handler, the file struct is passed to <code>vfs_read</code>, at <code>fs/read_write.c:478</code>.</li> <li> <code>vfs_read</code> calls <code>__vfs_read</code>, which calls <code>file->f_op->read</code>. From here on, you are in filesystem-specific code.</li> </ol> So the VFS doesn't really bother "identifying" the filesystem which a file lives on; it simply uses the table of "file operation" function pointers which is stored in its <code>struct file</code>. When that <code>struct file</code> is initialized, it is given the correct <code>f_op</code> function pointer table which implements all the filesystem-specific operations for its filesystem.

How linux identify a particular file system to execute system call

Tags:

linux

linux-kernel

vfs

Can please summarize the events/steps that happen when I try to execute a read()/write() system call. How does the kernel know which file system to issue these commands.

Lets say a process calls write(). Then It will call sys_write().

Now probably, since sys_write() is executed on behalf of the current process, it can access the struct task_struct and hence it can access the struct files_struct and struct fs_struct which contains file system information.

But after that I am not seeing, how this fs_struct is helping to identify the file system.

Edit: Now that Alex has described the flow...I have still doubt how the read/write are getting routed to a FS, since the VFS does not do it, then it must be happening somewhere else, Also how is the underlying block device and then finally the hardware protocol PCI/USB getting attached.

A simple flow chart involving actual data structures would be helpful

Please help.

976

asked Mar 24 '15 03:03

Haswell

1 Answers

This answer is based on kernel version 4.0. I traced out some of the code which handles a read syscall. I recommend you clone the Linux source repo and follow along in the source code.

Syscall handler for read, at fs/read_write.c:620 is called. It receives a file descriptor (integer) as an argument, and calls fdget_pos to convert it to a struct fd.
fdget_pos calls __fdget_pos calls __fdget calls __fget_light. __fget_light uses current->files, the file descriptor table for the current process, to look up the struct file which corresponds to the passed file descriptor number.
Back in the syscall handler, the file struct is passed to vfs_read, at fs/read_write.c:478.
vfs_read calls __vfs_read, which calls file->f_op->read. From here on, you are in filesystem-specific code.

So the VFS doesn't really bother "identifying" the filesystem which a file lives on; it simply uses the table of "file operation" function pointers which is stored in its struct file. When that struct file is initialized, it is given the correct f_op function pointer table which implements all the filesystem-specific operations for its filesystem.

125

answered Oct 04 '22 17:10

Alex D

Related questions
                            
                                USBDEVFS_RESET vs IOCTL_USB_RESET
                            
                                ALSA Configuration How To Combine MMAP Emulation and Ladspa Plugin in asound.conf
                            
                                Starting FOREVER or PM2 as WWW-DATA from a PHP script
                            
                                Java program is getting slower after running for a while
                            
                                Eventloop has high ksoftirqd load; nginx does not but does same system-calls. Why?
                            
                                How can I find the alpha shape (concave hull) of a 2d point cloud?
                            
                                Docker: --ipc=host and security
                            
                                Send keystrokes to non-active GUI application without occupying the keyboard
                            
                                Calling Haskell from Java with C in between
                            
                                Docker container can reach DNS but not resolve hosts
                            
                                Is there a way to improve performance of linux pipes?
                            
                                where is amd64 psABI? [duplicate]
                            
                                How to conveniently sync a file between two git repositories
                            
                                How to extract C source code from .so file?
                            
                                Creating a high-performance network server in C++
                            
                                Why Linux/gnu linker chose address 0x400000?
                            
                                Amazon linux AMI vs Ubuntu
                            
                                GUI debugger for c++ on linux [closed]
                            
                                why is "autoreconf" not used often?
                            
                                npm hangs on postinstall / unlock

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With