Is lseek() O(1) complexity?

Tags:

I know that my question has an answer here: QFile seek performance. But I am not completely satisfied with the answer. Even after looking at the following implementation of generic_file_llseek() for ext4, I can't seem to understand how can the complexity be measured.

/**
 * generic_file_llseek - generic llseek implementation for regular files
 * @file:       file structure to seek on
 * @offset:     file offset to seek to
 * @origin:     type of seek
 *
 * This is a generic implemenation of ->llseek useable for all normal local
 * filesystems.  It just updates the file offset to the value specified by
 * @offset and @origin under i_mutex.
 */
loff_t generic_file_llseek(struct file *file, loff_t offset, int origin)
{
        loff_t rval;

        mutex_lock(&file->f_dentry->d_inode->i_mutex);
        rval = generic_file_llseek_unlocked(file, offset, origin);
        mutex_unlock(&file->f_dentry->d_inode->i_mutex);

        return rval;
}

/**
 * generic_file_llseek_unlocked - lockless generic llseek implementation
 * @file:       file structure to seek on
 * @offset:     file offset to seek to
 * @origin:     type of seek
 *
 * Updates the file offset to the value specified by @offset and @origin.
 * Locking must be provided by the caller.
 */
loff_t
generic_file_llseek_unlocked(struct file *file, loff_t offset, int origin)
{
        struct inode *inode = file->f_mapping->host;

        switch (origin) {
        case SEEK_END:
                offset += inode->i_size;
                break;
        case SEEK_CUR:
                /*
                 * Here we special-case the lseek(fd, 0, SEEK_CUR)
                 * position-querying operation.  Avoid rewriting the "same"
                 * f_pos value back to the file because a concurrent read(),
                 * write() or lseek() might have altered it
                 */
                if (offset == 0)
                        return file->f_pos;
               break;
        }

        if (offset < 0 || offset > inode->i_sb->s_maxbytes)
                return -EINVAL;

        /* Special lock needed here? */
        if (offset != file->f_pos) {
                file->f_pos = offset;

                file->f_version = 0;
        }

        return offset;
}

Say, for example, I have a 4GB file, and I know the offset for the middle portion in the file, how exactly does a lseek() get me there without traversing the entire file? Does the OS already know where each byte of the file resides?

227

asked Feb 09 '14 11:02

Nehal J Wani

2 Answers

lseek() as implemented in ext4 will just increment the file pointer and do some validation checks. It doesn't depend on the file size, meaning it is O(1).

Also you can see this in the code, there isn't any loop nor suspicious function calls in there.

However, while this is true on ext4, it might be not true for other filesystems, as this behaviour isn't guaranteed by POSIX. But it is likely unless the filesystem is meant for a very special purpose.

144

answered Oct 13 '22 06:10

hek2mgl

lseek's complexity depends on the representation of file in your system. On most modern systems a file is organized by some clever tree-like data structure resulting into seek being executed in time O(logx(n)), where n is the size of the file and x some system depending number.

answered Oct 13 '22 04:10

Marian

Related questions
                            
                                Value lookup table in C by strings?
                            
                                do malloc/memcpy function run independently on NUMA?
                            
                                REPL for interpreter using Flex/Bison
                            
                                linux kernel aio functionality
                            
                                what is mean by "suppress results from generated code"
                            
                                Find holes in C structs due to alignment
                            
                                Graph Data Structures with millions of nodes (Social network)
                            
                                How to make global constant (work across multiple files) in C program?
                            
                                Using C Preprocessing to get integer value of a string
                            
                                OpenSSL ASN.1 programming tutorial
                            
                                Reading output of a USB webcam in Linux
                            
                                How to deal with network port abuse in sockets
                            
                                Can I use GCC's __builtin_expect() with ternary operator in C
                            
                                What type of address returned on applying ampersand to a variable or a data type in C/C++ or in any other such language?
                            
                                How to check if a structure is initialized?
                            
                                Correct way to allocate and free arrays of pointers to arrays
                            
                                Bitfields, why implementation specific?
                            
                                Why is _GNU_SOURCE macro required for pthread_mutexattr_settype() while it is in POSIX/IEEE standard?
                            
                                How to handle C++ internal data structure in R in order to allow save/load?
                            
                                ASM call conventions

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Is lseek() O(1) complexity?

Tags:

c

linux

time-complexity

ext4

lseek

Nehal J Wani

People also ask

2 Answers

hek2mgl

Marian

Recent Activity

Donate For Us