Data structures used to build file systems?

1 Answers

All file systems are different, so there are a huge number of data structures that actually get used in file systems.

Many file systems use some sort of bit vector (usually referred to as a bitmap) to track where certain free blocks are, since they have excellent performance for querying whether a specific block of disk is in use and (for disks that aren't overwhelmingly full) support reasonably fast lookups of free blocks.

Many older file systems (ext and ext2) stored directory structures using simple linked lists. Apparently this was actually fast enough for most applications, though some types of applications that used lots of large directories suffered noticeable performance hits.

The XFS file system was famous for using B+-trees for just about everything, including directory structures and its journaling system. From what I remember from my undergrad OS course, the philosophy was that since it took so long to write, debug, and performance tune the implementation of the B+-tree, it made sense to use it as much as possible.

Other file systems (ext3 and ext4) use a variant of the B-tree called the HTree that I'm not very familiar with. Apparently it uses some sort of hashing scheme to keep the branching factor high so that very few disk accesses are used.

I have heard anecdotally that some operating systems tried using splay trees to store their directory structures but ran into trouble with them. Specifically, it prevented multithreaded access to the same directory from multiple readers (since in a splay tree, each access reshapes the tree) and encountered an edge case where the tree would degenerate to a linked list if all elements of the tree were accesses sequentially. That said, I don't know if this is just an urban legend, since these problems would have been apparent before anyone tried to code them up.

Microsoft's FAT32 system used a huge array (the file allocation table) that store what files were stored where and which disk sectors follow one another logically in a file. The main drawback is that the table had to be set up in advance, so there ended up being upper limits on the sizes of files that could be stored on the disk. However, the array-based system was pretty easy to implement.

This is not an exhaustive list - I'm sure that other file systems use other data structures. However, I hope it helps give you a push in the right direction.

Hope this helps!

148

answered Oct 19 '22 23:10

templatetypedef

Related questions
                            
                                C++ - interval tree implementation
                            
                                Order-preserving data structures in C#
                            
                                How is it possible to build a suffix tree in linear time?
                            
                                Data structure for storing recurring events?
                            
                                Immutable queue in Clojure
                            
                                Pop multiple values from Redis data structure atomically?
                            
                                How does Scala's Vector work?
                            
                                How expensive are Python dictionaries to handle?
                            
                                Search an element in a heap
                            
                                Using a Python Dictionary as a Key (Non-nested)
                            
                                insert, delete, max in O(1)
                            
                                Sets in Ruby?
                            
                                What is a hash table and how do you make it in C? [closed]
                            
                                Binary search vs binary search tree
                            
                                Is there a standard Java implementation of a Fibonacci heap?
                            
                                Explain the difference between a data *structure* and a data *type* [closed]
                            
                                What are lenses used/useful for?
                            
                                check if a number already exist in a list in python
                            
                                What is primary and secondary clustering in hash?
                            
                                Anyone familiar with mp4 data structure?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Data structures used to build file systems?

Tags:

operating-system

filesystems

data-structures

Bernice

People also ask

1 Answers

templatetypedef

Recent Activity

Donate For Us