I have a directory with 500,000 files in it. I would like to access them as quickly as possible. The algorithm requires me to repeatedly open and close them (I can't have 500,000 files open simultaneously).
How can I do this efficiently? I had originally thought that I could cache the inodes and open the files that way, but *nix doesn't provide a way to open files by inode (for security reasons or some such).
The other option is to just not worry about it and hope the filesystem does a good job of looking up files in a directory. If that is the best option, which filesystems would work best? Do certain filename patterns look up faster than others, e.g. 01234.txt vs foo.txt?
BTW this is all on Linux.
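One thing that should help regardless of the filesystem: open the directory once and open each file relative to that descriptor with openat(2), so the kernel doesn't re-resolve the directory's path components on every open. A minimal sketch, assuming sequentially numbered names (the directory name, naming pattern, and count are placeholders):

```c
/* Minimal sketch: cache one descriptor for the directory and open each
 * file relative to it with openat(2). "bigdir" and the %05d.txt naming
 * pattern are illustrative placeholders. */
#include <fcntl.h>
#include <stdio.h>
#include <unistd.h>

int main(void)
{
    int dirfd = open("bigdir", O_RDONLY | O_DIRECTORY);
    if (dirfd < 0) { perror("open bigdir"); return 1; }

    char buf[4096];
    for (int i = 0; i < 500000; i++) {
        char name[32];
        snprintf(name, sizeof(name), "%05d.txt", i);

        int fd = openat(dirfd, name, O_RDONLY);  /* no directory re-resolution */
        if (fd < 0) { perror(name); continue; }

        ssize_t n = read(fd, buf, sizeof(buf));  /* stand-in for real processing */
        (void)n;
        close(fd);                               /* only one file open at a time */
    }

    close(dirfd);
    return 0;
}
```

The dentry cache will usually keep repeated lookups cheap after the first pass in any case; openat() mainly saves the repeated walk of the directory's own path.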
You can put 4,294,967,295 files into a single folder if the drive is formatted with NTFS (which would be unusual if it were not), as long as you do not exceed 256 terabytes (the maximum single file size and volume size) or the disk space actually available, whichever is less.
Maximum number of files on disk: 4,294,967,295. Maximum number of files in a single folder: 4,294,967,295.
Most modern filesystems do OK with that many files. Once you hit around 32k files in a directory, though, some filesystems, such as ext3 without dir_index, will start having serious performance issues.
As an aside, rsync beats rm -rf for deleting a huge directory in this benchmark: web.archive.org/web/20130929001850/http://linuxnote.net/… It's a good example of a faster way to destroy that many files.
Assuming your filesystem is ext3, your directory is indexed with a hashed B-tree if dir_index is enabled; that will give you as much of a boost as anything you could code into your app. You can check whether it is on with tune2fs -l /dev/sdXN | grep dir_index (substitute your own device), and enable it with tune2fs -O dir_index followed by e2fsck -D to reindex existing directories.
If the directory is indexed, your file naming scheme shouldn't matter.
http://lonesysadmin.net/2007/08/17/use-dir_index-for-your-new-ext3-filesystems/
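If you want to verify on your own filesystem that the naming scheme doesn't matter, a micro-benchmark is easy to write. A rough sketch (the directory name, file count, and %05d.txt pattern are placeholders); run it once per naming scheme over identically populated directories and compare:

```c
/* Rough sketch: time open()+close() over many files to compare how
 * fast name lookups are under different naming schemes. With dir_index
 * enabled, the schemes should come out about the same. */
#include <fcntl.h>
#include <stdio.h>
#include <time.h>
#include <unistd.h>

int main(void)
{
    struct timespec t0, t1;
    int opened = 0;

    clock_gettime(CLOCK_MONOTONIC, &t0);
    for (int i = 0; i < 100000; i++) {
        char name[64];
        snprintf(name, sizeof(name), "bigdir/%05d.txt", i);  /* placeholder names */
        int fd = open(name, O_RDONLY);
        if (fd >= 0) { opened++; close(fd); }
    }
    clock_gettime(CLOCK_MONOTONIC, &t1);

    double secs = (t1.tv_sec - t0.tv_sec) + (t1.tv_nsec - t0.tv_nsec) / 1e9;
    printf("%d opens in %.3f s (%.1f us each)\n", opened, secs,
           opened ? secs * 1e6 / opened : 0.0);
    return 0;
}
```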