Task Parallel Library for directory traversal

Tags:

I'd like to traverse a directory on my hard drive and search through all the files for a specific search string. This sounds like the perfect candidate for something that could (or should) be done in parallel since the IO is rather slow.

Traditionally, I would write a recursive function to finds and processes all files in the current directory and then recurse into all the directories in that directory. I'm wondering how I can modify this to be more parallel. At first I simply modified:

foreach (string directory in directories) { ... }

Parallel.ForEach(directories, (directory) => { ... })

but I feel that this might create too many tasks and get itself into knots, especially when trying to dispatch back onto a UI thread. I also feel that the number of tasks is unpredictable and that this might not be an efficient way to parallize (is that a word?) this task.

Has anyone successfully done something like this before? What advice do you have in doing so?

407

asked Nov 10 '10 22:11

rein

1 Answers

No, this doesn't sound like a good candidate for parallelism precisely because the IO is slow. You're going to be diskbound. Assuming you've only got one disk, you don't really want to be making it seek to multiple different places at the same time.

It's a bit like trying to attach several hoses to the same tap in order to get water out faster - or trying to run 16 CPU-bound threads on a single core :)

answered Sep 22 '22 01:09

Jon Skeet

Related questions
                            
                                Why ever cast reference types when you can use "as"? [duplicate]
                            
                                Process.kill() denied in Windows 7 32bits even with Administrator privileges
                            
                                C# Regexp: How to extract $1, $2 variables from match
                            
                                Efficient way to read a specific line number of a file. (BONUS: Python Manual Misprint)
                            
                                C# - Console.Beep does not work on Windows Vista
                            
                                Generating every character combination up to a certain word length
                            
                                Identity column in EF 4
                            
                                If Using Enterprise Library, Is log4net better to log with?
                            
                                WPF - How to add effects (like Shadow) to a Label
                            
                                What's the right pattern for waiting on a file lock to be released?
                            
                                Why the shortcut created by my MSI install start the setup process again each time?
                            
                                basic about "using" construct
                            
                                How can I get X, Y positions of mouse relative to form even when clicking on another control?
                            
                                How can I use Automapper to map an object to an unknown destination type?
                            
                                Object Orientated Design Parent / Child Relationship
                            
                                How to forbid backspace key in WPF
                            
                                How do I get the children of an Element in Watin?
                            
                                Proper way to handle button clicks in asp.net mvc?
                            
                                What does generic typing to a new instance achieve?
                            
                                Regex and the colon (:)

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Task Parallel Library for directory traversal

Tags:

c#

.net

task-parallel-library

rein

People also ask

1 Answers

Jon Skeet

Recent Activity

Donate For Us