 

Why does grep hang when run against the / directory?

My question is in two parts:

1) Why does grep hang when I grep all files under "/"?

For example:

grep -r 'h' ./

(Note: right before the hang/crash, I see some "No such device or address" messages regarding sockets.)

Of course, I know that grep shouldn't be run against a socket, but I would think that since sockets are just files in Unix, it should return a negative result rather than crashing.

2) Now, my follow-up question: in any case, how can I grep the whole filesystem? Are there certain *NIX directories we should leave out when doing this? In particular, I'm looking for all recently written log files.

asked Nov 01 '11 by jayunit100

People also ask

Why is grep taking so long?

If you're running grep over a very large number of files, it will be slow because it needs to open and read through all of them. If you have some idea of where the file you're looking for might be, limit the search to those locations.
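For instance, GNU grep's --include option restricts a recursive search to files whose names match a glob (a sketch on a throwaway directory; the file names and the 'error' pattern are made up; in practice you would point this at something like /var/log):

```shell
# --include limits a recursive grep to matching file names (GNU grep).
tmp=$(mktemp -d)
printf 'error: disk full\n' > "$tmp/syslog.log"
printf 'error: ignored\n'   > "$tmp/notes.txt"   # wrong suffix: excluded

grep -r --include='*.log' 'error' "$tmp"   # only syslog.log is searched
rm -rf "$tmp"
```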

Does grep work on directories?

The grep command is a useful Linux command for performing file content searches. It also lets us search recursively through a specific directory to find all files matching a given pattern.

What is faster than grep?

The grep utility searches text files for regular expressions, but it can also search for ordinary strings, since these are a special case of regular expressions. However, if your regular expressions are in fact simply text strings, fgrep may be much faster than grep.
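A quick illustration of the difference (grep -F is the modern spelling of fgrep; the sample strings here are made up):

```shell
# grep -F treats the pattern as a literal string, so regex
# metacharacters like '.' are not special.
printf 'a.b\naxb\n' | grep -F 'a.b'   # matches only the literal "a.b"
printf 'a.b\naxb\n' | grep 'a.b'      # '.' matches any char: both lines
```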

What does the -v option to grep do?

-v means "invert the match" in grep; in other words, it returns all non-matching lines.
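For example (sample input made up):

```shell
# grep -v prints the lines that do NOT match the pattern.
printf 'apple\nbanana\ncherry\n' | grep -v 'an'   # prints apple and cherry
```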


1 Answer

As @ninjalj said, if you don't use -D skip, grep will try to read all your device files, socket files, and FIFO files. In particular, on a Linux system (and many Unix systems), it will try to read /dev/zero, which appears to be infinitely long.

You'll be waiting for a while.
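You can see -D skip in action without touching real devices by trying it on a FIFO (a sketch using a throwaway directory; without -D skip, grep would block trying to open the pipe, since no process ever writes to it):

```shell
# With -D skip, grep ignores devices, FIFOs, and sockets, so the search
# returns promptly with the match from the regular file.
tmp=$(mktemp -d)
echo 'needle' > "$tmp/haystack.txt"
mkfifo "$tmp/pipe"

grep -r -D skip 'needle' "$tmp"
rm -rf "$tmp"
```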

If you're looking for a system log, starting from /var/log is probably the best approach.

If you're looking for something that really could be anywhere in your file system, you can do something like this:

find / -xdev -type f -print0 | xargs -0 grep -H pattern

The -xdev argument to find tells it to stay within a single filesystem; this will avoid /proc and /dev (as well as any mounted filesystems). -type f limits the search to ordinary files. -print0 prints the file names separated by null characters rather than newlines; this avoids problems with files having spaces or other funny characters in their names.

xargs reads a list of file names (or anything else) on its standard input and invokes the specified command on everything in the list. The -0 option works with find's -print0.

The -H option to grep tells it to prefix each match with the file name. By default, grep does this only if there are two or more file names on its command line. Since xargs splits its arguments into batches, it's possible that the last batch will have just one file, which would give you inconsistent results.

Consider using find ... -name '*.log' to limit the search to files with names ending in .log (assuming your log files have such names), and/or using grep -I ... to skip binary files.
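Putting those refinements together (demonstrated on a throwaway tree; substitute / for "$tmp" in real use, and expect it to take a while):

```shell
# Restrict the search to *.log regular files and skip binary files (-I);
# -H makes grep always print the matching file's name.
tmp=$(mktemp -d)
printf 'error: boom\n' > "$tmp/app.log"
printf 'error: boom\n' > "$tmp/app.txt"   # wrong suffix: excluded

find "$tmp" -xdev -type f -name '*.log' -print0 \
    | xargs -0 grep -I -H 'error'         # prints only app.log's line
rm -rf "$tmp"
```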

Note that all this depends on GNU-specific features. Some of these options might not be available on macOS (which is based on BSD) or on other Unix systems. Consult your local documentation, and consider installing GNU findutils (for find and xargs) and/or GNU grep.

Before trying any of this, use df to see just how big your root filesystem is. Mine is currently 268 gigabytes; searching all of it would probably take several hours. A few minutes spent (a) restricting the files you search and (b) making sure the command is correct will be well worth the time you spend.

answered Sep 19 '22 by Keith Thompson