I'd like to know if there are any tips for making grep as fast as possible. I have a rather large base of text files to search through as quickly as possible. I've already made them all lowercase so I could drop the -i option, which makes the search much faster.
Also, I've found out that the -F and -P modes are quicker than the default one. I use the former when the search string is not a regular expression (just plain text), the latter when a regex is involved.
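For what it's worth, a quick way to compare these modes yourself is to time each one on the same data (a rough sketch; the file name and pattern here are just placeholders, and relative speeds depend heavily on the pattern and the data):

time grep 'needle' corpus.txt      # default basic-regex search
time grep -F 'needle' corpus.txt   # fixed-string search, skips the regex engine
time grep -P 'needle' corpus.txt   # PCRE engine (GNU grep only)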
Does anyone have experience speeding up grep? Maybe compile it from scratch with some particular flags (I'm on Linux CentOS), organize the files in a certain fashion, or maybe make the search parallel in some way?
Is fgrep ("fast grep") faster? The grep utility searches text files for regular expressions, but it can also search for ordinary strings, since these strings are a special case of regular expressions. However, if your regular expressions are in fact simply text strings, fgrep may be much faster than grep.
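A small illustrative example of the difference (the file name is made up): fgrep, equivalent to grep -F, treats the pattern as a literal string, so regex metacharacters lose their meaning:

grep '1.5' versions.txt     # '.' matches any character, so this also matches '125', '1x5', ...
grep -F '1.5' versions.txt  # matches only the literal three characters '1.5'

Besides the potential speedup, this also spares you from escaping special characters.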
@Gilles Looks good. Repeating each test here 100 times (timing the entire thing), egrep is faster than grep until I set LANG=C, and then they're both roughly the same.

@EightBitTony Look at the user time (which does not include time spent waiting for disk). There is an order of magnitude of difference.
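The LANG=C effect mentioned above is worth spelling out: in a UTF-8 locale, grep has to do multibyte character handling, which can be expensive. Forcing the C locale often speeds things up considerably on ASCII-only data (illustrative command; the file name is a placeholder):

LC_ALL=C grep -F 'needle' corpus.txt

LC_ALL overrides LANG and all other locale variables, so it is the safest one to set for this purpose.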
grep should be slightly faster, because awk does more with each input line than just search it for a regexp. For example, if a field is referenced in the script (which it is not in this case), awk splits each input line into fields based on the field-separator value and populates its builtin variables.
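For comparison, the two searches look like this (pattern and file name are placeholders; note that awk patterns are EREs, so grep needs -E to accept the same syntax):

grep -E 'warn|error' app.log   # just prints lines matching the ERE
awk '/warn|error/' app.log     # same output, but awk still runs its per-line machinery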
Typically, grep is an efficient way to search text. However, it can be quite slow in some cases, and when searching very large files even minor performance tweaks can help significantly.
Try GNU parallel, which includes an example of how to use it with grep:
grep -r greps recursively through directories. On multicore CPUs, GNU parallel can often speed this up:

find . -type f | parallel -k -j150% -n 1000 -m grep -H -n STRING {}

This will run 1.5 jobs per core and give 1000 arguments to grep.
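If GNU parallel is not available, a rough equivalent can be sketched with xargs -P from GNU findutils (the job count of 4 is arbitrary here; -print0/-0 keeps unusual file names safe):

find . -type f -print0 | xargs -0 -P 4 -n 1000 grep -H -n STRING

Unlike parallel -k, this does not keep the output in input order, and lines from concurrent greps can in principle interleave.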
For big files, it can split the input into several chunks with the --pipe and --block arguments:
parallel --pipe --block 2M grep foo < bigfile
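If the big file is a regular (seekable) file, newer versions of GNU parallel also offer --pipepart, which is typically much faster than --pipe because each worker reads its block directly from the file instead of going through a single dispatching process:

parallel --pipepart --block 2M -a bigfile grep foo

(--pipepart requires the input to be given with -a rather than on stdin.)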
You could also run it on several different machines over SSH (an ssh-agent is needed to avoid password prompts):
parallel --pipe --sshlogin server.example.com,server2.example.net grep foo < bigfile
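As a side note, the special sshlogin ':' means the local machine, so you can combine local and remote workers (the hostname is just an example):

parallel --pipe --sshlogin :,server.example.com grep foo < bigfile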