How to count differences between two files on Linux?

Tags:

shell

diff

count

I work with large files and need to compare two of them. I don't need the differing content itself, only the number of differences.

To count the rows that differ, I came up with:

diff --suppress-common-lines --speed-large-files -y File1 File2 | wc -l

And it works, but is there a better way to do it?

And how can I count the exact number of differences (with standard tools like bash, diff, awk, sed, or some old version of Perl)?

463

asked Oct 14 '09 14:10

Zsolt Botykai

1 Answer

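If you want to count the number of places where the files differ, use this (each line starting with @ in the unified output marks one hunk, so this counts contiguous changed regions rather than individual lines):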

diff -U 0 file1 file2 | grep ^@ | wc -l

Doesn't John's answer double count the different lines?
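
To make the different kinds of "count" concrete, here is a minimal sketch, assuming a diff that supports -U (GNU diff does) and a standard grep. The helper names count_hunks, count_removed and count_added are made up for illustration and are not part of any tool.

```shell
#!/bin/sh
# Hypothetical helpers (names are illustrative only).

# Number of hunks, i.e. contiguous changed regions.
# With -U 0 every hunk header starts with "@@".
count_hunks() {
    diff -U 0 "$1" "$2" | grep -c '^@@'
}

# Number of lines that appear only in the first file.
# Normal diff output prefixes these lines with "<".
count_removed() {
    diff "$1" "$2" | grep -c '^<'
}

# Number of lines that appear only in the second file.
# Normal diff output prefixes these lines with ">".
count_added() {
    diff "$1" "$2" | grep -c '^>'
}
```

Call them as count_hunks file1 file2 and so on. A line that was modified shows up once in count_removed and once in count_added, so summing the two counts such a line twice; which number is the "exact" one depends on whether you care about regions, old lines, or new lines.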

128

answered Oct 22 '22 06:10

Josh
