Fastest way of finding differences between two files in unix?

I want to find the differences between two files and write only those differences to a third file. I have seen approaches using awk, diff and comm. Are there any others?

e.g. Compare two files line by line and generate the difference in another file

e.g. Copy differences between two files in unix

I need to know the fastest way of finding all the differences and listing them in a file for each of the cases below:

Case 1 - file2 = file1 + extra text appended.
Case 2 - file2 and file1 are different.

481

asked Aug 05 '13 23:08

Steam

2 Answers

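You could try:

comm -13 <(sort file1) <(sort file2) > file3

or

grep -Fxvf file1 file2 > file3

or

diff file1 file2 | grep '^>' | sed 's/^> //' > file3

or

join -v 2 <(sort file1) <(sort file2) > file3

Each of these writes the lines that appear only in file2 to file3. The <(...) syntax is process substitution, so the comm and join variants need bash, ksh or zsh. Note that join compares only the first field of each line by default, so it agrees with the others only when every line is a single field.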
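
To see how these commands behave on Case 1, here is a minimal sketch on made-up sample data. The file contents and the output names (file3.comm, file3.grep, file3.diff, file3.join) are assumptions for illustration, and sorted temporary copies stand in for `<(sort ...)` so the script also runs in a plain POSIX sh.

```shell
#!/bin/sh
# Hypothetical sample data in a scratch directory (contents are made up).
dir=$(mktemp -d)
cd "$dir" || exit 1
printf '%s\n' apple banana cherry > file1
printf '%s\n' apple banana cherry date elderberry > file2   # Case 1: file1 plus appended lines

# Sorted copies take the place of <(sort file1) and <(sort file2).
sort file1 > file1.sorted
sort file2 > file2.sorted

# Lines that occur only in file2, in sorted order.
comm -13 file1.sorted file2.sorted > file3.comm

# Lines of file2 that equal no whole line of file1, in file2's original order.
grep -Fxvf file1 file2 > file3.grep

# Lines that diff marks as present only in file2, with the "> " prefix stripped.
diff file1 file2 | sed -n 's/^> //p' > file3.diff

# Lines of file2 whose first field has no match in file1.
join -v 2 file1.sorted file2.sorted > file3.join

cat file3.comm
```

On this sample all four output files contain the two appended lines, `date` and `elderberry`. On real data the results can differ: comm and join emit sorted output, grep keeps file2's order, and diff reports positional changes rather than set membership. Which one is fastest depends on file size and on whether the inputs are already sorted, so it is worth timing them on your own files.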

169

answered Oct 01 '22 15:10

danmc

Another option:

sort file1 file2 | uniq -u > file3

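The command above keeps the lines that occur exactly once across both files, so a line that is repeated inside a single file is dropped as well. If you want to see just the duplicate entries, i.e. the lines that occur more than once, use the "uniq -d" option: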

sort file1 file2 | uniq -d > file3
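
A small sketch of how the two options behave when one file contains a repeated line. The sample contents and the output names (only_once, repeated, symmetric_difference) are hypothetical, and the `sort -u` step is an added suggestion rather than part of the answer above.

```shell
#!/bin/sh
# Hypothetical sample data in a scratch directory (contents are made up).
dir=$(mktemp -d)
cd "$dir" || exit 1
printf '%s\n' a b b c > file1     # "b" is repeated inside file1
printf '%s\n' a c d > file2

# Lines that occur exactly once across both files.
sort file1 file2 | uniq -u > only_once

# Lines that occur more than once across both files.
sort file1 file2 | uniq -d > repeated

# Deduplicating each file first makes "once" mean "in exactly one file".
sort -u file1 > file1.uniq
sort -u file2 > file2.uniq
sort file1.uniq file2.uniq | uniq -u > symmetric_difference

cat only_once repeated symmetric_difference
```

Here only_once contains just `d`, because `b` appears twice in file1 and is therefore treated as a duplicate; repeated contains `a`, `b` and `c`. After deduplicating each file, symmetric_difference contains `b` and `d`, the lines found in exactly one of the two files.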

answered Oct 01 '22 15:10

pron
