I have two files and I would like to display the lines that appear in both of them. I tried this, but it doesn't work: <code> cat id1.txt | while read id; do grep "$id" id2.txt; done </code> Is there another way to display the duplicate lines? Both files contain a list of IDs. Thank you.

Display duplicate lines in two different files

2 Answers

Using awk will save you time.

awk 'FNR==NR{lines[$0]=1;next} $0 in lines' id1.txt id2.txt

# Explanation
FNR==NR # check whether FNR (the line number within the current file) equals NR (the overall line number),
# which is only true while awk is reading the first file
lines[$0]=1 # store the line in an associative array:
# the key is the line from the first file, the value is 1
next # skip the remaining rules and move on to the next input line
$0 in lines # for each line of the second file, check whether
# it is a key in the array
# if it is, awk prints $0
# (I omitted the {print} action,
# because printing is awk's default when a condition is true)
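To see the behaviour described above, here is a minimal sketch that runs the same awk program on made-up data. The IDs and the temporary directory are assumptions for illustration only; the file names follow the question.

```shell
# Hypothetical sample data: the IDs below are invented for this demo.
tmpdir=$(mktemp -d)
printf '%s\n' 101 102 103 > "$tmpdir/id1.txt"
printf '%s\n' 103 104 101 > "$tmpdir/id2.txt"

# Load id1.txt into the array, then print each line of id2.txt
# that is also a whole line of id1.txt.
awk_common=$(awk 'FNR==NR{lines[$0]=1;next} $0 in lines' "$tmpdir/id1.txt" "$tmpdir/id2.txt")
printf '%s\n' "$awk_common"

# Clean up the demo files.
rm -r "$tmpdir"
```

The matches come out in the order in which they appear in the second file, and neither file needs to be sorted.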

answered Oct 28 '22 18:10

Sandy

Are the files sorted? Can they be sorted?

If sorted:

comm -12 id1.txt id2.txt

If not sorted, but you are using bash (or another shell that supports process substitution):

comm -12 <(sort id1.txt) <(sort id2.txt)

There are solutions using temporary files if your shell does not support process substitution.
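One possible shape for the temporary-file variant is sketched below. The sample IDs, the temporary directory, and the `.sorted` file names are assumptions made up for this demo; only `sort` and `comm -12` come from the answer.

```shell
# Hypothetical sample data: unsorted lists of invented IDs.
tmpdir=$(mktemp -d)
printf '%s\n' 30 10 20 > "$tmpdir/id1.txt"
printf '%s\n' 20 40 30 > "$tmpdir/id2.txt"

# Sort each file into a temporary copy instead of using <(...).
sort "$tmpdir/id1.txt" > "$tmpdir/id1.sorted"
sort "$tmpdir/id2.txt" > "$tmpdir/id2.sorted"

# -12 suppresses the lines unique to either file, leaving the common ones.
comm_common=$(comm -12 "$tmpdir/id1.sorted" "$tmpdir/id2.sorted")
printf '%s\n' "$comm_common"

# Remove the temporary files.
rm -r "$tmpdir"
```

Both `sort` and `comm` run in the same locale here, so the sorted copies are in the order `comm` expects.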

You could also use grep -F:

grep -F -f id1.txt id2.txt

This looks for the strings listed in id1.txt anywhere in the lines of id2.txt. The only problem is ensuring that an ID such as 1 doesn't match every ID containing a 1 somewhere. The -w (match whole words) or -x (match whole lines) options, available in some versions of grep, will work here.
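The substring problem and the effect of `-x` can be seen in a small sketch. The IDs are invented for this demo, and it assumes a grep that supports `-x`.

```shell
# Hypothetical sample data chosen so that short IDs are substrings of longer ones.
tmpdir=$(mktemp -d)
printf '%s\n' 1 25 > "$tmpdir/id1.txt"
printf '%s\n' 11 1 250 25 > "$tmpdir/id2.txt"

# Without -x, any line of id2.txt that contains "1" or "25" matches.
loose=$(grep -F -f "$tmpdir/id1.txt" "$tmpdir/id2.txt")

# With -x, a line matches only if the whole line equals one of the IDs.
exact=$(grep -F -x -f "$tmpdir/id1.txt" "$tmpdir/id2.txt")

printf 'substring matches:\n%s\n' "$loose"
printf 'whole-line matches:\n%s\n' "$exact"

# Clean up the demo files.
rm -r "$tmpdir"
```

With this data the first command reports all four lines of the second file, while the `-x` version reports only 1 and 25.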

answered Oct 28 '22 19:10

Jonathan Leffler


Tags:

linux

bash

Chad D
