 

How to remove duplicate lines from a file

I have a tool that generates tests and predicts the output. The idea is that if I have a failure I can compare the prediction to the actual output and see where they diverged. The problem is the actual output contains some lines twice, which confuses diff. I want to remove the duplicates, so that I can compare them easily. Basically, something like sort -u but without the sorting.

Is there any unix command line tool that can do this?

Asked Apr 14 '09 by Nathan Fellman


People also ask

How do I remove duplicate lines in files?

If you don't need to preserve the order of the lines in the file, the sort and uniq commands will do what you need in a straightforward way: sort puts the lines in alphanumeric order, and uniq collapses sequential identical lines into one.
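A minimal sketch of that pipeline (the sample input here is made up for illustration):

```shell
# sort groups identical lines together; uniq then keeps one of each run
printf 'b\na\nb\nc\n' | sort | uniq    # prints: a b c (one per line)

# sort -u does the same in a single step
printf 'b\na\nb\nc\n' | sort -u
```

Note that both forms reorder the file, which is exactly what the question wants to avoid.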


1 Answer

This complements the uniq answers, which work well if you don't mind sorting your file first. If you need to remove non-adjacent duplicate lines (that is, remove duplicates without rearranging the file), the following Perl one-liner should do it (stolen from here):

perl -ne '$H{$_}++ or print' textfile
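The trick is that `$H{$_}++` evaluates to 0 (false) the first time a line is seen, so each line prints exactly once and order is preserved. The same filter is commonly written in awk; a minimal sketch with made-up sample input:

```shell
# !seen[$0]++ is true only on a line's first occurrence,
# so awk prints each distinct line once, in original order
printf 'a\nb\na\nc\nb\n' | awk '!seen[$0]++'    # prints: a b c (one per line)
```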
Answered Oct 10 '22 by Matt J