Is there a way to filter out all unique lines in a file via commandline tools without sorting the lines? I'd like to essentially do this:
sort -u myFile
without the performance hit of sorting.
The uniq command reads its input (stdin or a filename given as an argument) and either reports or removes duplicated lines. It only collapses adjacent identical lines, which is why it is almost always paired with sort. If you don't need to preserve the order of the lines in the file, sort and uniq together do exactly what you need: sort orders the lines alphanumerically, and uniq reduces each run of identical lines to one (sort -u does both in a single step). uniq can also count how many times each line appears (-c), print only the duplicated lines (-d) or only the non-duplicated ones (-u), and ignore case when comparing (-i).
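As a quick sketch of the sort/uniq approach (note that it reorders the lines, which is exactly what the question wants to avoid):

```shell
# uniq only collapses adjacent duplicates, so sort first.
# -c prefixes each surviving line with its occurrence count.
printf 'b\na\nb\na\na\n' | sort | uniq -c
```

The count column is padded differently across implementations, but for this input both GNU and BSD uniq report 3 occurrences of "a" and 2 of "b".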
Remove duplicated lines:
awk '!a[$0]++' file
This is the famous awk one-liner; there are many explanations of it online. Here is one:

This one-liner is very idiomatic. It records each line it sees in the associative array "a" (all arrays in awk are associative) and, in the same expression, tests whether the line has been seen before. If it has, then a[$0] > 0, so !a[$0] evaluates to false. In awk, a pattern that evaluates to false does nothing, while a pattern that evaluates to true with no action block is equivalent to { print }, so only the first occurrence of each line is printed.
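To see that it preserves the original order (unlike sort -u), here is a small demonstration:

```shell
# Keep only the first occurrence of each line, in input order.
# No sorting needed; the trade-off is that all distinct lines
# are held in memory in the array "a".
printf 'b\na\nb\na\nc\n' | awk '!a[$0]++'
# prints:
# b
# a
# c
```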