Questions Linux Laravel Mysql Ubuntu Git Menu

HTML CSS JAVASCRIPT SQL PYTHON PHP BOOTSTRAP JAVA JQUERY R React Kotlin

print unique lines based on field

Tags:

awk

Would like to print unique lines based on first field , keep the first occurrence of that line and remove duplicate other occurrences.

Input.csv

10,15-10-2014,abc
20,12-10-2014,bcd
10,09-10-2014,def
40,06-10-2014,ghi
10,15-10-2014,abc

Desired Output:

10,15-10-2014,abc
20,12-10-2014,bcd
40,06-10-2014,ghi

Have tried below command and in-complete

awk 'BEGIN { FS = OFS = "," }  { !seen[$1]++ } END { for ( i in seen) print $0}' Input.csv

Looking for your suggestions ...

like image

716

asked Nov 11 '14 14:11

VNA

People also ask

How do you find unique lines in Unix?

The uniq command in UNIX is a command line utility for reporting or filtering repeated lines in a file. It can remove duplicates, show a count of occurrences, show only repeated lines, ignore certain characters and compare on specific fields.

1 Answers

You put your test for "seen" in the action part of the script instead of the condition part. Change it to:

awk -F, '!seen[$1]++' Input.csv

Yes, that's the whole script:

$ cat Input.csv
10,15-10-2014,abc
20,12-10-2014,bcd
10,09-10-2014,def
40,06-10-2014,ghi
10,15-10-2014,abc
$
$ awk -F, '!seen[$1]++' Input.csv
10,15-10-2014,abc
20,12-10-2014,bcd
40,06-10-2014,ghi

like image

92

answered Sep 21 '22 16:09

Ed Morton

Sign in to Comment

Related questions
                            
                                vi keep only first 10 characters of a column
                            
                                Get N'th line of multiple files in linux
                            
                                How to get process id from process name?
                            
                                How to print two arrays side by side with bash script?
                            
                                How to cut the first Sunday to Saturday of each month in a year?
                            
                                How can i reorder a file by ascending order (column)?
                            
                                Other solutions/languages that are superior to the TCL-based Expect? [closed]
                            
                                select multiple lines using the linux command sed
                            
                                Replace Column if equal to a specific value
                            
                                Bash split string
                            
                                Parsing the output of Bash's time builtin
                            
                                Random numbers generation with awk in BASH shell
                            
                                How To Delete First X Lines Based On Minimum Lines In File
                            
                                awk, sed: one liner command for removing spaces from _all_ file names in a given folder?
                            
                                awk + How do I find duplicates in a column?
                            
                                Replace delimited block of text in file with the contents of another file
                            
                                Replace every comma not enclosed in a pair of double quotes with '|' [closed]
                            
                                Why does AWK refuse to sum up floats
                            
                                How do I convert a tab-separated values (TSV) file to a comma-separated values (CSV) file in BASH?
                            
                                How to not print a line if it and the following line starts with the same pattern?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With