I have a file that looks like this:
2011-03-21 name001 line1 2011-03-21 name002 line2 2011-03-21 name003 line3 2011-03-22 name002 line4 2011-03-22 name001 line5
for each name, I only want its last appearance. So, I expect the result to be:
2011-03-21 name003 line3 2011-03-22 name002 line4 2011-03-22 name001 line5
Could someone give me a solution with bash/awk/sed?
1. If you want to remove all duplicates but leave the highest ones, you can apply this formula =MAX(IF($A$2:$A$12=D2,$B$2:$B$12)), remember to press Shift + Ctrl + Enter keys. 2. In the above formulas, A2:A12 is the original list you need to remove duplicates from.
This code get uniq lines by second field but from the end of file or text (like in your result example)
tac temp.txt | sort -k2,2 -r -u
If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!
Donate Us With