I have a command (cmd1) that greps through a log file to pull out a set of numbers. The numbers are in random order, so I use sort -gr to get a reverse-sorted list. There may be duplicates within this sorted list. I need to find the count for each unique number in that list.
For example, if the output of cmd1 is:
100
100
100
99
99
26
25
24
24
I need another command that I can pipe the above output to, so that I get:
100 3
99 2
26 1
25 1
24 2
The uniq command has a convenient -c option to count the number of occurrences of each line in its input. This is precisely what we're looking for. However, one thing we must keep in mind is that uniq -c only counts duplicated lines when they are adjacent.
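Since the question's list is already sorted by sort -gr before counting, piping it straight into uniq -c is enough. A minimal sketch, with printf standing in for the real cmd1 output:

$ printf '%s\n' 100 100 100 99 99 26 25 24 24 | uniq -c
      3 100
      2 99
      1 26
      1 25
      2 24

Note that uniq -c prints the count before the value; the awk step in the answer below swaps the columns to match the desired output.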
If you'd rather do this in Python: use the list.count() method to count occurrences of a single element, or collections.Counter to find all duplicated elements in a list and count them in one pass.
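A minimal sketch of the Counter approach, called from the shell (assuming python3 is on your PATH; the echo stands in for cmd1's output):

$ echo "100 100 100 99 99 26 25 24 24" | python3 -c '
import sys, collections
for value, count in collections.Counter(sys.stdin.read().split()).items():
    print(value, count)'
100 3
99 2
26 1
25 1
24 2

Counter preserves first-seen order (Python 3.7+), so input that is already reverse-sorted comes out reverse-sorted.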
The uniq command in Linux is used to filter out repeated lines in a text file. This command can be helpful if you want to remove duplicate words or strings from a file. Since uniq matches only adjacent lines when looking for redundant copies, it works correctly only on sorted input.
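To see the adjacency requirement in action, here is a small illustration (not from the original answers). Without sort, the two 100s are counted separately because they are not next to each other:

$ printf '%s\n' 100 99 100 | uniq -c
      1 100
      1 99
      1 100
$ printf '%s\n' 100 99 100 | sort | uniq -c
      2 100
      1 99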
How about:
$ echo "100 100 100 99 99 26 25 24 24" \ | tr " " "\n" \ | sort \ | uniq -c \ | sort -k2nr \ | awk '{printf("%s\t%s\n",$2,$1)}END{print}'
The result is:

100	3
99	2
26	1
25	1
24	2
uniq -c
works with GNU uniq (coreutils 8.23) at least, and does exactly what you want (assuming sorted input).
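Putting it together with the question's own pipeline: since cmd1's output already goes through sort -gr, appending uniq -c and swapping the columns with awk is all that's left (cmd1 stands for the asker's grep command; a sketch, not tested against their log file):

$ cmd1 | sort -gr | uniq -c | awk '{print $2, $1}'
100 3
99 2
26 1
25 1
24 2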