 

Linux command or script counting duplicated lines in a text file?

Pipe it through sort (to bring identical lines together), then uniq -c to prefix each line with its count:

sort filename | uniq -c

To get that list sorted by frequency, run:

sort filename | uniq -c | sort -nr
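A quick end-to-end sketch of the pipeline above, using a hypothetical sample file data.txt:

```shell
# Build a small sample file (hypothetical name and contents)
printf 'orange\napple\norange\nbanana\norange\napple\n' > data.txt

# sort groups identical lines, uniq -c counts each group,
# sort -nr orders the result by count, highest first
sort data.txt | uniq -c | sort -nr
# prints, most frequent first: orange (3), apple (2), banana (1)
```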

Almost the same as borribles', but if you add the -d flag to uniq it shows only the duplicated lines.

sort filename | uniq -cd | sort -nr
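To illustrate the effect of -d, here is a sketch with a hypothetical file dupes.txt in which one line occurs only once:

```shell
# gamma appears only once; alpha and beta are duplicated
printf 'alpha\nbeta\nalpha\ngamma\nbeta\nalpha\n' > dupes.txt

# uniq -cd counts groups but prints only those with count > 1,
# so gamma is omitted from the output
sort dupes.txt | uniq -cd | sort -nr
# prints: alpha (3), beta (2); gamma does not appear
```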

uniq -c file

and in case the file is not sorted already:

sort file | uniq -c


cat <filename> | sort | uniq -c

Try this

cat myfile.txt | sort | uniq

Can you live with an alphabetically sorted list?

echo "red apple
> green apple
> green apple
> orange
> orange
> orange
> " | sort -u 


green apple
orange
red apple

or

sort -u FILE

The -u flag stands for unique; with sort, deduplication comes as a side effect of sorting.

A solution which preserves the order:

echo "red apple
green apple
green apple
orange
orange
orange
" | { old=""; while read -r line; do if [[ $line != "$old" ]]; then echo "$line"; old=$line; fi; done }
red apple
green apple
orange

and, with a file

cat file | {
old=""
while read -r line
do
  if [[ $line != "$old" ]]
  then
    echo "$line"
    old=$line
  fi
done }

The last two approaches only remove duplicates that follow immediately after each other, which fits your example.

echo "red apple
green apple
lila banana
green apple
" ...

will print both apples, separated by the banana.
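If you need every duplicate removed regardless of position, while still preserving first-occurrence order, a common awk idiom handles the banana example above in one pass:

```shell
# awk '!seen[$0]++' prints a line only the first time it is seen:
# seen[$0] is 0 (falsy) on first encounter, so the line is printed,
# then the post-increment makes every later occurrence falsy-negated
printf 'red apple\ngreen apple\nlila banana\ngreen apple\n' | awk '!seen[$0]++'
# prints: red apple, green apple, lila banana (second "green apple" dropped)
```

Unlike sort -u, this keeps the input order and needs no sorting at all, at the cost of holding each distinct line in memory.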