List all the words in a text file with occurrence counts?

Tags:

bash

sed

awk

Suppose I have a file text.txt as below:

she likes cats, and he likes cats too.

I'd like my result to look like:

she 1
likes 2
cats 2
and 1
he 1
too 1

If including the space, comma, and period in the result would make the script easier, that would be fine.

Is there a simple shell pipeline that could achieve this?

322

asked Mar 14 '13 03:03

JackWM

1 Answer

Here's a one-liner near and dear to my heart:

cat text.txt | sed 's|[,.]||g' | tr ' ' '\n' | sort | uniq -c

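The sed strips punctuation (tune the regex to taste), and the tr puts the results one word per line. The sort then brings identical words together so that uniq -c can count them; each output line has the count first, followed by the word.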
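To get the exact layout asked for in the question (word first, then count, in order of first appearance), one option is to hand the counting to awk. The following is a minimal sketch, not part of the original answer: count_words is a hypothetical function name, and treating every non-letter character as a word separator is an assumption you may want to adjust.

```shell
# count_words is a hypothetical helper name, not part of the original answer.
# Usage: count_words FILE
count_words() {
  # Assumption: any run of non-letter characters separates two words.
  # tr -cs replaces each such run with a single newline (one word per line).
  tr -cs '[:alpha:]' '\n' < "$1" |
    awk 'NF {
           # Remember each word the first time it is seen, to keep input order.
           if (!($0 in count)) order[++n] = $0
           count[$0]++
         }
         END {
           # Print "word count" in order of first appearance.
           for (i = 1; i <= n; i++) print order[i], count[order[i]]
         }'
}
```

Calling count_words text.txt on the sample file from the question should print she 1, likes 2, cats 2, and 1, he 1, too 1, one pair per line. Words that differ only in case are counted separately here; adding a tr '[:upper:]' '[:lower:]' stage before the awk would merge them.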

105

answered Nov 15 '22 05:11

phs
