Unix uniq command to CSV file

I have a text file (list.txt) containing single and multi-word English phrases. My goal is to do a word count for each word and write the results to a CSV file.

I have figured out a command that counts the occurrences of each word and sorts the results from largest to smallest. That command is:

$ tr 'A-Z' 'a-z' < list.txt | tr -sc 'A-Za-z' '\n' | sort | uniq -c | sort -n -r | less > output.txt

The problem is the formatting of the new file (output.txt). Each line has three leading spaces, then the number of occurrences, then a space, then the word. Example:

   9784 the
   6368 and
   4211 for
   2929 to

What would I need to do to get the results in a more useful format, such as CSV? For example, I'd like it to be:

9784,the
6368,and
4211,for
2929,to

Even better would be:

the,9784
and,6368
for,4211
to,2929

Is there a way to do this with a Unix command, or do I need to do some post-processing within a text editor or Excel?

381

asked Mar 11 '13 18:03

Abundnce10

1 Answer

Use awk as follows:

 > cat input 
   9784 the
   6368 and
   4211 for
   2929 to
 > awk '{ print $2 "," $1 }' input
the,9784
and,6368
for,4211
to,2929
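
The swap works because awk splits each line on runs of whitespace by default, so the leading padding from `uniq -c` is ignored and the count and word arrive as `$1` and `$2`. A minimal sketch of the same idea, setting the output separator with `OFS` instead of concatenating a literal comma (the helper name `to_csv` is hypothetical, not part of the original answer):

```shell
# Hypothetical helper: turn "   count word" lines into "word,count" CSV rows.
to_csv() {
  # Default field splitting discards the leading spaces written by `uniq -c`,
  # so $1 is the count and $2 is the word. OFS is placed between the
  # comma-separated arguments of print.
  awk -v OFS=',' '{ print $2, $1 }'
}

# Sample input copied from the question's output.txt.
printf '   9784 the\n   6368 and\n' | to_csv
```

Printing `$1, $2` instead would give the count-first layout from the question.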

Your full pipeline will be:

$ tr 'A-Z' 'a-z' < list.txt | tr -sc 'A-Za-z' '\n' | sort | uniq -c | sort -n -r | awk '{ print $2 "," $1}' > output.txt
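
One edge case may be worth guarding against: if the text starts with a non-letter (a quote mark, a digit), `tr -sc` emits a leading newline, which `uniq -c` counts as an empty "word" and the awk step would print as `,1`. The sketch below wraps the same pipeline in a function and keeps only lines that have both a count and a word. The function name `word_counts_csv` and the `NF == 2` filter are assumptions added for illustration, not part of the original answer.

```shell
# Hypothetical wrapper around the pipeline above; $1 is the input file.
# Steps: lowercase the text, turn every run of non-letters into one newline,
# count identical words, sort by count (largest first), emit "word,count".
# The NF == 2 condition skips the blank entry that appears when the input
# begins with a non-letter character.
word_counts_csv() {
  tr 'A-Z' 'a-z' < "$1" |
    tr -sc 'a-z' '\n' |
    sort |
    uniq -c |
    sort -n -r |
    awk 'NF == 2 { print $2 "," $1 }'
}

# Usage (assumes list.txt exists in the current directory):
#   word_counts_csv list.txt > output.txt
```

The order of words with equal counts depends on the locale's collation, so only the count ordering should be relied on.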

147

answered Oct 02 '22 17:10

Andrew Stein

Tags: bash, unix, csv, uniq
