
Bash - How to count occurrences in a column of a .csv file (without awk)

Tags:

bash

Recently I've started to learn bash scripting, and I'm wondering how I can count occurrences in a column of a .csv file. The file is structured like this:

    DAYS,SOMEVALUE,SOMEVALUE
    sunday,something,something
    monday,something,something
    wednesday,something,something
    sunday,something,something
    monday,something,something

So my question is: how can I count how many times each value in the first column (days) appears? In this case the output should be:

    Sunday : 2
    Monday : 2
    Wednesday : 1

The first column is named DAYS, so the script must not count the single value DAYS; DAYS is just the header that identifies the column.

If possible, I want to see a solution without the awk command and without Python etc.

Thanks guys, and sorry for my bad English.

Edit: I thought of doing this:

    count="$(grep -c "OCCURRENCE" "${FILE}")"
    echo "OCCURRENCE : ${count}"

Where OCCURRENCE is one of the single values (sunday, monday, ...). But this solution is not automatic: I'd need to build a list of the unique values in the first column of the .csv file, put each one in an array, and then count each one with the code I wrote above. I need some help with this, thanks.
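The grep-per-value idea above can be automated in pure bash with an associative array, so no list of values has to be maintained by hand. This is a sketch assuming bash 4+ and a simple CSV with no quoted fields containing commas; the sample data is inlined here instead of read from a file:

```shell
#!/usr/bin/env bash
# Sample data shaped like the question's file.
csv='DAYS,SOMEVALUE,SOMEVALUE
sunday,something,something
monday,something,something
wednesday,something,something
sunday,something,something
monday,something,something'

declare -A counts                      # associative array: value -> count (bash 4+)
while IFS=, read -r day _; do          # split on commas, keep only the first field
    counts["$day"]=$(( ${counts["$day"]:-0} + 1 ))
done < <(printf '%s\n' "$csv" | tail -n +2)   # tail -n +2 skips the DAYS header

for day in "${!counts[@]}"; do
    echo "$day : ${counts[$day]}"
done
```

To read from a real file instead, replace the process substitution with `done < <(tail -n +2 "$FILE")`. Note that iteration order over an associative array is unspecified, so the output lines may appear in any order.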

mark Franz asked Mar 29 '18



2 Answers

    cut -f1 -d, test.csv | tail -n +2 | sort | uniq -c

This gets you this far:

  2 monday
  2 sunday
  1 wednesday

To get your format (Sunday : 2), I think awk would be an easy and clear way (something like awk '{print $2 " : " $1}'), but if you really must avoid it, here's a complete non-awk version:

    cut -f1 -d, test.csv | tail -n +2 | sort | uniq -c | while read line; do words=($line); echo "${words[1]} : ${words[0]}"; done
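The same pipeline is easier to follow spelled out one stage per line. This sketch feeds the question's sample data in via printf instead of reading test.csv, so it is self-contained; note the sort before uniq -c, which is required because uniq only collapses adjacent duplicate lines:

```shell
printf '%s\n' 'DAYS,a,b' 'sunday,a,b' 'monday,a,b' 'wednesday,a,b' 'sunday,a,b' 'monday,a,b' |
  cut -f1 -d, |   # keep only the first comma-separated field
  tail -n +2  |   # drop the DAYS header row
  sort        |   # group identical values so uniq -c can count them
  uniq -c     |   # prefix each value with its count
  while read -r count day; do
      echo "$day : $count"    # reorder "count value" into "value : count"
  done
```

Since read strips leading whitespace and splits on the first space, `count` picks up the number and `day` the value, even though uniq -c pads its counts with spaces.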
sneep answered Oct 16 '22

A variation of @sneep's answer that uses sed to format the result (note that \u, which uppercases the next character, is a GNU sed extension):

    cut -f1 -d, /tmp/data | tail -n +2 | sort | uniq -c | sed 's|^ *\([0-9]*\) \(.*\)|\u\2: \1|g'

Output:

Monday: 2
Sunday: 2
Wednesday: 1

The sed pattern matches:

  • ^ *: Beginning of line, then any number of spaces
  • \([0-9]*\): Any number of digits, stored in group \1
  • A single literal space
  • \(.*\): Any characters until the end of the line, stored in group \2

And replaces the match with:

  • \u\2: Second group, capitalizing first character
  • : \1: Colon, space and the first group
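The substitution can be tried in isolation on a single line shaped like uniq -c output (this assumes GNU sed; BSD sed does not support \u):

```shell
# Feed one sample "count value" line through the GNU sed expression:
# group \1 captures the count, group \2 the value, \u capitalizes it.
echo '      2 sunday' | sed 's|^ *\([0-9]*\) \(.*\)|\u\2: \1|g'
```

With GNU sed this prints `Sunday: 2`, matching the format asked for in the question.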
urban answered Oct 16 '22