
Count occurrences of character per line/field on Unix

Tags:

linux

bash

shell

unix

scripting

Given a file with data like this (i.e. the stores.dat file):

    sid|storeNo|latitude|longitude
    2tt|1|-28.0372000t0|153.42921670
    9|2t|-33tt.85t09t0000|15t1.03274200

What is the command that would return the number of occurrences of the 't' character per line?

e.g. it would return:

    count   lineNum
    4       1
    3       2
    6       3

Also, to count occurrences within a single field, what command would return the following results?

e.g. with column 2 and the character 't' as input:

    count   lineNum
    1       1
    0       2
    1       3

e.g. with column 3 and the character 't' as input:

    count   lineNum
    2       1
    1       2
    4       3


asked Dec 25 '11 11:12

toop

2 Answers

To count the occurrences of a character per line, you can do:

    awk -F'|' 'BEGIN{print "count", "lineNum"}{print gsub(/t/,"") "\t" NR}' file
    count lineNum
    4       1
    3       2
    6       3

To count the occurrences of a character per field/column, you can do:

column 2:

    awk -F'|' -v fld=2 'BEGIN{print "count", "lineNum"}{print gsub(/t/,"",$fld) "\t" NR}' file
    count lineNum
    1       1
    0       2
    1       3

column 3:

    awk -F'|' -v fld=3 'BEGIN{print "count", "lineNum"}{print gsub(/t/,"",$fld) "\t" NR}' file
    count lineNum
    2       1
    1       2
    4       3

The gsub() function returns the number of substitutions it made, so we print that value as the count.
NR holds the current line number, so we print it as lineNum.
To count occurrences in a particular field, we create a variable fld and set it to the number of the field we want counts from.
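
The points above can be folded into one reusable function. This is only a sketch: the name count_char and its argument order are made up for illustration and are not part of the answer. It also swaps gsub() for a loop over index(), which searches for a literal string, so a character such as "." or "*" is not read as a regular-expression operator. It assumes the '|' separator from the question; a field value of 0 means the whole line, because $0 is the full record.

```shell
#!/bin/sh
# count_char CHAR FIELD FILE   (hypothetical helper, not from the original answer)
# Prints a header, then "count<TAB>lineNum" for every line of FILE.
# FIELD=0 counts over the whole line; FIELD>=1 counts inside that '|'-separated field.
# Note: awk applies escape processing to -v values, so a backslash in CHAR
# would need extra care.
count_char() {
    # Refuse an empty search string; index() with "" is not useful here.
    [ -n "$1" ] || return 1
    awk -F'|' -v ch="$1" -v fld="$2" '
        BEGIN { print "count", "lineNum" }
        {
            s = $fld      # copy the text, so the record itself is never modified
            n = 0
            # index() finds the literal string ch; skip past each hit and count it.
            while ((i = index(s, ch)) > 0) {
                n++
                s = substr(s, i + length(ch))
            }
            print n "\t" NR
        }
    ' "$3"
}
```

For example, `count_char t 3 stores.dat` should print the same counts as the column 3 command above, and `count_char t 0 stores.dat` the per-line counts. Unlike gsub() on $fld, this version works on a copy of the field, which matters only if the record is printed afterwards.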


answered Oct 08 '22 02:10

jaypal singh

    grep -n -o "t" stores.dat | sort -n | uniq -c | cut -d : -f 1

gives almost exactly the output you want (only the header row is missing):

          4 1
          3 2
          6 3

Thanks to @raghav-bhushan for the grep -o hint; what a useful flag. It prints each match on its own line, and the -n flag prefixes each match with its line number.
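
The same idea can be pointed at a single field by cutting the column out first. The following is a rough sketch, not part of the answer above: the function name count_in_field_grep is hypothetical, the '|' separator is taken from the question, and grep -o is assumed to be available (GNU and BSD grep have it, but it is not in POSIX). The trailing awk only normalizes the padding that uniq -c adds, which differs between implementations.

```shell
#!/bin/sh
# count_in_field_grep CHAR FIELD FILE   (hypothetical helper name)
# Prints "count<TAB>lineNum" for each line whose FIELD contains CHAR.
count_in_field_grep() {
    cut -d'|' -f"$2" "$3" |        # keep only the requested column
        grep -n -o -F -e "$1" |    # one "lineNum:match" line per literal match
        cut -d: -f1 |              # keep just the line number
        uniq -c |                  # collapse repeats into "count lineNum"
        awk '{ print $1 "\t" $2 }' # strip the padding added by uniq -c
}
```

One difference from the awk answer: a line with no match produces no grep output at all, so it is missing from the result instead of being listed with a count of 0. With the sample file, column 2 therefore yields rows for lines 1 and 3 only.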

answered Oct 08 '22 04:10

Gabriel Burt
