counting the number of strings from text files

Question

I have a text file with 10 columns say f.txt which looks like below:

aab abb  263-455
aab abb  263-455
aab abb  263-455
bbb abb  26-455
bbb abb  26-455
bbb aka  264-266
bga bga  230-232
bga bga  230-232

I want to count the unique number of each string in the first and second columns based on the numbers of third column.

Output:

aab - 1
abb - 2
bbb - 2
aka - 1
bga - 2

Total no - 8

perreal · Accepted Answer

awk '
       !s[1":"$1":"$3]++{sU[$1]++;tot++} 
       !s[2":"$2":"$3]++{sU[$2]++;tot++} 
       END{
         for (x in sU) print x, sU[x]; 
         print "Total No -",tot;
       }' input

Output

bga 1
aab 1
bbb 2
aka 1
bga 1
abb 2
Total No - 8

Chris Seymour · Answer

This will do the trick:

$ awk '!a[$0]++{c[$1]++;c[$2]++}
       END{for(k in c){print k" - "c[k];s+=c[k]}print "
Total No -",s}' file
aka - 1
bga - 2
aab - 1
abb - 2
bbb - 2

Total No - 8

In the more readable script form:

!lines[$0]++{
    count[$1]++
    count[$2]++
}
END {
    for (line in count) {
        print line" - "count[line]
        sum += count[line]
    }
    print "
Total No -",sum
}

To run it in this form save it to a file script.awk and:

$ awk -f script.awk file
aka - 1
bga - 2
aab - 1
abb - 2
bbb - 2

Total No - 8

counting the number of strings from text files

Tags:

awk

user2416563

2 Answers

perreal

Chris Seymour

Recent Activity

Donate For Us

counting the number of strings from text files

Tags:

awk

user2416563

2 Answers

perreal

Chris Seymour

Related questions

Recent Activity

Donate For Us