I have a file (A.txt) with 4 columns of numbers and another file with 3 columns of numbers (B.txt). I need to solve the following problems:
Find all lines in A.txt whose 3rd column contains a number that appears anywhere in the 3rd column of B.txt.
Assume that I have many files like A.txt in a directory and I need to run this for every file in that directory.
How do I do this?
Re: "You should never see someone using grep and awk together..." I've got a series of syslog files in /var/log (some compressed). I need to match against the string "voltage" as a flag that further processing is required, but this string isn't always in the same field.
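One place that mix still earns its keep: zgrep reads both the plain and the gzip-compressed logs, and awk can then do whatever per-line processing follows. A minimal sketch, assuming the /var/log/syslog* naming and a placeholder awk action that just prints the syslog timestamp fields:
# zgrep handles plain and .gz files alike; "voltage" may sit in any field,
# so the grep side does the filtering and awk does the follow-up work.
zgrep -h 'voltage' /var/log/syslog* | awk '{print $1, $2, $3}'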
awk and sed are both incredibly powerful when combined, which you can do with Unix pipes; those are the "|" bits between commands. The grep command is used to find particular patterns in files and prints every line containing the search pattern. awk, on the other hand, also searches a file for certain patterns, but it goes further and performs an action on each match.
You should never see someone using grep and awk together, because whatever grep can do, you can also do in awk:
grep "foo" file.txt | awk '{print $1}'
awk '/foo/ {print $1}' file.txt
I had to get that off my chest. Now to your problem...
Awk is a programming language that assumes a single loop through all the lines in a set of files, and that is not what you want here. Instead, you want to treat B.txt as a special file and loop through your other files. That normally calls for something like Python or Perl. (Older versions of Bash didn't handle hashed-key arrays, so those versions of Bash won't work.) However, it looks like slitvinov has found an answer.
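As an aside to the Bash remark above: a newer Bash (version 4 or later) can do the same hashed-key lookup itself with an associative array. This is only a sketch, and it assumes the A*.txt naming used further down:
declare -A valid
# Remember every value that appears in the 3rd column of B.txt.
while read -r _ _ col3 _; do
    [[ -n $col3 ]] && valid[$col3]=1
done < B.txt
# Scan the remaining files and print each line whose 3rd column was seen above.
for f in A*.txt; do
    while IFS= read -r line; do
        read -r _ _ col3 _ <<< "$line"
        [[ -n $col3 && -n ${valid[$col3]+set} ]] && printf '%s\n' "$line"
    done < "$f"
done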
Here's a Perl solution anyway:
use strict;
use warnings;
use feature qw(say);
use autodie;

my $b_file = shift;
open my $b_fh, "<", $b_file;

#
# This tracks the values in "B"
#
my %valid_lines;
while ( my $line = <$b_fh> ) {
    chomp $line;
    my @array = split /\s+/, $line;
    $valid_lines{$array[2]} = 1;    # Third column
}
close $b_fh;

#
# This handles the rest of the files
#
while ( my $line = <> ) {           # The rest of the files
    chomp $line;
    my @array = split /\s+/, $line;
    next unless exists $valid_lines{$array[2]};    # Skip unless field #3 was in B.txt too
    say $line;
}
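Assuming the script above is saved under a hypothetical name such as filter.pl, B.txt has to be the first argument (it is consumed by the shift) and the remaining arguments are the files to filter:
# filter.pl is an assumed filename; B.txt must come first because the
# script shifts it off before reading the remaining files via <>.
perl filter.pl B.txt A*.txt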
Here is an example. Create the following files and run
awk -f c.awk B.txt A*.txt
c.awk
# While reading the first file on the command line (B.txt), FNR==NR holds;
# record every value in its 3rd column as a key of the array s.
FNR==NR {
    s[$3]
    next
}
# For every later file, print the filename and the whole line whenever its
# 3rd column was recorded from B.txt.
$3 in s {
    print FILENAME, $0
}
A1.txt
1 2 3
1 2 6
1 2 5
A2.txt
1 2 3
1 2 6
1 2 5
B.txt
1 2 3
1 2 5
2 1 8
The output should be:
A1.txt 1 2 3
A1.txt 1 2 5
A2.txt 1 2 3
A2.txt 1 2 5
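Since the question asks to run this against every file in a directory, the same invocation works with any glob; B.txt still has to come first so the FNR==NR block loads it before the data files are read (the /path/to/dir below is just a placeholder):
# B.txt first, then whatever files live in the directory of interest.
awk -f c.awk B.txt /path/to/dir/*.txt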