I need to process two file contents. I was wondering if we can pull it off using a single nawk statement. File A contents: <pre class="prettyprint"><code>AAAAAAAAAAAA 1 BBBBBBBBBBBB 2 CCCCCCCCCCCC 3 </code></pre> File B contents: <pre class="prettyprint"><code>XXXXXXXXXXX 3 YYYYYYYYYYY 2 ZZZZZZZZZZZ 1 </code></pre> I would like to compare if <code>$2</code> (2nd field ) in file A is the reverse of <code>$2</code> in file B. I was wondering how to write rules in nawk for multi-file processing ? How would we distinguish A's <code>$2</code> from B's <code>$2</code> EDIT: I need to compare $2 of A's first line (which is 1) with the $2 of B's last line (which is 1 again) .Then compare $2 of line 2 in A with $2 in NR-1 th line of B. And so on.....

You can do something like this - <pre class="prettyprint"><code>[jaypal:~/Temp] cat f1 AAAAAAAAAAAA 1 BBBBBBBBBBBB 2 CCCCCCCCCCCC 3 DDDDDDDDDDDD 4 [jaypal:~/Temp] cat f2 AAAAAAAAAAA 5 XXXXXXXXXXX 3 YYYYYYYYYYY 2 ZZZZZZZZZZZ 1 </code></pre> Solution: <pre class="prettyprint"><code>awk ' NR==FNR {a[i++]=$2; next} {print (a[--i] == $2 ? "Match " $2 FS a[i] : "Do not match " $2 FS a[i])}' FileB FileA Match 1 1 Match 2 2 Match 3 3 Do not match 4 5 </code></pre>

You can make <code>awk</code> process files serially, but you can't easily make it process two files in parallel. You probably can achieve the effect with careful use of <code>getline</code> but 'careful' is the operative term. I think in this case, with simple two-column files, I'd be inclined to use: <pre class="prettyprint"><code>paste "File A" "File B" | awk '{ process fields $1, $2 from File A and fields $3, $4 from file B }' </code></pre> You would need to make sure the two files are in the appropriate order, etc. If your input is more complex, then this may not work so well, though you can choose the character that separates the data from the two files with <code>paste -d'|' ...</code> to use a pipe to separate the two records, and <code>awk -F'|' '{ ... }'</code> to read <code>$1</code> as the info from File A and <code>$2</code> as the info from File B.

Is Awk and multiple file processing possible?

Tags:

unix

file-io

ksh

awk

nawk

I need to process two file contents. I was wondering if we can pull it off using a single nawk statement.

File A contents:

Click to copy

AAAAAAAAAAAA  1
BBBBBBBBBBBB  2
CCCCCCCCCCCC  3

File B contents:

Click to copy

XXXXXXXXXXX  3
YYYYYYYYYYY  2
ZZZZZZZZZZZ  1

I would like to compare if $2 (2nd field ) in file A is the reverse of $2 in file B. I was wondering how to write rules in nawk for multi-file processing ? How would we distinguish A's $2 from B's $2

EDIT: I need to compare $2 of A's first line (which is 1) with the $2 of B's last line (which is 1 again) .Then compare $2 of line 2 in A with $2 in NR-1 th line of B. And so on.....

511

asked Dec 14 '11 07:12

tomkaith13

2 Answers

You can do something like this -

Click to copy

[jaypal:~/Temp] cat f1
AAAAAAAAAAAA  1
BBBBBBBBBBBB  2
CCCCCCCCCCCC  3
DDDDDDDDDDDD  4

[jaypal:~/Temp] cat f2
AAAAAAAAAAA  5
XXXXXXXXXXX  3
YYYYYYYYYYY  2
ZZZZZZZZZZZ  1

Solution:

Click to copy

awk '
NR==FNR {a[i++]=$2; next}
{print (a[--i] == $2 ? "Match " $2 FS a[i] : "Do not match " $2 FS a[i])}' FileB FileA
Match 1 1
Match 2 2
Match 3 3
Do not match 4 5

129

answered Nov 27 '22 10:11

jaypal singh

You can make awk process files serially, but you can't easily make it process two files in parallel. You probably can achieve the effect with careful use of getline but 'careful' is the operative term.

I think in this case, with simple two-column files, I'd be inclined to use:

Click to copy

paste "File A" "File B" |
awk '{ process fields $1, $2 from File A and fields $3, $4 from file B }'

You would need to make sure the two files are in the appropriate order, etc.

If your input is more complex, then this may not work so well, though you can choose the character that separates the data from the two files with paste -d'|' ... to use a pipe to separate the two records, and awk -F'|' '{ ... }' to read $1 as the info from File A and $2 as the info from File B.

answered Nov 27 '22 09:11

Jonathan Leffler

Related questions
                            
                                Set Cronjob to Run Every 5 Minutes From 9:30am to 4:00pm
                            
                                'pdfseparate': Format output file name as page number with leading zeroes
                            
                                Getting `Cannot mix POST with other methods` error when using `ab -p`
                            
                                How do I suspend and resume a sequence of commands in Bash?
                            
                                Passing string as an argument in C
                            
                                Enabling curl in php5
                            
                                How to extract files inside a directory from a tar file in terminal?
                            
                                Do some programs not accept process substitution for input files?
                            
                                Programmatically enable/disable UNIX network interface
                            
                                Split delimited file into smaller files by column
                            
                                UNIX evaluate expression from a variable
                            
                                Printing my Mac's serial number in java using Unix commands
                            
                                detect memory leak with htop
                            
                                Alternative ways to issue multiple commands on a remote machine using SSH?
                            
                                How to push (i.e. flush) data sent to a TCP stream
                            
                                How can i see Timestamps in Unix files [closed]
                            
                                ordering in bash "for" loop
                            
                                Using quotes and double quotes in Java Runtime.getRuntime().exec(...)
                            
                                Java - how to check whether another (non-Java) process is running on Linux
                            
                                Find directories created less than a week ago

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Is Awk and multiple file processing possible?

Tags:

unix

file-io

ksh

awk

nawk

tomkaith13

People also ask

2 Answers

jaypal singh

Jonathan Leffler

Recent Activity

Donate For Us