Suppose I have a file <code>input.txt</code> with few columns and few rows, the first column is the key, and a directory <code>dir</code> with files which contain some of these keys. I want to find all lines in the files in <code>dir</code> which contain these key words. At first I tried to run the command <pre class="prettyprint"><code>cat input.txt | awk '{print $1}' | xargs grep dir </code></pre> This doesn't work because it thinks the keys are paths on my file system. Next I tried something like <pre class="prettyprint"><code>cat input.txt | awk '{system("grep -rn dir $1")}' </code></pre> But this didn't work either, eventually I have to admit that even this doesn't work <pre class="prettyprint"><code>cat input.txt | awk '{system("echo $1")}' </code></pre> After I tried to use <code>\</code> to escape the white space and the <code>$</code> sign, I came here to ask for your advice, any ideas? Of course I can do something like <pre class="prettyprint"><code>for x in `cat input.txt` ; do grep -rn $x dir ; done </code></pre> This is not good enough, because it takes two commands, but I want only one. This also shows why <code>xargs</code> doesn't work, the parameter is not the last argument

You don't need <code>grep</code> with <code>awk</code>, and you don't need <code>cat</code> to open files: <pre class="prettyprint"><code>awk 'NR==FNR{keys[$1]; next} {for (key in keys) if ($0 ~ key) {print FILENAME, $0; next} }' input.txt dir/* </code></pre> Nor do you need xargs, or shell loops or anything else - just one simple awk command does it all. If input.txt is not a file, then tweak the above to: <pre class="prettyprint"><code>real_input_generating_command | awk 'NR==FNR{keys[$1]; next} {for (key in keys) if ($0 ~ key) {print FILENAME, $0; next} }' - dir/* </code></pre> All it's doing is creating an array of keys from the first file (or input stream) and then looking for each key from that array in every file in the dir directory.

Try following <pre class="prettyprint"><code>awk '{print $1}' input.txt | xargs -n 1 -I pattern grep -rn pattern dir </code></pre>

First thing you should do is research this. Next ... you don't need to grep inside awk. That's completely redundant. It's like ... stuffing your turkey with .. a turkey. Awk can process input and do "grep" like things itself, without the need to launch the grep command. But you don't even need to do this. Adapting your first example: <pre class="prettyprint"><code>awk '{print $1}' input.txt | xargs -n 1 -I % grep % dir </code></pre> This uses xargs' <code>-I</code> option to put xargs' input into a different place on the command line it runs. In FreeBSD or OSX, you would use a <code>-J</code> option instead. But I prefer your for loop idea, converted into a while loop: <pre class="prettyprint"><code>while read key junk; do grep -rn "$key" dir ; done < input.txt </code></pre>

How to run grep inside awk?

Tags:

linux

grep

bash

awk

Suppose I have a file input.txt with few columns and few rows, the first column is the key, and a directory dir with files which contain some of these keys. I want to find all lines in the files in dir which contain these key words. At first I tried to run the command

cat input.txt | awk '{print $1}' | xargs grep dir

This doesn't work because it thinks the keys are paths on my file system. Next I tried something like

cat input.txt | awk '{system("grep -rn dir $1")}'

But this didn't work either, eventually I have to admit that even this doesn't work

cat input.txt | awk '{system("echo $1")}'

After I tried to use \ to escape the white space and the $ sign, I came here to ask for your advice, any ideas?

Of course I can do something like

for x in `cat input.txt` ; do grep -rn $x dir ; done

This is not good enough, because it takes two commands, but I want only one. This also shows why xargs doesn't work, the parameter is not the last argument

787

asked Nov 19 '13 19:11

e271p314

3 Answers

You don't need grep with awk, and you don't need cat to open files:

awk 'NR==FNR{keys[$1]; next} {for (key in keys) if ($0 ~ key) {print FILENAME, $0; next} }' input.txt dir/*

Nor do you need xargs, or shell loops or anything else - just one simple awk command does it all.

If input.txt is not a file, then tweak the above to:

real_input_generating_command |
awk 'NR==FNR{keys[$1]; next} {for (key in keys) if ($0 ~ key) {print FILENAME, $0; next} }' - dir/*

All it's doing is creating an array of keys from the first file (or input stream) and then looking for each key from that array in every file in the dir directory.

127

answered Sep 20 '22 18:09

Ed Morton

Try following

awk '{print $1}' input.txt | xargs -n 1 -I pattern grep -rn pattern dir

answered Sep 17 '22 18:09

jkshah

First thing you should do is research this.

Next ... you don't need to grep inside awk. That's completely redundant. It's like ... stuffing your turkey with .. a turkey.

Awk can process input and do "grep" like things itself, without the need to launch the grep command. But you don't even need to do this. Adapting your first example:

awk '{print $1}' input.txt | xargs -n 1 -I % grep % dir

This uses xargs' -I option to put xargs' input into a different place on the command line it runs. In FreeBSD or OSX, you would use a -J option instead.

But I prefer your for loop idea, converted into a while loop:

while read key junk; do grep -rn "$key" dir ; done < input.txt

answered Sep 21 '22 18:09

ghoti

Related questions
                            
                                The stock item was unable to be saved. Please try again. Magento 2.4.0
                            
                                Email contact form without PHP
                            
                                docker mounting volume with permission denied
                            
                                Can I change the order of the output fields from the Linux cut command? [duplicate]
                            
                                Which POSIX flavor of regex does Perl use?
                            
                                Difference between the address space of parent process and its child process in Linux?
                            
                                How to remove all files NOT ending with certain formats?
                            
                                Installing Maven 3.0.5 in RedHat Linux
                            
                                curl response says "HTTP version not supported", error 505
                            
                                Check the output of "make" and exit bash script if it fails
                            
                                Learning Perl, but how do I get 5.14 on Windows?
                            
                                Eclipse ADT Unexpected exception 'Cannot run program'
                            
                                Does Linux Bash have a do-while loop? [duplicate]
                            
                                Update python on linux 2.7 to 3.5
                            
                                what is default path for header file included in c program?
                            
                                Async connect and disconnect with epoll (Linux)
                            
                                "No Connected Devices", trying to connect my LG to my Ubuntu machine
                            
                                Execute program from within a C program
                            
                                SVN - How to upload a single file?
                            
                                Trying to delete non-ASCII characters only [duplicate]

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With