grep patterns from a file, print the pattern instead of the matched string

I want to grep with patterns from a file that contains regexes. When a pattern matches, grep prints the matched string but not the pattern. How can I get the pattern instead of the matched string?

pattern.txt

Apple (Ball|chocolate|fall) Donut
donut (apple|ball) Chocolate
Donut Gorilla Chocolate
Chocolate (English|Fall) apple gorilla
gorilla chocolate (apple|ball)
(ball|donut) apple

strings.txt

apple ball Donut
donut ball chocolate
donut Ball Chocolate
apple donut
chocolate ball Apple

This is the grep command:

grep -Eix -f pattern.txt strings.txt

This command prints the matched strings from strings.txt:

apple ball Donut
donut ball chocolate
donut Ball Chocolate

But I want to find which patterns from pattern.txt produced the matches:

Apple (Ball|chocolate|fall) Donut
donut (apple|ball) Chocolate

The lines in pattern.txt can be lowercase or uppercase, with or without regex, and can contain any number of words and regex elements. The only regex syntax used is parentheses and the pipe.

I don't want to use a loop that reads pattern.txt line by line and runs grep for each pattern, as it's slow. Is there a way to make grep print which pattern, or which line number of the pattern file, matched? Or can any command other than grep do the job without being too slow?

796

asked Aug 13 '18 12:08

haru

1 Answer

I have no idea how to do it with grep, but with GNU awk:

$ awk '
BEGIN { IGNORECASE = 1 }          # GNU awk only: case-insensitive matching
NR==FNR {                         # process pattern file
    a[$0]                         # hash the entries to a
    next                          # process next line
}
{                                 # process strings file
    for(i in a)                   # loop all pattern file entries
        if($0 ~ "^(" i ")$") {    # whole-line match, like grep -x
            print i               # output the matching pattern file entry
            # delete a[i]         # uncomment to delete matched patterns from a
            # next                # uncomment to end searching after first match
        }
}' pattern.txt strings.txt

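With the pattern.txt and strings.txt from the question, this outputs:

Apple (Ball|chocolate|fall) Donut
donut (apple|ball) Chocolate
donut (apple|ball) Chocolate

For each line in strings.txt, the script loops over every pattern, so it also finds cases where more than one pattern matches the same line. The second pattern is printed twice because it matches two lines of strings.txt. Without GNU awk's IGNORECASE the matching is case-sensitive, and none of these lines would match.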
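The question also asks for the line number of the matching pattern. Here is a minimal sketch of that, assuming the patterns use only parentheses and pipes as the question states. It lowercases both sides with tolower() instead of relying on GNU awk's IGNORECASE, so it should also run on other awks. The function name report_matches and the output format are made up for illustration.

```shell
#!/bin/sh
# Recreate the question's sample files in a scratch directory.
dir=$(mktemp -d)
cat > "$dir/pattern.txt" <<'EOF'
Apple (Ball|chocolate|fall) Donut
donut (apple|ball) Chocolate
Donut Gorilla Chocolate
Chocolate (English|Fall) apple gorilla
gorilla chocolate (apple|ball)
(ball|donut) apple
EOF
cat > "$dir/strings.txt" <<'EOF'
apple ball Donut
donut ball chocolate
donut Ball Chocolate
apple donut
chocolate ball Apple
EOF

# report_matches is a hypothetical helper name, not a grep or awk feature.
# Usage: report_matches PATTERN_FILE STRINGS_FILE
report_matches() {
    awk '
    NR == FNR {                            # first file: the patterns
        pat[FNR] = $0                      # original text, keyed by line number
        re[FNR]  = "^(" tolower($0) ")$"   # lowercased and anchored to the whole line
        n = FNR                            # remember how many patterns were read
        next
    }
    {                                      # second file: the strings
        line = tolower($0)
        for (i = 1; i <= n; i++)           # numeric loop keeps pattern-file order
            if (line ~ re[i])
                print i ": " pat[i] " <- " $0
    }' "$1" "$2"
}

result=$(report_matches "$dir/pattern.txt" "$dir/strings.txt")
printf '%s\n' "$result"
rm -rf "$dir"
```

Lowercasing the pattern is only safe because the patterns contain no escapes or bracket expressions. With richer regexes, IGNORECASE in GNU awk is the safer choice.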

Also, if you want each matching pattern file entry to be output only once, you could delete it from a after its first match: uncomment delete a[i] after the print. That might also give you some performance advantage.
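The delete idea can be sketched as a standalone script, assuming the sample files from the question. This variant uses tolower() rather than IGNORECASE and a numeric loop rather than for (i in a), so each pattern is printed once, in the order it is first matched. The variable name used is chosen only for this example.

```shell
#!/bin/sh
# Recreate the question's sample files in a scratch directory.
dir=$(mktemp -d)
cat > "$dir/pattern.txt" <<'EOF'
Apple (Ball|chocolate|fall) Donut
donut (apple|ball) Chocolate
Donut Gorilla Chocolate
Chocolate (English|Fall) apple gorilla
gorilla chocolate (apple|ball)
(ball|donut) apple
EOF
cat > "$dir/strings.txt" <<'EOF'
apple ball Donut
donut ball chocolate
donut Ball Chocolate
apple donut
chocolate ball Apple
EOF

# Print every pattern that matches at least one whole line, once each.
used=$(awk '
NR == FNR {                                # first file: the patterns
    pat[FNR] = $0                          # original text for output
    re[FNR]  = "^(" tolower($0) ")$"       # lowercased and anchored to the whole line
    n = FNR
    next
}
{                                          # second file: the strings
    line = tolower($0)
    for (i = 1; i <= n; i++)
        if ((i in re) && line ~ re[i]) {   # skip patterns already deleted
            print pat[i]
            delete re[i]                   # never test or print this pattern again
        }
}' "$dir/pattern.txt" "$dir/strings.txt")

printf '%s\n' "$used"
rm -rf "$dir"
```

Because matched patterns are removed from re, later lines of strings.txt are tested against fewer regexes. That is where the possible speedup comes from.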

121

answered Oct 05 '22 10:10

James Brown

Tags: grep, bash, awk
