Read line by line and print matches line by line

Tags:

I am new to shell scripting, it would be great if I can get some help with the question below.

I want to read a text file line by line, and print all matched patterns in that line to a line in a new text file.

For example:

$ cat input.txt

SYSTEM ERROR: EU-1C0A  Report error -- SYSTEM ERROR: TM-0401 DEFAULT Test error
SYSTEM ERROR: MG-7688 DEFAULT error -- SYSTEM ERROR: DN-0A00 Error while getting object -- ERROR: DN-0A52 DEFAULT Error -- ERROR: MG-3218 error occured in HSSL
SYSTEM ERROR: DN-0A00 Error while getting object -- ERROR: DN-0A52 DEFAULT Error
SYSTEM ERROR: EU-1C0A  error Failed to fill in test report -- ERROR: MG-7688

The intended output is as follows:

$ cat output.txt

EU-1C0A TM-0401
MG-7688 DN-0A00 DN-0A52 MG-3218
DN-0A00 DN-0A52
EU-1C0A MG-7688

I tried the following code:

while read p; do
    grep -o '[A-Z]\{2\}-[A-Z0-9]\{4\}' | xargs
done < input.txt > output.txt

which produced this output:

EU-1C0A TM-0401 MG-7688 DN-0A00 DN-0A52 MG-3218 DN-0A00 DN-0A52 EU-1C0A MG-7688 .......

Then I also tried this:

while read p; do
    grep -o '[A-Z]\{2\}-[A-Z0-9]\{4\}' | xargs > output.txt
done < input.txt

But did not help :(

Maybe there is another way, I am open to awk/sed/cut or whatever... :)

Note: There can be any number of Error codes (i.e. XX:XXXX, the pattern of interest in a single line).

360

asked Dec 09 '16 19:12

Dinesh Kumar

1 Answers

% awk 'BEGIN{RS=": "};NR>1{printf "%s%s", $1, ($0~/\n/)?"\n":" "}' input.txt 
EU-1C0A TM-0401
MG-7688 DN-0A00 DN-0A52 MG-3218
DN-0A00 DN-0A52
EU-1C0A MG-7688

Explanation in longform:

awk '
    BEGIN{ RS=": " } # Set the record separator to colon-space
    NR>1 {           # Ignore the first record
        printf("%s%s", # Print two strings:
            $1,      # 1. first field of the record (`$1`)
            ($0~/\n/) ? "\n" : " ")
                     # Ternary expression, read as `if condition (thing
                     # between brackets), then thing after `?`, otherwise
                     # thing after `:`.
                     # So: If the record ($0) matches (`~`) newline (`\n`),
                     # then put a newline. Otherwise, put a space.
    }
' input.txt

Previous answer to the unmodified question:

% awk 'BEGIN{RS=": "};NR>1{printf "%s%s", $1, (NR%2==1)?"\n":" "}' input.txt 
EU-1C0A TM-0401
MG-7688 MG-3218
DN-0A00 DN-0A52
EU-1C0A MG-7688

edit: With safeguard against :-injection (thx @e0k). Tests that the first field after the record seperator looks like how we expect it to be.

awk 'BEGIN{RS=": "};NR>1 && $1 ~ /^[A-Z]{2}-[A-Z0-9]{4}$/ {printf "%s%s", $1, ($0~/\n/)?"\n":" "}' input.txt

133

answered Oct 15 '22 02:10

joepd

Related questions
                            
                                /dev/random returning always the same sequence
                            
                                How to reset emacs to save files in utf-8-unix character encoding?
                            
                                Running a node.js script every 10 seconds
                            
                                linux ami nginx sites_enabled missing [closed]
                            
                                Finding memory usage of a process in Linux [closed]
                            
                                Does mmap allocate a page or part of a page?
                            
                                Bash script commands not working in cron
                            
                                Launching a shell script when Button Pressed on GUI made with Qt
                            
                                Looping dots until a process is complete? Bash
                            
                                How can i find the location of installed software in linux?
                            
                                Error Loading Shared Library (glew)
                            
                                Docker port isn't accessible from host
                            
                                Change the default find-grep command in emacs
                            
                                How to list all of the .bb and .bbappend files used to build a specific package with bitbake?
                            
                                Append new line in all files of the folder
                            
                                diff two rpms? -- linux
                            
                                How to 'read -s' in shell?
                            
                                Recursively list files from a given directory in Bash
                            
                                How to remove special characters in file names?
                            
                                error: 'strdup' was not declared in this scope

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Read line by line and print matches line by line

Tags:

linux

grep

bash

shell

text-processing

Dinesh Kumar

People also ask

1 Answers

joepd

Recent Activity

Donate For Us