I'm trying to delete two lines either side of a pattern match from a file full of transactions. Ie. find the match then delete two lines before it, then delete two lines after it and then delete the match. The write this back to the original file. So the input data is <pre class="prettyprint"><code>D28/10/2011 T-3.48 PINITIAL BALANCE M ^ </code></pre> and my pattern is <pre class="prettyprint"><code>sed -i '/PINITIAL BALANCE/,+2d' test.txt </code></pre> However this is only deleting two lines after the pattern match and then deleting the pattern match. I can't work out any logical way to delete all 5 lines of data from the original file using sed.

an awk one-liner may do the job: <pre class="prettyprint"><code>awk '/PINITIAL BALANCE/{for(x=NR-2;x<=NR+2;x++)d[x];}{a[NR]=$0}END{for(i=1;i<=NR;i++)if(!(i in d))print a[i]}' file </code></pre> test: <pre class="prettyprint"><code>kent$ cat file ###### foo D28/10/2011 T-3.48 PINITIAL BALANCE M x bar ###### this line will be kept here comes PINITIAL BALANCE again blah this line will be kept too ######## kent$ awk '/PINITIAL BALANCE/{for(x=NR-2;x<=NR+2;x++)d[x];}{a[NR]=$0}END{for(i=1;i<=NR;i++)if(!(i in d))print a[i]}' file ###### foo bar ###### this line will be kept this line will be kept too ######## </code></pre> add some explanation <pre class="prettyprint"><code> awk '/PINITIAL BALANCE/{for(x=NR-2;x<=NR+2;x++)d[x];} #if match found, add the line and +- 2 lines' line number in an array "d" {a[NR]=$0} # save all lines in an array with line number as index END{for(i=1;i<=NR;i++)if(!(i in d))print a[i]}' #finally print only those index not in array "d" file # your input file </code></pre>

<code>sed</code> will do it: <pre class="prettyprint"><code>sed '/\n/!N;/\n.*\n/!N;/\n.*\n.*PINITIAL BALANCE/{$d;N;N;d};P;D' </code></pre> It works this way: <ul> <li>if sed has only one string in pattern space it joins another one</li> <li>if there are only two it joins the third one</li> <li>if it does natch to pattern LINE + LINE + LINE with BALANCE it joins two following strings, deletes them and goes at the beginning </li> <li>if not, it prints the first string from pattern and deletes it and goes at the beginning without swiping the pattern space</li> </ul> To prevent the appearance of pattern on the first string you should modify the script: <pre class="prettyprint"><code>sed '1{/PINITIAL BALANCE/{N;N;d}};/\n/!N;/\n.*\n/!N;/\n.*\n.*PINITIAL BALANCE/{$d;N;N;d};P;D' </code></pre> However, it fails in case you have another <code>PINITIAL BALANCE</code> in string which are going to be deleted. However, other solutions fails too =)

Delete lines before and after a match in bash (with sed or awk)?

Tags:

shell

sed

awk

I'm trying to delete two lines either side of a pattern match from a file full of transactions. Ie. find the match then delete two lines before it, then delete two lines after it and then delete the match. The write this back to the original file.

So the input data is

D28/10/2011
T-3.48
PINITIAL BALANCE
M
^

and my pattern is

sed -i '/PINITIAL BALANCE/,+2d' test.txt

However this is only deleting two lines after the pattern match and then deleting the pattern match. I can't work out any logical way to delete all 5 lines of data from the original file using sed.

604

asked Aug 03 '12 10:08

juliushibert

3 Answers

an awk one-liner may do the job:

awk '/PINITIAL BALANCE/{for(x=NR-2;x<=NR+2;x++)d[x];}{a[NR]=$0}END{for(i=1;i<=NR;i++)if(!(i in d))print a[i]}' file

test:

kent$  cat file
######
foo
D28/10/2011
T-3.48
PINITIAL BALANCE
M
x
bar
######
this line will be kept
here
comes
PINITIAL BALANCE
again
blah
this line will be kept too
########

kent$  awk '/PINITIAL BALANCE/{for(x=NR-2;x<=NR+2;x++)d[x];}{a[NR]=$0}END{for(i=1;i<=NR;i++)if(!(i in d))print a[i]}' file
######
foo
bar
######
this line will be kept
this line will be kept too
########

add some explanation

  awk '/PINITIAL BALANCE/{for(x=NR-2;x<=NR+2;x++)d[x];}   #if match found, add the line and +- 2 lines' line number in an array "d"
      {a[NR]=$0} # save all lines in an array with line number as index
      END{for(i=1;i<=NR;i++)if(!(i in d))print a[i]}' #finally print only those index not in array "d"
     file  # your input file

120

answered Oct 11 '22 13:10

Kent

sed will do it:

sed '/\n/!N;/\n.*\n/!N;/\n.*\n.*PINITIAL BALANCE/{$d;N;N;d};P;D'

It works this way:

if sed has only one string in pattern space it joins another one
if there are only two it joins the third one
if it does natch to pattern LINE + LINE + LINE with BALANCE it joins two following strings, deletes them and goes at the beginning
if not, it prints the first string from pattern and deletes it and goes at the beginning without swiping the pattern space

To prevent the appearance of pattern on the first string you should modify the script:

sed '1{/PINITIAL BALANCE/{N;N;d}};/\n/!N;/\n.*\n/!N;/\n.*\n.*PINITIAL BALANCE/{$d;N;N;d};P;D'

However, it fails in case you have another PINITIAL BALANCE in string which are going to be deleted. However, other solutions fails too =)

answered Oct 11 '22 13:10

rush

For such a task, I would probably reach for a more advanced tool like Perl:

perl -ne 'push @x, $_;
          if (@x > 4) {
              if ($x[2] =~ /PINITIAL BALANCE/) { undef @x }
                  else { print shift @x }
          }
          END { print @x }' input-file > output-file

This will remove 5 lines from the input file. These lines will be the 2 lines before the match, the matched line, and the two lines afterwards. You can change the total number of lines being removed modifying @x > 4 (this removes 5 lines) and the line being matched modifying $x[2] (this makes the match on the third line to be removed and so removes the two lines before the match).

answered Oct 11 '22 13:10

choroba

Related questions
                            
                                Bash script - determine vendor and install system (apt-get, yum etc)
                            
                                List the first few lines of every file in a directory
                            
                                Are Unix/Linux pipes producer or consumer driven?
                            
                                How to measure time from adb shell with milliseconds resolution?
                            
                                Difference between braces {} and brackets () in shell scripting
                            
                                Why cannot I define an empty function in shell?
                            
                                How to remove files without certain extension?
                            
                                What is the difference between base64 and MIME base 64? [closed]
                            
                                How to safely escape a string from C++
                            
                                Python: Persistent shell variables in subprocess
                            
                                Python/Django shell won't start
                            
                                KornShell - Set "-x" (debug) flag globally?
                            
                                Using sed to search and replace an ip address in a file
                            
                                How to compare in shell script?
                            
                                Linux Script- Date Manipulations
                            
                                how to source a shell script [environment variables] in perl script without forking a subshell?
                            
                                Is there any mechanism in Shell script alike "include guard" in C++?
                            
                                redirecting output to a file in C
                            
                                Bash parameter quotes and eval
                            
                                "more" command alternative that does support colors? [closed]

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Delete lines before and after a match in bash (with sed or awk)?

Tags:

shell

sed

awk

juliushibert

People also ask

3 Answers

Kent

rush

choroba

Recent Activity

Donate For Us