I want a unix command to find the lines between the first and last occurrence of a word. For example, imagine we have 1000 lines. The tenth line contains the word "stackoverflow", and the thirty-fifth line also contains the word "stackoverflow". I want to print lines 10 through 35 and write them to a new file.

Unix command to get the lines between the first and last occurrence of a word and write them to a file

2 Answers

You can do it in two steps. The basic idea is to:

1) get the line numbers of the first and last match.

2) print the lines in that range.

$ read first last <<< $(grep -n stackoverflow your_file | awk -F: 'NR==1 {printf "%d ", $1}; END{print $1}')
$ awk -v f=$first -v l=$last 'NR>=f && NR<=l' your_file

Explanation

read first last reads two values and stores them in $first and $last.
grep -n stackoverflow your_file prints each matching line prefixed with its line number, like this: line_number:line_content
awk -F: 'NR==1 {printf "%d ", $1}; END{print $1}' prints the line numbers of the first and last match of stackoverflow in the file.

And

awk -v f=$first -v l=$last 'NR>=f && NR<=l' your_file prints all lines from line number $first through line number $last.
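
Since the question also asks for the result to be written to a new file, the two steps above can be wrapped as a minimal sketch. The function name print_between and its three arguments are made up for this illustration and are not part of the answer; it also assumes the word is a pattern grep accepts and stops early when there is no match.

```shell
# Hypothetical helper: print_between WORD INFILE OUTFILE
# Writes the lines from the first to the last match of WORD to OUTFILE.
# Variables are plain globals (no `local`) to stay close to POSIX sh.
print_between() {
    # grep -n prefixes every match with "LINE:"; awk keeps the first and last number.
    range=$(grep -n -- "$1" "$2" |
        awk -F: 'NR==1 {first=$1} {last=$1} END {if (NR) print first, last}')
    # No match at all: report failure and leave OUTFILE untouched.
    [ -n "$range" ] || return 1
    first=${range% *}   # text before the space
    last=${range#* }    # text after the space
    awk -v f="$first" -v l="$last" 'NR>=f && NR<=l' "$2" > "$3"
}

# Usage: print_between stackoverflow your_file new_file
```

Quoting "$first" and "$last" is not strictly needed for plain numbers, but it keeps the awk call well formed if either value is ever empty.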

Test

$ cat a
here we
have some text
stackoverflow

and other things
bla
bla
bla bla
stackoverflow
and whatever else
stackoverflow
to make more fun
blablabla

$ read first last <<< $(grep -n stackoverflow a | awk -F: 'NR==1 {printf "%d ", $1}; END{print $1}')
$ awk -v f=$first -v l=$last 'NR>=f && NR<=l' a
stackoverflow

and other things
bla
bla
bla bla
stackoverflow
and whatever else
stackoverflow

By steps:

$ grep -n stackoverflow a
3:stackoverflow
9:stackoverflow
11:stackoverflow

$ grep -n stackoverflow a | awk -F: 'NR==1 {printf "%d ", $1}; END{print $1}'
3 11

$ read first last <<< $(grep -n stackoverflow a | awk -F: 'NR==1 {printf "%d ", $1}; END{print $1}')

$ echo "first=$first, last=$last"
first=3, last=11

answered Sep 22 '22 20:09

fedorqui 'SO stop harming'

If you know an upper bound of how many lines there can be (say, a million), then you can use this simple abusive script:

(grep -A 1000000 stackoverflow | grep -B 1000000 stackoverflow) < file

You can append | tail -n +2 | head -n -1 to strip the border lines as well:

(grep -A 1000000 stackoverflow | grep -B 1000000 stackoverflow |
  tail -n +2 | head -n -1) < file
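
If no upper bound is known, a different technique can be swapped in: let awk read the file twice, once to find the first and last match and once to print the range. This is only a sketch and not the approach of the answer above. The function name between_matches and its optional third argument are hypothetical, and it matches the word as a fixed substring via index() rather than as a grep pattern.

```shell
# Hypothetical helper: between_matches WORD FILE [STRIP]
# STRIP=1 drops the two border lines (the first and last match); default is 0.
# Note: awk interprets backslash escapes in -v values, so WORD should not contain backslashes.
between_matches() {
    awk -v w="$1" -v strip="${3:-0}" '
        # First pass (NR == FNR): remember the first and last line containing the word.
        NR == FNR { if (index($0, w)) { if (!first) first = FNR; last = FNR } next }
        # Second pass: print the range, optionally without its border lines.
        first && FNR >= first + strip && FNR <= last - strip
    ' "$2" "$2"
}

# Usage: between_matches stackoverflow file > new_file
```

The file is passed twice on purpose; this needs a regular file, so it will not work on piped input the way the grep pipeline does.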

answered Sep 23 '22 20:09

Alfe

Tags: grep, bash, shell, unix

Krishna
