I have a file that contains a list of URLs. It looks like below: file1: <pre class="prettyprint"><code>http://www.google.com http://www.bing.com http://www.yahoo.com http://www.baidu.com http://www.yandex.com .... </code></pre> I want to get all the records after: http://www.yahoo.com, results looks like below: file2: <pre class="prettyprint"><code>http://www.baidu.com http://www.yandex.com .... </code></pre> I know that I could use grep to find the line number of where yahoo.com lies using <pre class="prettyprint"><code>grep -n 'http://www.yahoo.com' file1 3 http://www.yahoo.com </code></pre> But I don't know how to get the file after line number 3. Also, I know there is a flag in grep -A print the lines after your match. However, you need to specify how many lines you want after the match. I am wondering is there something to get around that issue. Like: <pre class="prettyprint"><code>Pseudocode: grep -n 'http://www.yahoo.com' -A all file1 > file2 </code></pre> I know we could use the line number I got and <code>wc -l</code> to get the number of lines after yahoo.com, however... it feels pretty lame.

<h3>AWK</h3> If you don't mind using AWK: <pre class="prettyprint"><code>awk '/yahoo/{y=1;next}y' data.txt </code></pre> This script has two parts: <pre class="prettyprint"><code>/yahoo/ { y = 1; next } y </code></pre> The first part states that if we encounter a line with yahoo, we set the variable y=1, and then skip that line (the <code>next</code> command will jump to the next line, thus skip any further processing on the current line). Without the <code>next</code> command, the line yahoo will be printed. The second part is a short hand for: <pre class="prettyprint"><code>y != 0 { print } </code></pre> Which means, for each line, if variable y is non-zero, we print that line. In AWK, if you refer to a variable, that variable will be created and is either zero or empty string, depending on context. Before encounter yahoo, variable y is 0, so the script does not print anything. After encounter yahoo, y is 1, so every line after that will be printed. <h3>Sed</h3> Or, using sed, the following will delete everything up to and including the line with yahoo: <pre class="prettyprint"><code>sed '1,/yahoo/d' data.txt </code></pre>

'grep +A': print everything after a match [duplicate]

Tags:

grep

bash

sed

awk

I have a file that contains a list of URLs. It looks like below:

file1:

http://www.google.com http://www.bing.com http://www.yahoo.com http://www.baidu.com http://www.yandex.com ....

I want to get all the records after: http://www.yahoo.com, results looks like below:

file2:

http://www.baidu.com http://www.yandex.com ....

I know that I could use grep to find the line number of where yahoo.com lies using

grep -n 'http://www.yahoo.com' file1  3 http://www.yahoo.com

But I don't know how to get the file after line number 3. Also, I know there is a flag in grep -A print the lines after your match. However, you need to specify how many lines you want after the match. I am wondering is there something to get around that issue. Like:

Pseudocode:  grep -n 'http://www.yahoo.com' -A all file1 > file2

I know we could use the line number I got and wc -l to get the number of lines after yahoo.com, however... it feels pretty lame.

298

asked Aug 10 '13 21:08

B.Mr.W.

1 Answers

AWK

If you don't mind using AWK:

awk '/yahoo/{y=1;next}y' data.txt

This script has two parts:

/yahoo/ { y = 1; next } y

The first part states that if we encounter a line with yahoo, we set the variable y=1, and then skip that line (the next command will jump to the next line, thus skip any further processing on the current line). Without the next command, the line yahoo will be printed.

The second part is a short hand for:

y != 0 { print }

Which means, for each line, if variable y is non-zero, we print that line. In AWK, if you refer to a variable, that variable will be created and is either zero or empty string, depending on context. Before encounter yahoo, variable y is 0, so the script does not print anything. After encounter yahoo, y is 1, so every line after that will be printed.

Sed

Or, using sed, the following will delete everything up to and including the line with yahoo:

sed '1,/yahoo/d' data.txt

190

answered Oct 03 '22 10:10

Hai Vu

Related questions
                            
                                Redirecting command output to a variable in bash fails
                            
                                How do I capture the output from the ls or find command to store all file names in an array?
                            
                                How can I sort file names by version numbers?
                            
                                Looping through all files in a directory [duplicate]
                            
                                How can I get the variable value inside the EOF tags?
                            
                                How to use sed to replace regex capture group?
                            
                                Custom Bash prompt is overwriting itself
                            
                                Unix: merge many files, while deleting first line of all files
                            
                                How to iterate over an array using indirect reference?
                            
                                What's the difference between ln -s and alias?
                            
                                How to check the checksum through commandline?
                            
                                Find and Replace string in all files recursive using grep and sed [duplicate]
                            
                                Is there any mutex/semaphore mechanism in shell scripts?
                            
                                How to set 4 space tab in bash
                            
                                Integer expression expected error in shell script
                            
                                How to add a new line in the bash string? [duplicate]
                            
                                Can I use shell wildcards to select filenames ranging across double-digit numbers (e.g., from foo_1.jpg to foo_54.jpg)?
                            
                                mysqldump with db in a separate file
                            
                                How to run given function in Bash in parallel?
                            
                                Suppressing "null device" output with R in batch mode

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With