I'm parsing a large file in Perl line-by-line (terminated by \n), but when I reach a certain keyword, say "TARGET", I need to grab all the lines between TARGET and the next completely empty line. So, given a segment of a file: Line 1 Line 2 Line 3 Line 4 Target Line 5 Grab this line Line 6 Grab this line \n It should become: Line 4 Target Line 5 Grab this line Line 6 Grab this line The reason I'm having trouble is I'm already going through the file line-by-line; how do I change what I delimit by midway through the parsing process?

You want something like this: <pre class="prettyprint"><code>my @grabbed; while (<FILE>) { if (/TARGET/) { push @grabbed, $_; while (<FILE>) { last if /^$/; push @grabbed, $_; } } } </code></pre>

The short answer: line delimiter in perl is <code>$/</code>, so when you hit TARGET, you can set <code>$/</code> to <code>"\n\n"</code>, read the next "line", then set it back to "\n"... et voilà! Now for the longer one: if you use the <code>English</code> module (which gives sensible names to all of Perl's magic variable, then <code>$/</code> is called <code>$RS</code> or <code>$INPUT_RECORD_SEPARATOR</code>. If you use <code>IO::Handle</code>, then <code>IO::Handle->input_record_separator( "\n\n")</code> will work. And if you're doing this as part of a bigger piece of code, don't forget to either localize (using <code>local $/;</code> in the appropriate scope) or to set back <code>$/</code> to its original value of <code>"\n"</code>.

From perlfaq6's answer to How can I pull out lines between two patterns that are themselves on different lines? <hr> You can use Perl's somewhat exotic .. operator (documented in perlop): <pre class="prettyprint"><code>perl -ne 'print if /START/ .. /END/' file1 file2 ... </code></pre> If you wanted text and not lines, you would use <pre class="prettyprint"><code>perl -0777 -ne 'print "$1\n" while /START(.*?)END/gs' file1 file2 ... </code></pre> But if you want nested occurrences of START through END, you'll run up against the problem described in the question in this section on matching balanced text. Here's another example of using ..: <pre class="prettyprint"><code>while (<>) { $in_header = 1 .. /^$/; $in_body = /^$/ .. eof; # now choose between them } continue { $. = 0 if eof; # fix $. } </code></pre>

<pre class="prettyprint"><code>while(<FILE>) { if (/target/i) { $buffer .= $_; while(<FILE>) { $buffer .= $_; last if /^\n$/; } } } </code></pre>

How can I grab multiple lines after a matching line in Perl?

Q: How do I skip a line in Perl?

To skip over blanks lines in a perl script, you have several choices. You could use a "next if /^$/" (skip if empty) command or a "next if /^\s*$/" skip if empty or only white space.

Q: What is\ b in Perl?

Depending on how it is used, \b can have a special meaning within a Perl command: \b is the backspace character only inside a character class. Outside a character class, \b alone is a word-character/non-word-character boundary.

6 Answers

You want something like this:

my @grabbed;
while (<FILE>) {
    if (/TARGET/) {
        push @grabbed, $_;
        while (<FILE>) {
            last if /^$/;
            push @grabbed, $_;
        }
    }
}

132

answered Oct 21 '22 07:10

dave4420

The range operator is ideal for this sort of task:

$ cat try
#! /usr/bin/perl

while (<DATA>) {
  print if /\btarget\b/i .. /^\s*$/
}

__DATA__
Line 1
Line 2
Line 3
Line 4 Target
Line 5 Grab this line
Line 6 Grab this line

Nope
Line 7 Target
Linu 8 Yep

Nope again

$ ./try
Line 4 Target
Line 5 Grab this line
Line 6 Grab this line

Line 7 Target
Linu 8 Yep

answered Oct 21 '22 08:10

Greg Bacon

The short answer: line delimiter in perl is $/, so when you hit TARGET, you can set $/ to "\n\n", read the next "line", then set it back to "\n"... et voilà!

Now for the longer one: if you use the English module (which gives sensible names to all of Perl's magic variable, then $/ is called $RS or $INPUT_RECORD_SEPARATOR. If you use IO::Handle, then IO::Handle->input_record_separator( "\n\n") will work.

And if you're doing this as part of a bigger piece of code, don't forget to either localize (using local $/; in the appropriate scope) or to set back $/ to its original value of "\n".

answered Oct 21 '22 07:10

mirod

From perlfaq6's answer to How can I pull out lines between two patterns that are themselves on different lines?

You can use Perl's somewhat exotic .. operator (documented in perlop):

perl -ne 'print if /START/ .. /END/' file1 file2 ...

If you wanted text and not lines, you would use

perl -0777 -ne 'print "$1\n" while /START(.*?)END/gs' file1 file2 ...

But if you want nested occurrences of START through END, you'll run up against the problem described in the question in this section on matching balanced text.

Here's another example of using ..:

while (<>) {
    $in_header =   1  .. /^$/;
    $in_body   = /^$/ .. eof;
# now choose between them
} continue {
    $. = 0 if eof;  # fix $.
}

answered Oct 21 '22 08:10

brian d foy

while(<FILE>)
{
    if (/target/i)
    {
        $buffer .= $_;
        while(<FILE>)
        {
            $buffer .= $_;
            last if /^\n$/;
        }
    }
}

answered Oct 21 '22 08:10

user105033

use strict;
use warnings;

my $inside = 0;
my $data = '';
while (<DATA>) {
    $inside = 1 if /Target/;
    last if /^$/ and $inside;
    $data .= $_ if $inside;
}

print '[' . $data . ']';

__DATA__
Line 1
Line 2
Line 3
Line 4 Target
Line 5 Grab this line
Line 6 Grab this line

Next Line

Edit to fix the exit condition as per the note below.

answered Oct 21 '22 09:10

telesphore4

Related questions
                            
                                How do I control the variable names in Perl's Data::Dumper?
                            
                                How can I send an HTML email with Perl?
                            
                                how to run piece of code just before the exit of perl script
                            
                                How to take substring of a given string until the first appearance of specified character?
                            
                                Why can't I use the diamond operator with an array in Perl?
                            
                                What characters are allowed in Perl identifiers?
                            
                                Capture first 8 characters Perl
                            
                                Is regex in perl faster than in Java or other languages? [closed]
                            
                                Why can't I fetch wikipedia pages with LWP::Simple?
                            
                                Is there a Perl solution for lazy lists this side of Perl 6?
                            
                                How can I read the lines of a file into an array in Perl?
                            
                                What dynamic programming features of Perl should I be using?
                            
                                In Perl, how can I make a deep copy of an array? [duplicate]
                            
                                Best way to avoid "isn't numeric in numeric eq (==)"-warning
                            
                                How do I assign the result of a regex match to a new variable, in a single line?
                            
                                Perl regular expression: match nested brackets
                            
                                How do you read the system time and date in Perl?
                            
                                What should I put in my starter template for my Perl programs? [closed]
                            
                                Why would I return a hash or a hash reference in Perl?
                            
                                In Perl, how can I tell if a string is a number?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How can I grab multiple lines after a matching line in Perl?

Tags:

perl

Dirk

People also ask

6 Answers

dave4420

Greg Bacon

mirod

brian d foy

user105033

telesphore4

Recent Activity

Donate For Us