Is there any fast and memory-efficient way to read specific lines of a large file, without loading it into memory?
I wrote a Perl script that runs many forks, and I would like them to read specific lines from a file.
At the moment I'm using an external command:
sub getFileLine {
    my ( $filePath, $lineWanted ) = @_;
    local $SIG{PIPE} = 'IGNORE';    # head exiting early sends tail a SIGPIPE
    open( my $fh, '-|:utf8', "tail -q -n +$lineWanted \"$filePath\" | head -n 1" )
        or die "Cannot run tail/head on $filePath: $!";
    my $line = <$fh>;
    close $fh;
    chomp $line if defined $line;
    return $line;
}
It's fast and it works, but maybe there's a more "Perl-ish" way that is as fast and as memory-efficient as this one?
As you know, forking a process in Perl duplicates the main process's memory, so if the main process is using 10 MB, each fork will use at least that much.
My goal is to keep the memory use of each fork (and therefore of the main process while forks are running) as low as possible. That's why I don't want to load the whole file into memory.
With the -p switch, Perl wraps a while loop around the code you specify with -e, and -i turns on in-place editing. The current line is in $_; with -p, Perl automatically prints the value of $_ at the end of the loop.
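For example, you can pull out a single line from the shell this way (a sketch; file.txt and line 5000 are made-up values, and -n gives you the same implicit loop as -p but without the automatic print):

perl -ne 'if ($. == 5000) { print; exit }' file.txt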
Before you go further, it's important to understand how fork works. When you fork a process, the OS uses copy-on-write semantics to share the bulk of the parent and child processes' memory; only the memory that differs between the parent and child needs to be separately allocated.
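To see what that means in practice, here is a minimal fork sketch (the child does no real work; it merely inherits the parent's data, and nothing is physically copied until one side writes to a shared page):

use strict;
use warnings;

my $pid = fork();
die "fork failed: $!" unless defined $pid;

if ( $pid == 0 ) {
    # Child: shares the parent's pages copy-on-write;
    # reading inherited data allocates no extra memory.
    exit 0;
}
waitpid( $pid, 0 );    # Parent waits for the child to finish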
For reading a single line of a file in Perl, here's a simple way:
open my $fh, '<', $filePath or die "$filePath: $!";
my $line;
while ( <$fh> ) {
    if ( $. == $lineWanted ) {
        $line = $_;
        last;
    }
}
This uses the special $. variable, which holds the current line number of the last filehandle read.
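If you want a drop-in replacement for your getFileLine, the same loop can be wrapped up like this (a sketch; it returns undef when the file has fewer lines than requested):

sub getFileLine {
    my ( $filePath, $lineWanted ) = @_;
    open my $fh, '<:utf8', $filePath or die "$filePath: $!";
    my $line;
    while ( <$fh> ) {
        if ( $. == $lineWanted ) {
            $line = $_;
            last;
        }
    }
    close $fh;
    chomp $line if defined $line;
    return $line;
}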
Take a look at the Tie::File core module.
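For illustration, here is a minimal Tie::File sketch (data.txt and line 5000 are made-up values; note that Tie::File arrays are 0-indexed, and records are fetched lazily rather than slurped into memory):

use strict;
use warnings;
use Fcntl 'O_RDONLY';
use Tie::File;

my ( $filePath, $lineWanted ) = ( 'data.txt', 5000 );

my @lines;
tie @lines, 'Tie::File', $filePath, mode => O_RDONLY
    or die "Cannot tie $filePath: $!";

my $line = $lines[ $lineWanted - 1 ];    # line N of the file is index N - 1

untie @lines;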