 

How do I serve a large file for download with Perl?

I need to serve a large file (500+ MB) for download from a location that is not accessible to the web server. I found the question Serving large files with PHP, which is identical to my situation, but I'm using Perl instead of PHP.

I tried simply printing the file line by line, but the browser does not prompt for the download until it has received the entire file:

use Tie::File;

open my $fh, '<', '/path/to/file.txt'
    or die "Could not open file: $!";
tie my @file, 'Tie::File', $fh
    or die "Could not tie file: $!";
my $size_in_bytes = -s $fh;
print "Content-type: text/plain\n";
print "Content-Length: $size_in_bytes\n";
print "Content-Disposition: attachment; filename=file.txt\n\n";
for my $line (@file) {
    print $line;
}
untie @file;
close $fh;
exit;

Does Perl have an equivalent to PHP's readfile() function (as suggested in the PHP answer), or is there another way to accomplish what I'm trying to do here?

asked Feb 21 '09 by cowgod

People also ask

How do I download a large file?

For very large downloads (more than 2 GB), we recommend that you use a download manager. This can make your download more stable and faster, reducing the risk of a corrupted file. Simply save the downloaded file to your local drive.

How do I download a file in Perl?

use File::Fetch;
my $url  = 'http://www.example.com/file.txt';
my $ff   = File::Fetch->new(uri => $url);
my $file = $ff->fetch() or die $ff->error;

Note that this module will in fact try to use LWP first if it is installed.

How do I retrieve a file from a website in Perl?

Downloading a web page using the LWP::Simple module: LWP::Simple is a Perl module that provides a get() function which takes a URL as a parameter and returns the body of the document. It returns undef if the requested URL cannot be retrieved.
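For example, a minimal sketch (the URL and local filename are placeholders):

use LWP::Simple;

my $url     = 'http://www.example.com/file.txt';
my $content = get($url);
die "Could not fetch $url" unless defined $content;

# or save straight to a local file; getstore() returns the HTTP status code
is_success( getstore($url, 'file.txt') ) or die "Download of $url failed";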


2 Answers

If you just want to slurp input to output, this should do the trick.

use Carp ();

{ #Lexical For FileHandle and $/ 
  open my $fh, '<' , '/path/to/file.txt' or Carp::croak("File Open Failed");
  local $/ = undef; 
  print scalar <$fh>; 
  close $fh or Carp::carp("File Close Failed");
}

I guess, in response to "Does Perl have a PHP readfile() equivalent?", my answer would be: it doesn't really need one.

I've used PHP's manual file I/O functions and they're a pain; Perl's are so easy to use by comparison that shelling out for a one-size-fits-all function seems like overkill.

Also, you might want to look at X-Sendfile support: you basically send a header telling your webserver which file to serve: http://john.guen.in/past/2007/4/17/send_files_faster_with_xsendfile/ (assuming, of course, that the server has sufficient permissions to access the file, even though the file is not normally reachable via a standard URI).
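For instance, with Apache and mod_xsendfile enabled, a CGI script only has to emit the header and the server streams the file itself (a minimal sketch; the path is a placeholder, and other servers use different headers, e.g. X-Accel-Redirect on nginx):

#!/usr/bin/perl
use strict;
use warnings;

# mod_xsendfile intercepts this header, opens the file with the
# server's own privileges, and handles Content-Length and the transfer
print "X-Sendfile: /path/to/file.txt\n";
print "Content-Type: text/plain\n";
print "Content-Disposition: attachment; filename=file.txt\n\n";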

Edit: Noted, it is better to do it in a loop. I tested the above code against a hard drive, and it does implicitly try to store the whole thing in an invisible temporary variable, eating all your RAM.

Alternative using blocks

The following improved code reads the given file in blocks of 8192 bytes, which is much more memory efficient, and gets throughput respectably comparable to my disk's raw read rate. (I also pointed it at /dev/full for giggles and got a healthy 500 MB/s throughput, and it didn't eat all my RAM, so that must be good.)

{
    open my $fh, '<', '/dev/sda' or die "open failed: $!";
    local $/ = \8192; # read in 8192-byte chunks instead of lines
    print $_ while defined( $_ = scalar <$fh> );
    close $fh;
}

Applying jrockway's suggestions

{
    open my $fh, '<', '/dev/sda5' or die "open failed: $!";
    print $_ while ( sysread $fh, $_, 8192 );
    close $fh;
}

This literally doubles performance, ... and in some cases gets me better throughput than dd does O_o.
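If you want the output side unbuffered as well, a variation pairing sysread with syswrite might look like this (my own sketch, not part of the original benchmark; the path is a placeholder, and note that syswrite can write fewer bytes than requested, hence the inner loop):

{
    open my $fh, '<:raw', '/path/to/file.txt' or die "open failed: $!";
    my $buf;
    while ( my $n = sysread $fh, $buf, 8192 ) {
        my $off = 0;
        while ( $n > 0 ) {
            # write the remaining $n bytes starting at offset $off
            my $w = syswrite STDOUT, $buf, $n, $off
                or die "syswrite failed: $!";
            $off += $w;
            $n   -= $w;
        }
    }
    close $fh;
}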

answered Nov 09 '22 by Kent Fredric

The readline function is called readline (and can also be written as <>).

I'm not sure what problem you're having. Perhaps it's that for loops aren't lazily evaluated (they're not), or perhaps Tie::File is screwing something up. Anyway, the idiomatic Perl for reading a file one line at a time is:

open my $fh, '<', $filename or die "Cannot open $filename: $!";
while(my $line = <$fh>){
   # process $line
}

No need to use Tie::File.

Finally, you should not be handling this sort of thing yourself. This is a job for a web framework. If you were using Catalyst (or HTTP::Engine), you would just say:

open my $fh, '<', $filename or die "Cannot open $filename: $!";
$c->res->body( $fh );

and the framework would automatically serve the data from the file efficiently. (Using stdio via readline is not a good idea here; it's better to read the file in blocks from the disk. But who cares, it's abstracted!)
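A slightly fuller sketch of what that might look like as a Catalyst controller action (the action name and file path are made up for illustration):

sub download :Local {
    my ( $self, $c ) = @_;
    my $filename = '/path/to/file.txt';    # hypothetical location
    open my $fh, '<:raw', $filename
        or die "Cannot open $filename: $!";
    $c->res->content_type('text/plain');
    $c->res->header( 'Content-Disposition' => 'attachment; filename=file.txt' );
    $c->res->content_length( -s $fh );
    $c->res->body($fh);    # Catalyst streams the filehandle for you
}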

answered Nov 09 '22 by jrockway