Parse fixed-width files

1 Answer

As user604939 mentions, unpack is the right tool for fixed-width fields. However, unpack needs a template that describes the field widths. Since you say your field widths can change, the solution is to build this template from the first line of your file:

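use strict;
use warnings;
use Data::Dumper;

# Each run of non-whitespace plus the whitespace after it is one column.
my @template = map { 'A' . length }    # convert each segment to 'A##'
               <DATA> =~ /(\S+\s*)/g;  # split first line into segments
$template[-1] = 'A*';                  # set the last segment to be slurpy

my $template = "@template";
print "template: $template\n";

my @data;
while (<DATA>) {
    # 'A' fields drop trailing whitespace, so no chomp is needed
    push @data, [unpack $template, $_];
}

print Dumper \@data;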

__DATA__
<c>     <c>       <c>
Dave    Thomas    123 Main
Dan     Anderson  456 Center
Wilma   Rainbow   789 Street

which prints:

template: A8 A10 A*
$VAR1 = [
          [
            'Dave',
            'Thomas',
            '123 Main'
          ],
          [
            'Dan',
            'Anderson',
            '456 Center'
          ],
          [
            'Wilma',
            'Rainbow',
            '789 Street'
          ]
        ];
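
If you need this outside a one-off script, the same idea can be wrapped in a pair of subs. This is only a sketch: the names template_from_header and parse_fixed_width are made up for illustration, and it assumes the header line marks the start of every column with a non-whitespace token, as in the sample data above. An in-memory filehandle stands in for a real file.

```perl
use strict;
use warnings;

# Hypothetical helper: derive an unpack template from a header line.
# Each run of non-whitespace plus its trailing whitespace is one column.
sub template_from_header {
    my ($header) = @_;
    chomp $header;
    my @parts = map { 'A' . length($_) } $header =~ /(\S+\s*)/g;
    $parts[-1] = 'A*' if @parts;    # last field takes the rest of the line
    return "@parts";
}

# Hypothetical helper: read the header, then unpack every remaining line.
sub parse_fixed_width {
    my ($fh) = @_;
    my $header = <$fh>;
    return [] unless defined $header;
    my $template = template_from_header($header);
    my @rows;
    while (my $line = <$fh>) {
        chomp $line;
        next unless length $line;   # skip blank lines
        push @rows, [ unpack $template, $line ];
    }
    return \@rows;
}

# Usage: an in-memory filehandle standing in for a real file.
my $text = "<c>     <c>       <c>\n"
         . "Dave    Thomas    123 Main\n"
         . "Dan     Anderson  456 Center\n";
open my $fh, '<', \$text or die "cannot open in-memory file: $!";
my $rows = parse_fixed_width($fh);
close $fh;

print join('|', @$_), "\n" for @$rows;
```

Keeping the template builder separate from the reader means you can print or check the template before parsing a large file.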

answered Oct 13 '22 16:10

Eric Strom

Tags:

parsing

perl

user_78361084
