I have a fixed-sized array where the size of the array is always in factor of 3. <pre class="prettyprint"><code>my @array = ('foo', 'bar', 'qux', 'foo1', 'bar', 'qux2', 3, 4, 5); </code></pre> How can I cluster the member of array such that we can get an array of array group by 3: <pre class="prettyprint"><code>$VAR = [ ['foo','bar','qux'], ['foo1','bar','qux2'], [3, 4, 5] ]; </code></pre>

I really like List::MoreUtils and use it frequently. However, I have never liked the <code>natatime</code> function. It doesn't produce output that can be used with a for loop or <code>map</code> or <code>grep</code>. I like to chain map/grep/apply operations in my code. Once you understand how these functions work, they can be very expressive and very powerful. But it is easy to make a function to work like natatime that returns a list of array refs. <pre class="prettyprint"><code>sub group_by ($@) { my $n = shift; my @array = @_; croak "group_by count argument must be a non-zero positive integer" unless $n > 0 and int($n) == $n; my @groups; push @groups, [ splice @array, 0, $n ] while @array; return @groups; } </code></pre> Now you can do things like this: <pre class="prettyprint"><code>my @grouped = map [ reverse @$_ ], group_by 3, @array; </code></pre> ** Update re Chris Lutz's suggestions ** Chris, I can see merit in your suggested addition of a code ref to the interface. That way a map-like behavior is built in. <pre class="prettyprint"><code># equivalent to my map/group_by above group_by { [ reverse @_ ] } 3, @array; </code></pre> This is nice and concise. But to keep the nice <code>{}</code> code ref semantics, we have put the count argument <code>3</code> in a hard to see spot. I think I like things better as I wrote it originally. A chained map isn't that much more verbose than what we get with the extended API. With the original approach a grep or other similar function can be used without having to reimplement it. For example, if the code ref is added to the API, then you have to do: <pre class="prettyprint"><code>my @result = group_by { $_[0] =~ /foo/ ? [@_] : () } 3, @array; </code></pre> to get the equivalent of: <pre class="prettyprint"><code>my @result = grep $_->[0] =~ /foo/, group_by 3, @array; </code></pre> Since I suggested this for the sake of easy chaining, I like the original better. Of course, it would be easy to allow either form: <pre class="prettyprint"><code>sub _copy_to_ref { [ @_ ] } sub group_by ($@) { my $code = \&_copy_to_ref; my $n = shift; if( reftype $n eq 'CODE' ) { $code = $n; $n = shift; } my @array = @_; croak "group_by count argument must be a non-zero positive integer" unless $n > 0 and int($n) == $n; my @groups; push @groups, $code->(splice @array, 0, $n) while @array; return @groups; } </code></pre> Now either form should work (untested). I'm not sure whether I like the original API, or this one with the built in map capabilities better. Thoughts anyone? ** Updated again ** Chris is correct to point out that the optional code ref version would force users to do: <pre class="prettyprint"><code>group_by sub { foo }, 3, @array; </code></pre> Which is not so nice, and violates expectations. Since there is no way to have a flexible prototype (that I know of), that puts the kibosh on the extended API, and I'd stick with the original. On a side note, I started with an anonymous sub in the alternate API, but I changed it to a named sub because I was subtly bothered by how the code looked. No real good reason, just an intuitive reaction. I don't know if it matters either way.

<pre class="prettyprint"><code>my @VAR; push @VAR, [ splice @array, 0, 3 ] while @array; </code></pre> or you could use <code>natatime</code> from <code>List::MoreUtils</code> <pre class="prettyprint"><code>use List::MoreUtils qw(natatime); my @VAR; { my $iter = natatime 3, @array; while( my @tmp = $iter->() ){ push @VAR, \@tmp; } } </code></pre>

How can I partition a Perl array into equal sized chunks?

Tags:

perl

I have a fixed-sized array where the size of the array is always in factor of 3.

my @array = ('foo', 'bar', 'qux', 'foo1', 'bar', 'qux2', 3, 4, 5);

How can I cluster the member of array such that we can get an array of array group by 3:

$VAR = [ ['foo','bar','qux'],
         ['foo1','bar','qux2'],
         [3, 4, 5] ];

816

asked Sep 29 '09 06:09

neversaint

2 Answers

I really like List::MoreUtils and use it frequently. However, I have never liked the natatime function. It doesn't produce output that can be used with a for loop or map or grep.

I like to chain map/grep/apply operations in my code. Once you understand how these functions work, they can be very expressive and very powerful.

But it is easy to make a function to work like natatime that returns a list of array refs.

sub group_by ($@) {
    my $n     = shift;
    my @array = @_;

    croak "group_by count argument must be a non-zero positive integer"
        unless $n > 0 and int($n) == $n;

    my @groups;
    push @groups, [ splice @array, 0, $n ] while @array;

    return @groups;
}

Now you can do things like this:

my @grouped = map [ reverse @$_ ],
              group_by 3, @array;

** Update re Chris Lutz's suggestions **

Chris, I can see merit in your suggested addition of a code ref to the interface. That way a map-like behavior is built in.

# equivalent to my map/group_by above
group_by { [ reverse @_ ] } 3, @array;

This is nice and concise. But to keep the nice {} code ref semantics, we have put the count argument 3 in a hard to see spot.

I think I like things better as I wrote it originally.

A chained map isn't that much more verbose than what we get with the extended API. With the original approach a grep or other similar function can be used without having to reimplement it.

For example, if the code ref is added to the API, then you have to do:

my @result = group_by { $_[0] =~ /foo/ ? [@_] : () } 3, @array;

to get the equivalent of:

my @result = grep $_->[0] =~ /foo/,
             group_by 3, @array;

Since I suggested this for the sake of easy chaining, I like the original better.

Of course, it would be easy to allow either form:

sub _copy_to_ref { [ @_ ] }

sub group_by ($@) {
    my $code = \&_copy_to_ref;
    my $n = shift;

    if( reftype $n eq 'CODE' ) {
        $code = $n;
        $n = shift;
    }

    my @array = @_;

    croak "group_by count argument must be a non-zero positive integer"
        unless $n > 0 and int($n) == $n;

    my @groups;
    push @groups, $code->(splice @array, 0, $n) while @array;

    return @groups;
}

Now either form should work (untested). I'm not sure whether I like the original API, or this one with the built in map capabilities better.

Thoughts anyone?

** Updated again **

Chris is correct to point out that the optional code ref version would force users to do:

group_by sub { foo }, 3, @array;

Which is not so nice, and violates expectations. Since there is no way to have a flexible prototype (that I know of), that puts the kibosh on the extended API, and I'd stick with the original.

On a side note, I started with an anonymous sub in the alternate API, but I changed it to a named sub because I was subtly bothered by how the code looked. No real good reason, just an intuitive reaction. I don't know if it matters either way.

answered Oct 13 '22 15:10

daotoad

my @VAR;
push @VAR, [ splice @array, 0, 3 ] while @array;

or you could use natatime from List::MoreUtils

use List::MoreUtils qw(natatime);

my @VAR;
{
  my $iter = natatime 3, @array;
  while( my @tmp = $iter->() ){
    push @VAR, \@tmp;
  }
}

194

answered Oct 13 '22 15:10

Brad Gilbert

Related questions
                            
                                Remove leading zeroes but not all zeroes
                            
                                How do I inherit subroutines in Perl with 'use base'?
                            
                                What are the best practices for error handling in Perl?
                            
                                What does for (;;) mean in Perl?
                            
                                How do I push a value onto a Perl hash of arrays?
                            
                                Decrypt obfuscated perl script
                            
                                Perl: Alternatives to template toolkit
                            
                                Installing Perl module LWP::Protocol::https
                            
                                Why does perl warn that open my $fh, $file is missing parentheses?
                            
                                Why doesn't print output anything on each iteration of a loop when I use sleep?
                            
                                How to fix: 'YAML' not installed when installing XML::Simple?
                            
                                How do I convert decimal numbers to binary in Perl?
                            
                                How can I set a default value for a Perl variable?
                            
                                How to split a string with multiple patterns in perl?
                            
                                Should I escape shell arguments in Perl?
                            
                                Read file into variable in Perl [duplicate]
                            
                                Only print matching lines in perl from the command line
                            
                                What is the best way to match only letters in a regex?
                            
                                Printing out the code of an anonymous subroutine
                            
                                Cannot run cgi, show plain text only (Ubuntu 13.10 Apache 2.4)

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With