Hi, I have a simple script that takes a file and runs another Perl script on it. The script does this to every picture file in the current folder. It is running on a machine with two quad-core Xeon processors, 16 GB of RAM, running Red Hat Linux.
The first script, work.pl, basically calls magicplate.pl, passing some parameters and the name of the file for magicplate.pl to process. magicplate.pl takes about a minute to process each image. Because work.pl performs the same function over 100 times, and because the system has multiple processors and cores, I was thinking about splitting the task up so that it could run multiple times in parallel. I could split the images into different folders if necessary. Any help would be great. Thank you.
Here is what I have so far:
use strict;
use warnings;

my @initialImages = <*>;

foreach my $file (@initialImages) {
    if ($file =~ /.png/) {
        print "processing $file...\n";
        my @tmp  = split(/\./, $file);
        my $name = "";
        for (my $i = 0; $i < (@tmp - 1); $i++) {
            if ($name eq "") { $name = $tmp[$i]; } else { $name = $name . "." . $tmp[$i]; }
        }
        my $exten = $tmp[@tmp - 1];
        my $orig  = $name . "." . $exten;
        system("perl magicPlate.pl -i " . $orig . " -min 4 -max 160 -d 1");
    }
}
Creating threads: like any other module, you need to tell Perl that you want to use it; use threads; imports all the pieces you need to create basic threads. The create() method takes a reference to a subroutine and creates a new thread that starts executing in the referenced subroutine.
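A minimal sketch of that (the process_file worker subroutine here is only a placeholder, not part of your scripts):

use strict;
use warnings;
use threads;

# Placeholder worker; the name and body are only illustrative.
sub process_file {
    my ($file) = @_;
    print "thread ", threads->tid(), " handling $file\n";
}

# create() takes a code reference plus any arguments to pass to it.
my $thr = threads->create(\&process_file, 'example.png');

# Wait for the thread to finish before the program exits.
$thr->join();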
Perl can do asynchronous programming with modules like IO::Async or Coro, but that is still single-threaded. Perl can also be compiled with thread support, which provides genuine multi-threaded computing.
Yes, a single process can run multiple threads on different cores. Caching is specific to the hardware; many modern Intel processors have three levels of cache, where the last-level cache is shared across cores.
You should consider NOT creating a new process for each file that you want to process -- it's horribly inefficient, and probably what is taking most of your time here. Loading up Perl and whatever modules you use over and over again creates real overhead. I recall a poster on PerlMonks who did something similar and ended up transforming his second script into a module, reducing the working time from an hour to a couple of minutes. Not that you should expect such a dramatic improvement, but one can dream...
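As a rough sketch of that refactoring (the module name MagicPlate and the subroutine process_image are hypothetical here, since the contents of magicPlate.pl aren't shown), the processing code would move into a module that work.pl loads once:

# MagicPlate.pm -- hypothetical module wrapping the logic of magicPlate.pl
package MagicPlate;

use strict;
use warnings;
use Exporter 'import';
our @EXPORT_OK = ('process_image');

# Takes the same parameters magicPlate.pl accepts on its command line.
sub process_image {
    my (%args) = @_;    # e.g. file => 'foo.png', min => 4, max => 160, d => 1
    # ... the image-processing code from magicPlate.pl goes here ...
    return;
}

1;

work.pl would then say use MagicPlate 'process_image'; once at the top and call process_image(file => $orig, min => 4, max => 160, d => 1) inside the loop, so the interpreter and modules are only loaded once.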
With the second script refactored as a module, here's an example of thread usage, in which BrowserUK creates a thread pool, feeding it jobs through a queue.
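That example isn't reproduced here, but a minimal sketch in the same spirit, using threads and Thread::Queue (the worker count and the system() call are my assumptions, not BrowserUK's original code), looks like this:

use strict;
use warnings;
use threads;
use Thread::Queue;

my $THREADS = 8;                 # assumption: roughly one worker per core
my $queue   = Thread::Queue->new;

# Worker: pull filenames off the queue until the undef marker arrives.
sub worker {
    while (defined(my $file = $queue->dequeue)) {
        # Ideally this would call into the refactored module instead,
        # e.g. MagicPlate::process_image(...), to avoid restarting Perl.
        system("perl magicPlate.pl -i $file -min 4 -max 160 -d 1");
    }
}

# Start the pool.
my @pool = map { threads->create(\&worker) } 1 .. $THREADS;

# Queue every .png in the current directory, then one undef per worker
# so each thread knows when to stop.
$queue->enqueue(grep { /\.png$/ } <*>);
$queue->enqueue((undef) x $THREADS);

$_->join for @pool;

Each worker pays the startup cost once rather than once per image if the system() call is replaced with a direct call into the refactored module.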
You could use Parallel::ForkManager (set $MAX_PROCESSES to the number of files to process at the same time):
use strict;
use warnings;
use Parallel::ForkManager;

my $MAX_PROCESSES = 8;    # number of images to process at the same time
my $pm = Parallel::ForkManager->new($MAX_PROCESSES);

my @initialImages = <*>;

foreach my $file (@initialImages) {
    if ($file =~ /\.png$/) {
        print "processing $file...\n";
        my @tmp  = split(/\./, $file);
        my $name = "";
        for (my $i = 0; $i < (@tmp - 1); $i++) {
            if ($name eq "") { $name = $tmp[$i]; } else { $name = $name . "." . $tmp[$i]; }
        }
        my $exten = $tmp[@tmp - 1];
        my $orig  = $name . "." . $exten;

        my $pid = $pm->start and next;    # fork; the parent moves on to the next file
        system("perl magicPlate.pl -i " . $orig . " -min 4 -max 160 -d 1");
        $pm->finish;                      # terminates the child process
    }
}

$pm->wait_all_children;    # wait for all outstanding children to finish
But, as suggested by Hugmeir, running the Perl interpreter again and again for each new file is not a good idea.