
To fork or not to fork?

I am re-developing a system that will send messages via HTTP to one of a number of suppliers. The original system is a set of Perl scripts, and the re-development will most likely also be in Perl.

In the old system, a number of Perl scripts all ran at the same time, five for each supplier. When a message was put into the database, a random thread number (1-5) and the supplier were chosen to ensure that no message was processed twice, while avoiding having to lock the table/row. Additionally, there was a "Fair Queue Position" field in the database to ensure that a large message send didn't delay small sends that happened while the large one was being sent.

Sometimes there would be just a couple of messages per minute, but at other times there would be a dump of potentially hundreds of thousands of messages. It seems like a waste of resources to have all the scripts running and checking for messages all the time, so I am trying to work out whether there is a better way to do it, or whether the old way is acceptable.

My current thinking is to have one script that runs and forks as many child processes as are needed (up to a limit), depending on how much traffic there is, but I am not sure how best to implement it so that each message is processed only once while fair queuing is maintained.

My best guess right now is that the parent script updates the DB to indicate which child process should handle each message; however, I am concerned that this will end up being less efficient than the original method. I have little experience writing forking code (the last time I did it was about 15 years ago).

Any thoughts or links to guides on how best to process message queues appreciated!
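
As an illustration of the kind of approach I mean, an untested sketch using something like Parallel::ForkManager might look like the following (get_pending_messages(), claim_message() and send_message() are just placeholder names for illustration, not existing code):

use strict;
use warnings;

use Parallel::ForkManager;

my $max_children = 10;                        # upper limit on child processes
my $pm = Parallel::ForkManager->new($max_children);

while (1) {
    # Placeholder: fetch a batch of unsent messages in fair-queue order
    my @messages = get_pending_messages();

    unless (@messages) {
        sleep 5;                              # nothing to do, poll again later
        next;
    }

    for my $msg (@messages) {
        # The parent marks the row as claimed before forking, so no
        # message is ever handed to two children.
        claim_message($msg);

        $pm->start and next;                  # fork; parent continues the loop
        send_message($msg);                   # child does the HTTP send
        $pm->finish;                          # child exits
    }
    $pm->wait_all_children;
}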

asked Oct 31 '12 by Ben Holness


2 Answers

You could use Thread::Queue, or any of the other options from this question: Is there a multiprocessing module for Perl?

If the old system was written in Perl, doing it this way would let you reuse most of it.

Untested example (the database parts are left as placeholders):

use strict;
use warnings;

use threads;
use Thread::Queue;

my $q = Thread::Queue->new();    # A new empty queue

# Worker threads (detached, so no join is needed)
for (1 .. 10) {                  # 10 worker threads
    threads->create(sub {
        while (defined(my $item = $q->dequeue())) {
            # Do work on $item
        }
    })->detach();
}

my $dbh = ...;                   # connect to the database here (placeholder)
while (1) {
    # Get items from the DB and hand them to the worker threads
    my @items = get_items_from_db($dbh);
    $q->enqueue(@items);
    print "Pending items: " . $q->pending() . "\n";
    sleep 15;                    # check the DB every 15 seconds
}
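
One thing to note about this sketch: because the producer loop never ends and the workers are detached, they simply run until the process exits. If a clean shutdown is ever needed, newer versions of Thread::Queue provide $q->end(), which makes blocked dequeue() calls return undef once the queue is drained so the workers can fall out of their loops.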
answered Nov 14 '22 by user1126070

I would suggest using a message queue server like RabbitMQ.

One process feeds work into the queue, and you can have multiple worker processes consume the queue.

Advantages of this approach:

  • workers block when waiting for work (no busy waiting)
  • more worker processes can be started up manually if needed
  • worker processes don't have to be a child of a special parent process
  • RabbitMQ will distribute the work among all workers which are ready to accept work
  • RabbitMQ will put work back into the queue if the worker doesn't return an ACK
  • you don't have to assign work in the database
  • every "agent" (worker, producer, etc.) is an independent process which means you can kill it or restart it without affecting other processes
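
To make this concrete, here is a rough, untested sketch of what one such worker might look like with the Net::AMQP::RabbitMQ client (the "messages" queue name, the default guest credentials, and process_message() are assumptions for illustration, not part of the answer):

use strict;
use warnings;

use Net::AMQP::RabbitMQ;

my $mq = Net::AMQP::RabbitMQ->new();
$mq->connect("localhost", { user => "guest", password => "guest" });
$mq->channel_open(1);
$mq->queue_declare(1, "messages", { durable => 1 });
$mq->consume(1, "messages", { no_ack => 0 });     # require explicit ACKs

while (1) {
    my $msg = $mq->recv();                        # blocks until work arrives
    last unless defined $msg;
    process_message($msg->{body});                # placeholder for the real work
    $mq->ack(1, $msg->{delivery_tag});            # ACK only after successful processing
}

$mq->disconnect();

sub process_message {
    my ($body) = @_;
    # ... do the actual work (e.g. the HTTP send to the supplier) here ...
}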

To dynamically scale the number of workers up or down, you can implement something like:

  1. have workers automatically die if they don't get work for a specified amount of time (a sketch of this idle timeout follows the list)
  2. have another process monitor the length of the queue and spawn more workers if the queue is getting too big
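
For point 1, assuming a reasonably recent Net::AMQP::RabbitMQ whose recv() accepts a millisecond timeout (this is an assumption about the client version, so check the module's documentation), the worker loop from the sketch above could be adapted roughly like this:

# Exit the worker if no work arrives for 5 minutes.
my $idle_timeout_ms = 5 * 60 * 1000;

while (1) {
    my $msg = $mq->recv($idle_timeout_ms);        # undef if the timeout expires
    last unless defined $msg;                     # idle too long: let this worker die
    process_message($msg->{body});
    $mq->ack(1, $msg->{delivery_tag});
}
$mq->disconnect();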
answered Nov 14 '22 by ErikR