
Coordinating parallel execution in node.js

The event-driven programming model of node.js makes it somewhat tricky to coordinate the program flow.

Simple sequential execution gets turned into nested callbacks, which is easy enough (though a bit convoluted to write down).
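For illustration, here is a minimal sketch of the kind of nesting this produces (reading three files one after another; the file names are just placeholders):

var fs = require('fs');

// Each read only starts once the previous callback has fired.
fs.readFile('file1', function (err, data1) {
  if (err) throw err;
  fs.readFile('file2', function (err, data2) {
    if (err) throw err;
    fs.readFile('file3', function (err, data3) {
      if (err) throw err;
      // only here are all three results available
      console.log(data1.length, data2.length, data3.length);
    });
  });
});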

But how about parallel execution? Say you have three tasks A, B, and C that can run in parallel, and when they are all done you want to send their results to task D.

With a fork/join model, this would be:

  • fork A
  • fork B
  • fork C
  • join A,B,C, run D

How do I write that in node.js? Are there any best practices or cookbooks? Do I have to hand-roll a solution every time, or is there some library with helpers for this?

asked Jan 08 '11 by Thilo



1 Answer

Nothing is truly parallel in node.js since it is single-threaded. However, multiple events can be scheduled and run in a sequence you can't determine beforehand. And some things, like database access, are actually "parallel" in that the database queries themselves run in separate threads and are re-integrated into the event stream when completed.
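As a small illustration (a sketch using fs.readFile; the file names are placeholders), two I/O operations started back to back complete in whatever order the underlying I/O finishes:

var fs = require('fs');

// Both reads are started immediately; the callbacks fire in whatever
// order the reads happen to complete, which you can't predict in advance.
fs.readFile('file1', function (err, data) { console.log('file1 done'); });
fs.readFile('file2', function (err, data) { console.log('file2 done'); });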

So, how do you schedule a callback on multiple event handlers? Well, this is one common technique used in animations in browser-side JavaScript: use a counter variable to track completion.

This sounds like a hack, and it is. It also sounds like it could get messy, leaving a bunch of global variables around to do the tracking, and in a lesser language it would. But in JavaScript we can use closures:

function fork (async_calls, shared_callback) {
  var counter = async_calls.length;
  var callback = function () {
    counter--;
    if (counter == 0) {
      shared_callback();
    }
  }

  for (var i = 0; i < async_calls.length; i++) {
    async_calls[i](callback);
  }
}

// usage:
fork([A, B, C], D);

In the example above we keep the code simple by assuming the async and callback functions require no arguments. You can of course modify the code to pass arguments to the async functions and have the callback function accumulate results and pass them to the shared_callback function.


Additional answer:

Actually, even as is, that fork() function can already pass arguments to the async functions using a closure:

fork([
  function (callback) { A(1, 2, callback) },
  function (callback) { B(1, callback) },
  function (callback) { C(1, 2, callback) }
], D);

The only thing left to do is to accumulate the results from A, B, C and pass them on to D.


Even more additional answer:

I couldn't resist. Kept thinking about this during breakfast. Here's an implementation of fork() that accumulates results (usually passed as arguments to the callback function):

function fork (async_calls, shared_callback) {
  var counter = async_calls.length;
  var all_results = [];

  function makeCallback (index) {
    return function () {
      counter--;
      var results = [];
      // we use the arguments object here because some callbacks
      // in Node pass in multiple arguments as result.
      for (var i = 0; i < arguments.length; i++) {
        results.push(arguments[i]);
      }
      all_results[index] = results;
      if (counter == 0) {
        shared_callback(all_results);
      }
    }
  }

  for (var i = 0; i < async_calls.length; i++) {
    async_calls[i](makeCallback(i));
  }
}

That was easy enough. This makes fork() fairly general-purpose, and it can be used to synchronize multiple non-homogeneous events.

Example usage in Node.js:

// Read 3 files in parallel and process them together:

var fs = require('fs');

function A (c) { fs.readFile('file1', c) }
function B (c) { fs.readFile('file2', c) }
function C (c) { fs.readFile('file3', c) }

function D (result) {
  // each entry of result is the [err, data] pair passed to the readFile callback
  var file1data = result[0][1];
  var file2data = result[1][1];
  var file3data = result[2][1];

  // process the files together here
}

fork([A, B, C], D);

Update

This code was written before the existence of libraries like async.js or the various promise-based libraries. I'd like to believe that async.js was inspired by this, but I don't have any proof of it. Anyway, if you're thinking of doing this today, take a look at async.js or promises. Just consider the answer above a good explanation/illustration of how things like async.parallel work.

For completeness' sake, the following is how you'd do it with async.parallel:

var async = require('async');

async.parallel([A, B, C], D);

Note that async.parallel works much the same as the fork function we implemented above. The main difference is that it passes an error as the first argument to D and the results as the second argument, as per node.js convention.
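To make that concrete, here is a sketch of the same three-file example wired up with async.parallel (the error handling is just illustrative):

var async = require('async');
var fs = require('fs');

function A (c) { fs.readFile('file1', c) }
function B (c) { fs.readFile('file2', c) }
function C (c) { fs.readFile('file3', c) }

// async.parallel collects each task's result and calls D exactly once,
// with any error as the first argument and the array of results second.
function D (err, results) {
  if (err) return console.error(err);
  var file1data = results[0];
  var file2data = results[1];
  var file3data = results[2];
  // process the files together here
}

async.parallel([A, B, C], D);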

Using promises, we'd write it as follows:

// Assuming A, B & C return a promise instead of accepting a callback
// (note that they are called here, so Promise.all receives promises, not functions)

Promise.all([A(), B(), C()]).then(D);
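For a concrete promise-based version (a sketch assuming a Node.js version that provides fs.promises):

var fs = require('fs').promises;

// All three reads start immediately; Promise.all resolves once every
// promise has resolved, keeping the results in the original order.
Promise.all([
  fs.readFile('file1'),
  fs.readFile('file2'),
  fs.readFile('file3')
]).then(function (results) {
  var file1data = results[0];
  var file2data = results[1];
  var file3data = results[2];
  // process the files together here
}).catch(function (err) {
  console.error(err);
});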
answered Sep 24 '22 by slebetman