Node.js on multi-core machines

Tags:

Node.js looks interesting, BUT I must miss something - isn't Node.js tuned only to run on a single process and thread?

Then how does it scale for multi-core CPUs and multi-CPU servers? After all, it is all great to make fast as possible single-thread server, but for high loads I would want to use several CPUs. And the same goes for making applications faster - seems today the way is use multiple CPUs and parallelize the tasks.

How does Node.js fit into this picture? Is its idea to somehow distribute multiple instances or what?

586

asked Mar 05 '10 15:03

zaharpopov

1 Answers

[This post is up-to-date as of 2012-09-02 (newer than above).]

Node.js absolutely does scale on multi-core machines.

Yes, Node.js is one-thread-per-process. This is a very deliberate design decision and eliminates the need to deal with locking semantics. If you don't agree with this, you probably don't yet realize just how insanely hard it is to debug multi-threaded code. For a deeper explanation of the Node.js process model and why it works this way (and why it will NEVER support multiple threads), read my other post.

So how do I take advantage of my 16 core box?

Two ways:

For big heavy compute tasks like image encoding, Node.js can fire up child processes or send messages to additional worker processes. In this design, you'd have one thread managing the flow of events and N processes doing heavy compute tasks and chewing up the other 15 CPUs.
For scaling throughput on a webservice, you should run multiple Node.js servers on one box, one per core and split request traffic between them. This provides excellent CPU-affinity and will scale throughput nearly linearly with core count.

Scaling throughput on a webservice

Since v6.0.X Node.js has included the cluster module straight out of the box, which makes it easy to set up multiple node workers that can listen on a single port. Note that this is NOT the same as the older learnboost "cluster" module available through npm.

Click to copy

if (cluster.isMaster) {   // Fork workers.   for (var i = 0; i < numCPUs; i++) {     cluster.fork();   } } else {   http.Server(function(req, res) { ... }).listen(8000); }

Workers will compete to accept new connections, and the least loaded process is most likely to win. It works pretty well and can scale up throughput quite well on a multi-core box.

If you have enough load to care about multiple cores, then you are going to want to do a few more things too:

Run your Node.js service behind a web-proxy like Nginx or Apache - something that can do connection throttling (unless you want overload conditions to bring the box down completely), rewrite URLs, serve static content, and proxy other sub-services.
Periodically recycle your worker processes. For a long-running process, even a small memory leak will eventually add up.
Setup log collection / monitoring

PS: There's a discussion between Aaron and Christopher in the comments of another post (as of this writing, its the top post). A few comments on that:

A shared socket model is very convenient for allowing multiple processes to listen on a single port and compete to accept new connections. Conceptually, you could think of preforked Apache doing this with the significant caveat that each process will only accept a single connection and then die. The efficiency loss for Apache is in the overhead of forking new processes and has nothing to do with the socket operations.
For Node.js, having N workers compete on a single socket is an extremely reasonable solution. The alternative is to set up an on-box front-end like Nginx and have that proxy traffic to the individual workers, alternating between workers for assigning new connections. The two solutions have very similar performance characteristics. And since, as I mentioned above, you will likely want to have Nginx (or an alternative) fronting your node service anyways, the choice here is really between:

Shared Ports: nginx (port 80) --> Node_workers x N (sharing port 3000 w/ Cluster)

Individual Ports: nginx (port 80) --> {Node_worker (port 3000), Node_worker (port 3001), Node_worker (port 3002), Node_worker (port 3003) ...}

There are arguably some benefits to the individual ports setup (potential to have less coupling between processes, have more sophisticated load-balancing decisions, etc.), but it is definitely more work to set up and the built-in cluster module is a low-complexity alternative that works for most people.

143

answered Oct 18 '22 00:10

Dave Dopson

Related questions
                            
                                How to skip over an element in .map()?
                            
                                How to break/exit from a each() function in JQuery? [duplicate]
                            
                                Set cookie and get cookie with JavaScript [duplicate]
                            
                                How do I replace all line breaks in a string with <br /> elements?
                            
                                In JavaScript, how to conditionally add a member to an object?
                            
                                how to get the last character of a string?
                            
                                Maximum call stack size exceeded error
                            
                                How do I split a string with multiple separators in JavaScript?
                            
                                Is JavaScript guaranteed to be single-threaded?
                            
                                Move an array element from one array position to another
                            
                                jQuery checkbox change and click event
                            
                                Invoking JavaScript code in an iframe from the parent page
                            
                                How to append to a file in Node?
                            
                                How can I force clients to refresh JavaScript files?
                            
                                HTML5 Local storage vs. Session storage
                            
                                How to export JavaScript array info to csv (on client side)?
                            
                                Convert Array to Object
                            
                                Why does JavaScript only work after opening developer tools in IE once?
                            
                                Add a tooltip to a div
                            
                                Count the number of occurrences of a character in a string in Javascript

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Node.js on multi-core machines

Tags:

javascript

node.js

node-cluster