Best practices in setting number of dask workers

1 Answers

By "node" people typically mean a physical or virtual machine. That node can run several programs or processes at once (much like how my computer can run a web browser and text editor at once). Each process can parallelize within itself with many threads. Processes have isolated memory environments, meaning that sharing data within a process is free, while sharing data between processes is expensive.

Typically things work best on larger nodes (like 36 cores) if you cut them up into a few processes, each of which have several threads. You want the number of processes times the number of threads to equal the number of cores. So for example you might do something like the following for a 36 core machine:

Four processes with nine threads each
Twelve processes with three threads each
One process with thirty-six threads

Typically one decides between these choices based on the workload. The difference here is due to Python's Global Interpreter Lock, which limits parallelism for some kinds of data. If you are working mostly with Numpy, Pandas, Scikit-Learn, or other numerical programming libraries in Python then you don't need to worry about the GIL, and you probably want to prefer few processes with many threads each. This helps because it allows data to move freely between your cores because it all lives in the same process. However, if you're doing mostly Pure Python programming, like dealing with text data, dictionaries/lists/sets, and doing most of your computation in tight Python for loops then you'll want to prefer having many processes with few threads each. This incurs extra communication costs, but lets you bypass the GIL.

In short, if you're using mostly numpy/pandas-style data, try to get at least eight threads or so in a process. Otherwise, maybe go for only two threads in a process.

177

answered Sep 18 '22 06:09

MRocklin

Related questions
                            
                                Why Kafka is not P in CAP theorem
                            
                                Smooth vue collapse transition on v-if
                            
                                How does minReadySeconds affect readiness probe?
                            
                                Dotnet Unit test with Coverlet- How to get coverage for entire solution and not just a project
                            
                                A basic Monoid definition gives "No instance for (Semigroup MyMonoid) arising from the superclasses of an instance declaration"
                            
                                Should a component's render method have return type React.ReactNode or JSX.Element?
                            
                                BeginTransaction with IsolationLevel in EF Core
                            
                                TryGetValue pattern with C# 8 nullable reference types
                            
                                Async/Await with Vuex dispatch
                            
                                Gradle duplicate entry error: META-INF/MANIFEST.MF (Or how to delete a file from jar)
                            
                                How to import existing VPC in aws cdk?
                            
                                How can you quickly compute the integer logarithm for any base?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Best practices in setting number of dask workers

Tags:

kristofarkas

People also ask

1 Answers

MRocklin

Recent Activity

Donate For Us