What are the scaling limits of Dask.distributed?

Are there any anecdotal cases of Dask.distributed deployments with hundreds of worker nodes? Is distributed meant to scale to a cluster of this size?

asked Oct 26 '16 by bcollins

People also ask

Is Dask distributed?

Dask.distributed is a centrally managed, distributed, dynamic task scheduler. The central dask-scheduler process coordinates the actions of several dask-worker processes spread across multiple machines and the concurrent requests of several clients.
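
As a rough illustration of that scheduler/worker/client split, here is a minimal sketch (assuming the dask.distributed package is installed) that starts both pieces in one process with LocalCluster and connects a Client to them:

    from dask.distributed import Client, LocalCluster

    # LocalCluster starts a scheduler plus a few local worker processes;
    # in a real deployment the scheduler and workers run on separate machines.
    cluster = LocalCluster(n_workers=4, threads_per_worker=1)
    client = Client(cluster)  # the client sends work to the scheduler

    # The scheduler routes this task to one of the workers.
    future = client.submit(sum, [1, 2, 3])
    print(future.result())  # 6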

How many partitions should I have in Dask?

You should aim for partitions that have around 100MB of data each. Additionally, reducing partitions is very helpful just before shuffling, which creates n log(n) tasks relative to the number of partitions. DataFrames with fewer than 100 partitions are much easier to shuffle than DataFrames with tens of thousands of partitions.
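
For example, with a Dask DataFrame you can ask for partitions of a target size before a shuffle-heavy step. This is a hedged sketch; the file pattern and column name are placeholders:

    import dask.dataframe as dd

    # Hypothetical input files and column name; adjust to your data.
    df = dd.read_csv("data-*.csv")

    # Aim for roughly 100MB per partition before shuffling.
    df = df.repartition(partition_size="100MB")

    result = df.groupby("key").sum().compute()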

How does Dask manage memory?

Dask.distributed stores the results of tasks in the distributed memory of the worker nodes. The central scheduler tracks all data on the cluster and determines when data should be freed. Completed results are usually cleared from memory as quickly as possible in order to make room for more computation.
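
In practice this means results stay on the workers only while something still references their futures. A small sketch, with illustrative names:

    from dask.distributed import Client

    client = Client()  # starts a local cluster by default

    # Each future's result lives in some worker's memory.
    futures = client.map(lambda x: x ** 2, range(100))

    # Reduce on the cluster rather than pulling everything back locally.
    total = client.submit(sum, futures).result()

    # Dropping the last references lets the scheduler free the intermediate data.
    del futures
    print(total)  # 328350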


1 Answer

Yes

The largest Dask.distributed cluster I've seen is around one thousand nodes. We could theoretically go larger, but not by a huge amount.

The current limit is that the scheduler incurs around 200 microseconds of overhead per task. This translates to about 5000 tasks per second. If each of your tasks takes around one second, then the scheduler can saturate around 5000 cores.
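
The arithmetic behind those numbers, written out as a quick sketch:

    # Back-of-the-envelope scheduler throughput estimate.
    overhead_per_task = 200e-6                  # ~200 microseconds of scheduler time per task
    tasks_per_second = 1 / overhead_per_task    # ~5000 tasks per second
    task_duration = 1.0                         # seconds of real work per task
    saturated_cores = tasks_per_second * task_duration  # ~5000 cores kept busy
    print(int(tasks_per_second), int(saturated_cores))  # 5000 5000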

Historically we ran into other limitations like open file handle limits and such. These have all been cleaned up to the scale that we've seen (1000 nodes) and generally things are fine on Linux or OSX. Dask schedulers on Windows stop scaling in the low hundreds of nodes (though you can use a Linux scheduler with Windows workers). I would not be surprised to see other issues pop up as we scale out to 10k nodes.

In short, you probably don't want to use Dask to replace MPI workloads on your million core Big Iron SuperComputer or at Google Scale. Otherwise you're probably fine.

answered Oct 20 '22 by MRocklin