I would like to see a progress bar in a Jupyter notebook while running a compute task with Dask. I'm counting all values of the id column from a large CSV file (4+ GB), so any ideas?
import dask.dataframe as dd

df = dd.read_csv('data/train.csv')
df.id.count().compute()
A Dask DataFrame is composed of a collection of underlying pandas DataFrames (partitions). Calling compute() executes the task graph and pulls the result into memory; for a full Dask DataFrame this means concatenating all of its partitions into a single pandas DataFrame.
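As a rough illustration of that partition/compute relationship (the blocksize value below is just an illustrative choice, not something from the question):

import dask.dataframe as dd

# Each ~64 MB chunk of the CSV becomes one pandas partition (blocksize chosen arbitrarily here)
ddf = dd.read_csv('data/train.csv', blocksize='64MB')
print(ddf.npartitions)   # number of underlying pandas DataFrames

pdf = ddf.compute()      # concatenates every partition into one in-memory pandas DataFrame
print(type(pdf))         # <class 'pandas.core.frame.DataFrame'>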
The Dask delayed function decorates your functions so that they operate lazily. Rather than executing your function immediately, it defers execution, placing the function and its arguments into a task graph; the wrapped call returns a Delayed object.
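A minimal sketch of how delayed builds a task graph; the inc and add functions are toy examples made up for illustration:

from dask import delayed

@delayed
def inc(x):
    return x + 1

@delayed
def add(x, y):
    return x + y

# Nothing runs yet; these calls only assemble Delayed objects into a task graph
a = inc(1)
b = inc(2)
total = add(a, b)

print(total.compute())  # the graph executes here and prints 5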
You can store DataFrames in memory with Dask's persist(), which makes downstream queries that depend on the persisted data faster. This is great when you perform some expensive computation and want to keep the result in memory so it isn't rerun multiple times.
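A small sketch of persist, reusing the df from the question; the filter condition is hypothetical:

# Keep the result of an expensive step in memory as a Dask collection
subset = df[df.id > 0].persist()

# Both of these now reuse the persisted partitions instead of re-reading the CSV
subset.id.count().compute()
subset.id.nunique().compute()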
If you're using the single machine scheduler then do this:
from dask.diagnostics import ProgressBar

ProgressBar().register()
http://dask.pydata.org/en/latest/diagnostics-local.html
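Applied to the computation in the question, you can either register the bar once globally or scope it to a single computation with a context manager (the file path simply mirrors the one in the question):

from dask.diagnostics import ProgressBar
import dask.dataframe as dd

df = dd.read_csv('data/train.csv')

# Option 1: register globally, every compute() afterwards shows a bar
ProgressBar().register()
df.id.count().compute()

# Option 2: show the bar only for this one computation
with ProgressBar():
    df.id.count().compute()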
If you're using the distributed scheduler then do this:
from dask.distributed import progress

result = df.id.count().persist()
progress(result)
Or just use the dashboard
http://dask.pydata.org/en/latest/diagnostics-distributed.html
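A short end-to-end sketch for the distributed scheduler, assuming you are fine with a local cluster; Client() with no arguments starts one and prints a link to the dashboard described at the URL above:

from dask.distributed import Client, progress
import dask.dataframe as dd

client = Client()                 # local cluster; dashboard is typically at http://localhost:8787
df = dd.read_csv('data/train.csv')

result = df.id.count().persist()  # start the work in the background
progress(result)                  # live progress bar in the notebook
print(result.compute())           # fetch the final count once it's done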