Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

Would it make sense to use Snakemake and Dask together?

I have a Snakemake workflow that I've been using to train DL TensorFlow models. At a high level there are a few longish-running jobs (model training) that can be run in parallel. I would like to run these on the cloud and dask-cloudprovider seems like a promising option since I can leverage GPU's easily on ECS. To do this, though, would I have to rewrite my workflow using the Dask functions (maybe dask delayed)? Or is there some way to get Snakemake to use Dask?

like image 532
j sad Avatar asked Sep 19 '25 18:09

j sad


1 Answers

If you do a web search for "dask snakemake" you'll find a Github issue from 2017 that you might want to read through. It's certainly possible, but someone would need to write the integration.

You may also want to try Dask's integration with Airflow, or, perhaps a bit more modern, the Prefect library.

like image 101
MRocklin Avatar answered Sep 22 '25 07:09

MRocklin