I'm using Joblib to cache the results of a computationally expensive function in my Python script. The function's input arguments and return values are numpy arrays. The cache works fine for a single run of the script. Now I want to spawn multiple runs of the script in parallel to sweep a parameter in an experiment (the definition of the function stays the same across all runs).
Is there a way to share the joblib cache among multiple Python scripts running in parallel? This would save a lot of function evaluations that are repeated across different runs but do not repeat within a single run. I couldn't find anything about this in Joblib's documentation.
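For concreteness, here is a minimal sketch of the kind of setup I mean (expensive_fn, the cache path, and the sweep flag are placeholders):
import numpy as np
from joblib import Memory

mem = Memory(location="./joblib_cache", verbose=0)

@mem.cache
def expensive_fn(x):
    """Stand-in for the real computation; input and output are numpy arrays."""
    return np.fft.fft2(x).real

# each run is launched separately, e.g. python run.py --sweep-value 0.1,
# and calls expensive_fn on arrays that often recur across runs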
Joblib is a set of tools to provide lightweight pipelining in Python. In particular, it offers transparent disk-caching of functions with lazy re-evaluation (the memoize pattern), as well as easy, simple parallel computing.
Specify a common, fixed cachedir and decorate the function that you want to cache:
from joblib import Memory

cachedir = "/path/shared/by/all/runs"    # same directory for every run
mem = Memory(location=cachedir)          # location= in joblib >= 0.12; older versions used cachedir=

@mem.cache
def f(arguments):
    """do things"""
    pass
or simply
def g(arguments):
    pass

cached_g = mem.cache(g)
Then, even if you are working across processes or across machines, as long as all instances of your program have access to cachedir, common function calls can be cached there transparently.
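For example, each run of your parameter sweep could look like the sketch below (the cache path, the body of f, and the way the parameter is read are placeholders); whichever run calls f on a given array first pays the cost, and later runs read the result from disk:
# experiment.py -- one run of the sweep, launched as e.g. python experiment.py 0.1
import sys
import numpy as np
from joblib import Memory

cachedir = "/shared/joblib_cache"        # identical path in every run
mem = Memory(location=cachedir)

@mem.cache
def f(x):
    """Placeholder for the expensive computation."""
    return np.linalg.eigvalsh(x @ x.T)

param = float(sys.argv[1])               # the swept parameter for this run
x = np.full((500, 500), param)
result = f(x)                            # recomputed only if no run has cached this input yet
print(result[:5])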