I am trying to make a cache decorator for functions with numpy array input parameters <pre class="prettyprint"><code>from functools import lru_cache import numpy as np from time import sleep a = np.array([1,2,3,4]) @lru_cache() def square(array): sleep(1) return array * array square(a) </code></pre> But numpy arrays are not hashable, <pre class="prettyprint"><code>TypeError Traceback (most recent call last) <ipython-input-13-559f69d0dec3> in <module>() ----> 1 square(a) TypeError: unhashable type: 'numpy.ndarray' </code></pre> So they need to be converted to tuples. I have this working and caching correctly: <pre class="prettyprint"><code>@lru_cache() def square(array_hashable): sleep(1) array = np.array(array_hashable) return array * array square(tuple(a)) </code></pre> But I wanted to wrap it all up in a decorator, so far I have tried: <pre class="prettyprint"><code>def np_cache(function): def outter(array): array_hashable = tuple(array) @lru_cache() def inner(array_hashable_inner): array_inner = np.array(array_hashable_inner) return function(array_inner) return inner(array_hashable) return outter @np_cache def square(array): sleep(1) return array * array </code></pre> But caching is not working. Computation is performed but not cached properly, as I am always waiting 1 second. What am I missing here? I'm guessing <code>lru_cache</code> isn't getting the context right and its being instantiated in each call, but I don't know how to fix it. I have tried blindly throwing the <code>functools.wraps</code> decorator here and there with no luck.

Your wrapper function creates a new <code>inner()</code> function each time you call it. And that new function object is decorated at that time, so the end result is that each time <code>outter()</code> is called, a new <code>lru_cache()</code> is created and that'll be empty. An empty cache will always have to re-calculate the value. You need to create a decorator that attaches the cache to a function created just once per decorated target. If you are going to convert to a tuple before calling the cache, then you'll have to create two functions: <pre class="prettyprint"><code>from functools import lru_cache, wraps def np_cache(function): @lru_cache() def cached_wrapper(hashable_array): array = np.array(hashable_array) return function(array) @wraps(function) def wrapper(array): return cached_wrapper(tuple(array)) # copy lru_cache attributes over too wrapper.cache_info = cached_wrapper.cache_info wrapper.cache_clear = cached_wrapper.cache_clear return wrapper </code></pre> The <code>cached_wrapper()</code> function is created just once per call to <code>np_cache()</code> and is available to the <code>wrapper()</code> function as a closure. So <code>wrapper()</code> calls <code>cached_wrapper()</code>, which has a <code>@lru_cache()</code> attached to it, caching your tuples. I also copied across the two function references that <code>lru_cache</code> puts on a decorated function, so they are accessible via the returned wrapper as well. In addition, I also used the <code>@functools.wraps()</code> decorator to copy across metadata from the original function object to the wrapper, such as the name, annotations and documentation string. This is always a good idea, because that means your decorated function will be clearly identified in tracebacks, when debugging and when you need to access documentation or annotations. The decorator also adds a <code>__wrapped__</code> attribute pointing back to the original function, which would let you unwrap the decorator again if need be.

Cache decorator for numpy arrays

Tags:

python

python-3.x

decorator

caching

numpy

I am trying to make a cache decorator for functions with numpy array input parameters

from functools import lru_cache
import numpy as np
from time import sleep

a = np.array([1,2,3,4])

@lru_cache()
def square(array):
    sleep(1)
    return array * array

square(a)

But numpy arrays are not hashable,

TypeError                                 Traceback (most recent call last)
<ipython-input-13-559f69d0dec3> in <module>()
----> 1 square(a)

TypeError: unhashable type: 'numpy.ndarray'

So they need to be converted to tuples. I have this working and caching correctly:

@lru_cache()
def square(array_hashable):
    sleep(1)
    array = np.array(array_hashable)
    return array * array

square(tuple(a))

But I wanted to wrap it all up in a decorator, so far I have tried:

def np_cache(function):
    def outter(array):
        array_hashable = tuple(array)

        @lru_cache()
        def inner(array_hashable_inner):
            array_inner = np.array(array_hashable_inner)
            return function(array_inner)

        return inner(array_hashable)

    return outter

@np_cache
def square(array):
    sleep(1)
    return array * array

But caching is not working. Computation is performed but not cached properly, as I am always waiting 1 second.

What am I missing here? I'm guessing lru_cache isn't getting the context right and its being instantiated in each call, but I don't know how to fix it.

I have tried blindly throwing the functools.wraps decorator here and there with no luck.

995

asked Sep 14 '18 12:09

Susensio

1 Answers

Your wrapper function creates a new inner() function each time you call it. And that new function object is decorated at that time, so the end result is that each time outter() is called, a new lru_cache() is created and that'll be empty. An empty cache will always have to re-calculate the value.

You need to create a decorator that attaches the cache to a function created just once per decorated target. If you are going to convert to a tuple before calling the cache, then you'll have to create two functions:

from functools import lru_cache, wraps

def np_cache(function):
    @lru_cache()
    def cached_wrapper(hashable_array):
        array = np.array(hashable_array)
        return function(array)

    @wraps(function)
    def wrapper(array):
        return cached_wrapper(tuple(array))

    # copy lru_cache attributes over too
    wrapper.cache_info = cached_wrapper.cache_info
    wrapper.cache_clear = cached_wrapper.cache_clear

    return wrapper

The cached_wrapper() function is created just once per call to np_cache() and is available to the wrapper() function as a closure. So wrapper() calls cached_wrapper(), which has a @lru_cache() attached to it, caching your tuples.

I also copied across the two function references that lru_cache puts on a decorated function, so they are accessible via the returned wrapper as well.

In addition, I also used the @functools.wraps() decorator to copy across metadata from the original function object to the wrapper, such as the name, annotations and documentation string. This is always a good idea, because that means your decorated function will be clearly identified in tracebacks, when debugging and when you need to access documentation or annotations. The decorator also adds a __wrapped__ attribute pointing back to the original function, which would let you unwrap the decorator again if need be.

answered Sep 20 '22 17:09

Martijn Pieters

Related questions
                            
                                How to apply Polynomial Transformation to subset of features in scikitlearn
                            
                                Python debugger pdb not found on macOS High Sierra
                            
                                webbrowser.get — could not locate runnable browser
                            
                                AttributeError: 'Client' object has no attribute 'send_message' (Discord Bot)
                            
                                Adding regularizer to an existing layer of a trained model without resetting weights?
                            
                                What does it mean that a tf.variable is trainable in TensorFlow
                            
                                Seaborn heatmap - colorbar label font size
                            
                                How can I download and skip VGG weights that have no counterpart with my CNN in Keras?
                            
                                How do I calculate the Adjusted R-squared score using scikit-learn?
                            
                                How to ignore a warning inside a test using pytest?
                            
                                Does PIL image.convert("RGB") convert images to sRGB or AdobeRGB?
                            
                                Python-pptx: copy slide
                            
                                Is there any pytorch function can combine the specific continuous dimensions of tensor into one?
                            
                                Do the `if __name__ == "__main__": ` like idioms have a name of design pattern?
                            
                                convert a list of list into a list of list of tuple
                            
                                Python hypothesis: Ensure that input lists have same length
                            
                                Faster kNN algorithm in Python
                            
                                Maximum recursion depth exceeded. Multiprocessing and bs4
                            
                                Removing dupes in list of lists in Python
                            
                                Convert DatetimeIndex to datetime.date in pandas

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With