I have the following NumPy arrays:
arr_1 = np.array([[1,2],[3,4],[5,6]]) # 3 X 2
arr_2 = np.array([[0.5,0.6],[0.7,0.8],[0.9,1.0],[1.1,1.2],[1.3,1.4]]) # 5 X 2
arr_1 is clearly a 3 X 2 array, whereas arr_2 is a 5 X 2 array.
Now, without looping, I want to element-wise multiply arr_1 and arr_2 by applying a sliding window (window size 3) over arr_2.
Example:
Multiplication 1: np.multiply(arr_1,arr_2[:3,:])
Multiplication 2: np.multiply(arr_1,arr_2[1:4,:])
Multiplication 3: np.multiply(arr_1,arr_2[2:5,:])
I want to do this in some sort of matrix-multiplication form to make it faster than my current solution, which is of the form:
for i in range(arr_2.shape[0] - 3 + 1):
    np.multiply(arr_1, arr_2[i:i+3,:])
So if the number of rows in arr_2 is large (on the order of tens of thousands), this solution doesn't really scale well.
Any help would be much appreciated.
We can use NumPy broadcasting to create those sliding-window indices in a vectorized manner. Then, we can simply index into arr_2 with those indices to create a 3D array and perform element-wise multiplication with the 2D array arr_1, which in turn will bring on broadcasting again.
So, we would have a vectorized implementation like so -
W = arr_1.shape[0] # Window size
idx = np.arange(arr_2.shape[0]-W+1)[:,None] + np.arange(W) # (L, W) sliding window row indices
out = arr_1*arr_2[idx] # (L, W, 2) via broadcasting
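To make the indexing concrete, here is what idx looks like for the toy arrays from the question (a small sketch; the values are the ones given there):
import numpy as np

arr_1 = np.array([[1,2],[3,4],[5,6]])                                   # 3 X 2
arr_2 = np.array([[0.5,0.6],[0.7,0.8],[0.9,1.0],[1.1,1.2],[1.3,1.4]])   # 5 X 2

W = arr_1.shape[0]
idx = np.arange(arr_2.shape[0]-W+1)[:,None] + np.arange(W)
# idx is [[0 1 2]
#         [1 2 3]
#         [2 3 4]]  -- one row of window indices per window position
out = arr_1 * arr_2[idx]   # shape (3, 3, 2); out[i] equals np.multiply(arr_1, arr_2[i:i+3,:])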
Runtime test and verify results -
In [143]: # Input arrays
...: arr_1 = np.random.rand(3,2)
...: arr_2 = np.random.rand(10000,2)
...:
...: def org_app(arr_1,arr_2):
...:     W = arr_1.shape[0] # Window size
...:     L = arr_2.shape[0]-W+1
...:     out = np.empty((L,W,arr_1.shape[1]))
...:     for i in range(L):
...:         out[i] = np.multiply(arr_1,arr_2[i:i+W,:])
...:     return out
...:
...: def vectorized_app(arr_1,arr_2):
...:     W = arr_1.shape[0] # Window size
...:     idx = np.arange(arr_2.shape[0]-W+1)[:,None] + np.arange(W)
...:     return arr_1*arr_2[idx]
...:
In [144]: np.allclose(org_app(arr_1,arr_2),vectorized_app(arr_1,arr_2))
Out[144]: True
In [145]: %timeit org_app(arr_1,arr_2)
10 loops, best of 3: 47.3 ms per loop
In [146]: %timeit vectorized_app(arr_1,arr_2)
1000 loops, best of 3: 1.21 ms per loop
This is a nice case to test the speed of as_strided and Divakar's broadcasting.
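The timing cells below don't show their setup; presumably it matches Divakar's, along these lines (an assumed reconstruction using the names that appear in the cells):
import numpy as np
from numpy.lib.stride_tricks import as_strided

arr1 = np.random.rand(3,2)        # window-sized array, float64
arr2 = np.random.rand(10000,2)    # long array, float64
W = arr1.shape[0]                 # window size
L = arr2.shape[0]-W+1             # number of window positions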
In [281]: %%timeit
...: out=np.empty((L,W,arr1.shape[1]))
...: for i in range(L):
...:     out[i]=np.multiply(arr1,arr2[i:i+W,:])
...:
10 loops, best of 3: 48.9 ms per loop
In [282]: %%timeit
...: idx=np.arange(L)[:,None]+np.arange(W)
...: out=arr1*arr2[idx]
...:
100 loops, best of 3: 2.18 ms per loop
In [283]: %%timeit
...: arr3=as_strided(arr2, shape=(L,W,2), strides=(16,16,8))
...: out=arr1*arr3
...:
1000 loops, best of 3: 805 µs per loop
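Note that strides=(16,16,8) hard-codes the byte layout of a C-contiguous float64 array with 2 columns (8 bytes per element, 16 per row), which is why as_strided is fast but fragile here. On NumPy 1.20+, sliding_window_view from numpy.lib.stride_tricks builds an equivalent read-only view and computes the strides for you; a minimal sketch under the same setup:
from numpy.lib.stride_tricks import sliding_window_view  # NumPy >= 1.20

# Windows along axis 0; the window axis is appended last, giving shape (L, 2, W)
view = sliding_window_view(arr2, W, axis=0)
out = arr1 * view.transpose(0, 2, 1)   # reorder to (L, W, 2), then broadcast against arr1
The view itself copies no data; only the final multiplication materializes the (L, W, 2) result.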
See Create Numpy array without enumerating array for more of a comparison of these methods.