Using strides for an efficient moving average filter

Tags:

I recently learned about strides in the answer to this post, and was wondering how I could use them to compute a moving average filter more efficiently than what I proposed in this post (using convolution filters).

This is what I have so far. It takes a view of the original array then rolls it by the necessary amount and sums the kernel values to compute the average. I am aware that the edges are not handled correctly, but I can take care of that afterward... Is there a better and faster way? The objective is to filter large floating point arrays up to 5000x5000 x 16 layers in size, a task that scipy.ndimage.filters.convolve is fairly slow at.

Note that I am looking for 8-neighbour connectivity, that is a 3x3 filter takes the average of 9 pixels (8 around the focal pixel) and assigns that value to the pixel in the new image.

import numpy, scipy  filtsize = 3 a = numpy.arange(100).reshape((10,10)) b = numpy.lib.stride_tricks.as_strided(a, shape=(a.size,filtsize), strides=(a.itemsize, a.itemsize)) for i in range(0, filtsize-1):     if i > 0:         b += numpy.roll(b, -(pow(filtsize,2)+1)*i, 0) filtered = (numpy.sum(b, 1) / pow(filtsize,2)).reshape((a.shape[0],a.shape[1])) scipy.misc.imsave("average.jpg", filtered)

EDIT Clarification on how I see this working:

Current code:

use stride_tricks to generate an array like [[0,1,2],[1,2,3],[2,3,4]...] which corresponds to the top row of the filter kernel.
Roll along the vertical axis to get the middle row of the kernel [[10,11,12],[11,12,13],[13,14,15]...] and add it to the array I got in 1)
Repeat to get the bottom row of the kernel [[20,21,22],[21,22,23],[22,23,24]...]. At this point, I take the sum of each row and divide it by the number of elements in the filter, giving me the average for each pixel, (shifted by 1 row and 1 col, and with some oddities around edges, but I can take care of that later).

What I was hoping for is a better use of stride_tricks to get the 9 values or the sum of the kernel elements directly, for the entire array, or that someone can convince me of another more efficient method...

731

asked Feb 08 '11 18:02

Benjamin

1 Answers

For what it's worth, here's how you'd do it using "fancy" striding tricks. I was going to post this yesterday, but got distracted by actual work! :)

@Paul & @eat both have nice implementations using various other ways of doing this. Just to continue things from the earlier question, I figured I'd post the N-dimensional equivalent.

You're not going to be able to significantly beat scipy.ndimage functions for >1D arrays, however. (scipy.ndimage.uniform_filter should beat scipy.ndimage.convolve, though)

Moreover, if you're trying to get a multidimensional moving window, you risk having memory usage blow up whenever you inadvertently make a copy of your array. While the initial "rolling" array is just a view into the memory of your original array, any intermediate steps that copy the array will make a copy that is orders of magnitude larger than your original array (i.e. Let's say that you're working with a 100x100 original array... The view into it (for a filter size of (3,3)) will be 98x98x3x3 but use the same memory as the original. However, any copies will use the amount of memory that a full 98x98x3x3 array would!!)

Basically, using crazy striding tricks is great for when you want to vectorize moving window operations on a single axis of an ndarray. It makes it really easy to calculate things like a moving standard deviation, etc with very little overhead. When you want to start doing this along multiple axes, it's possible, but you're usually better off with more specialized functions. (Such as scipy.ndimage, etc)

At any rate, here's how you do it:

import numpy as np  def rolling_window_lastaxis(a, window):     """Directly taken from Erik Rigtorp's post to numpy-discussion.     <http://www.mail-archive.com/[email protected]/msg29450.html>"""     if window < 1:        raise ValueError, "`window` must be at least 1."     if window > a.shape[-1]:        raise ValueError, "`window` is too long."     shape = a.shape[:-1] + (a.shape[-1] - window + 1, window)     strides = a.strides + (a.strides[-1],)     return np.lib.stride_tricks.as_strided(a, shape=shape, strides=strides)  def rolling_window(a, window):     if not hasattr(window, '__iter__'):         return rolling_window_lastaxis(a, window)     for i, win in enumerate(window):         if win > 1:             a = a.swapaxes(i, -1)             a = rolling_window_lastaxis(a, win)             a = a.swapaxes(-2, i)     return a  filtsize = (3, 3) a = np.zeros((10,10), dtype=np.float) a[5:7,5] = 1  b = rolling_window(a, filtsize) blurred = b.mean(axis=-1).mean(axis=-1)

So what we get when we do b = rolling_window(a, filtsize) is an 8x8x3x3 array, that's actually a view into the same memory as the original 10x10 array. We could have just as easily used different filter size along different axes or operated only along selected axes of an N-dimensional array (i.e. filtsize = (0,3,0,3) on a 4-dimensional array would give us a 6 dimensional view).

We can then apply an arbitrary function to the last axis repeatedly to effectively calculate things in a moving window.

However, because we're storing temporary arrays that are much bigger than our original array on each step of mean (or std or whatever), this is not at all memory efficient! It's also not going to be terribly fast, either.

The equivalent for ndimage is just:

blurred = scipy.ndimage.uniform_filter(a, filtsize, output=a)

This will handle a variety of boundary conditions, do the "blurring" in-place without requiring a temporary copy of the array, and be very fast. Striding tricks are a good way to apply a function to a moving window along one axis, but they're not a good way to do it along multiple axes, usually....

Just my $0.02, at any rate...

180

answered Oct 16 '22 11:10

Joe Kington

Related questions
                            
                                How does mask_zero in Keras Embedding layer work?
                            
                                how to handle an asymptote/discontinuity with Matplotlib
                            
                                Python Iterators: What does iglob()'s Iterator provide over glob()'s list?
                            
                                pip: pulling updates from remote git repository
                            
                                Roll rows of a matrix independently
                            
                                How to import python file from git submodule
                            
                                How to print like jupyter notebook's default cell output
                            
                                how to implement tensorflow's next_batch for own data
                            
                                What's the most efficient way to find one of several substrings in Python?
                            
                                A good wxpython GUI builder?
                            
                                Requirements file for apt-get, similar to pip
                            
                                How I can get cartesian coordinate system in matplotlib?
                            
                                A good way to get the charset/encoding of an HTTP response in Python
                            
                                Catch python 'ImportError' if import from source directory
                            
                                How to use Python sets and add strings to it in as a dictionary value
                            
                                Install anaconda library from a local source
                            
                                Merge two arrays vertically to array of tuples using numpy
                            
                                ImportError: No module named '_curses' when trying to import blessings
                            
                                Any AOP support library for Python?
                            
                                How do you run nosetest from pycharm?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Using strides for an efficient moving average filter

Tags:

python

image-processing

filter

numpy

Benjamin

People also ask

1 Answers

Joe Kington

Recent Activity

Donate For Us