How to split a numpy array in fixed size chunks with and without overlap?

Tags:

Lets say I have an array:

>>> arr = np.array(range(9)).reshape(3, 3)
>>> arr
array([[0, 1, 2],
       [3, 4, 5],
       [6, 7, 8]])

I would like to create a function f(arr, shape=(2, 2)) that takes the array and a shape, and splits the array into chunks of the given shape without padding. Thus, by overlapping certain parts if necessary. For example:

>>> f(arr, shape=(2, 2))
array([[[[0, 1],
         [3, 4]],

        [[1, 2],
         [4, 5]]],

       [[[3, 4],
         [6, 7]],

        [[4, 5],
         [7, 8]]]])

I managed to creates to output above with np.lib.stride_tricks.as_strided(arr, shape=(2, 2, 2, 2), strides=(24, 8, 24, 8)). But I don't know how to generalize this for to all arrays and all chunk sizes.

Preferably, for 3D arrays.

If no overlap is necessary, it should avoid that. Another example:

>>> arr = np.array(range(16).reshape(4,4)
>>> arr
array([[ 0,  1,  2,  3],
       [ 4,  5,  6,  7],
       [ 8,  9, 10, 11],
       [12, 13, 14, 15]])
>>> f(arr, shape=(2,2))
array([[[[0, 1],
         [4, 5]],

        [[2, 3],
         [6, 7]]],

       [[[8, 9],
         [12, 13]],

        [[10, 11],
         [14, 15]]]])

skimage.util.view_as_blocks comes close, but requires that the array and block shape are compatible.

765

asked Mar 16 '17 10:03

Kay Lamerigts

Video Answer

1 Answers

There's a builtin in scikit-image as view_as_windows for doing exactly that -

from skimage.util.shape import view_as_windows

view_as_windows(arr, (2,2))

Sample run -

In [40]: arr
Out[40]: 
array([[0, 1, 2],
       [3, 4, 5],
       [6, 7, 8]])

In [41]: view_as_windows(arr, (2,2))
Out[41]: 
array([[[[0, 1],
         [3, 4]],

        [[1, 2],
         [4, 5]]],


       [[[3, 4],
         [6, 7]],

        [[4, 5],
         [7, 8]]]])

For the second part, use its cousin from the same family/module view_as_blocks -

from skimage.util.shape import view_as_blocks

view_as_blocks(arr, (2,2))

answered Oct 11 '22 18:10

Divakar

Related questions
                            
                                Bulk inserts with Flask-SQLAlchemy
                            
                                numpy: difference between NaN and masked array
                            
                                Plot multiple DataFrame columns in Seaborn FacetGrid
                            
                                Is "__module__" guaranteed to be defined during class creation?
                            
                                Is os.listdir() deterministic?
                            
                                Why Should Homebrew be used to Install Python?
                            
                                Django Abstract Models setting related_name with underscores
                            
                                Best way to override lineno in Python logger
                            
                                Maximum recursion depth error in Python when calling super's init. [duplicate]
                            
                                How do I extend UserCreationForm to include email field
                            
                                AttributeError: lower not found; using a Pipeline with a CountVectorizer in scikit-learn
                            
                                Pandas escape carriage return in to_csv
                            
                                Image recognition using TensorFlow [closed]
                            
                                Multiply scipy.lti transfer functions
                            
                                Fix Conflicting migrations detected in Django1.9
                            
                                Repeating values in a "group by" pandas dataframe
                            
                                py.test session level fixtures in setup_method
                            
                                TypeError: decoding str is not supported
                            
                                How to override Gunicorn's logging config to use a custom formatter
                            
                                import matplotlib failing with No module named _tkinter on heroku

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How to split a numpy array in fixed size chunks with and without overlap?

Tags:

python

arrays

multidimensional-array

numpy

Kay Lamerigts

People also ask

Video Answer

1 Answers

Divakar

Recent Activity

Donate For Us