Split last dimension of arrays in lower dimensional arrays

Tags: split, numpy

Assume we have an array with NxMxD shape. I want to get a list with D NxM arrays.

The obvious way of doing it would be:

np.dsplit(myarray, D)

However, this returns D NxMx1 arrays.

I can achieve the desired result by doing something like:

[myarray[..., i] for i in range(D)]

Or:

[np.squeeze(subarray) for subarray in np.dsplit(myarray, D)]

However, I feel like it is a bit redundant to need to perform an additional operation. Am I missing any numpy function that returns the desired result?
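For reference, the shape difference described above can be reproduced with a small sketch. The concrete sizes (N=2, M=3, D=4) and the variable names `split_parts` and `indexed_parts` are assumptions chosen only for illustration.

```python
import numpy as np

# Illustrative sizes only; any N, M, D would behave the same way.
N, M, D = 2, 3, 4
myarray = np.arange(N * M * D).reshape(N, M, D)

# np.dsplit keeps the split axis, so every piece is N x M x 1.
split_parts = np.dsplit(myarray, D)

# Integer indexing drops the last axis, so every piece is N x M.
indexed_parts = [myarray[..., i] for i in range(D)]

print(len(split_parts), split_parts[0].shape)      # 4 (2, 3, 1)
print(len(indexed_parts), indexed_parts[0].shape)  # 4 (2, 3)
```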

asked Mar 11 '15 14:03

Imanol Luengo

2 Answers

Try a.swapaxes(1,2).swapaxes(1,0), where a is your array:

>>> import numpy as np
>>> a = np.arange(24).reshape(2,3,4)
>>> a
array([[[ 0,  1,  2,  3],
        [ 4,  5,  6,  7],
        [ 8,  9, 10, 11]],

       [[12, 13, 14, 15],
        [16, 17, 18, 19],
        [20, 21, 22, 23]]])

>>> [a[:,:,i] for i in range(4)]
[array([[ 0,  4,  8],
       [12, 16, 20]]),
 array([[ 1,  5,  9],
       [13, 17, 21]]),
 array([[ 2,  6, 10],
       [14, 18, 22]]),
 array([[ 3,  7, 11],
       [15, 19, 23]])]

>>> a.swapaxes(1,2).swapaxes(1,0)
array([[[ 0,  4,  8],
        [12, 16, 20]],

       [[ 1,  5,  9],
        [13, 17, 21]],

       [[ 2,  6, 10],
        [14, 18, 22]],

       [[ 3,  7, 11],
        [15, 19, 23]]])

Edit: As pointed out by ajcr (thanks again), the transpose method is more convenient, since the two swaps can be done in one step:

a.transpose(2,0,1)
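If a list is still wanted rather than a single array, one possible sketch is to iterate over the transposed array, since iterating a NumPy array walks its first axis. The names `stacked`, `moved`, `as_list` and `expected` are illustrative, and `np.moveaxis` is shown only as an alternative spelling of the same axis reordering, not as something the answer itself used.

```python
import numpy as np

a = np.arange(24).reshape(2, 3, 4)

# Move the last axis to the front: shape (4, 2, 3). This is a view, not a copy.
stacked = a.transpose(2, 0, 1)

# np.moveaxis expresses the same reordering without listing every axis.
moved = np.moveaxis(a, -1, 0)

# Iterating over the first axis yields the D separate N x M arrays.
as_list = list(stacked)

# Same result as the list comprehension from the question.
expected = [a[..., i] for i in range(a.shape[-1])]

print(stacked.shape)                 # (4, 2, 3)
print(len(as_list), as_list[0].shape)  # 4 (2, 3)
```

Because both `transpose` and `moveaxis` return views, no data is copied until one of the pieces is modified or explicitly copied.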

answered Sep 30 '22 18:09

plonser

np.dsplit uses np.array_split, the core of which is:

sub_arys = []
sary = _nx.swapaxes(ary, axis, 0)
for i in range(Nsections):
    st = div_points[i]; end = div_points[i+1]
    sub_arys.append(_nx.swapaxes(sary[st:end], axis, 0))

with axis=-1, this is equivalent to:

[x[...,i:(i+1)] for i in np.arange(x.shape[-1])]  # or
[x[...,[i]] for i in np.arange(x.shape[-1])]

which accounts for the singleton dimension.
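The equivalence claimed above can be checked with a short sketch. The test array and the names `via_dsplit`, `via_slice` and `via_list_index` are assumptions for illustration. One difference worth noting: the slice form returns views, while indexing with a list is advanced indexing and returns copies.

```python
import numpy as np

x = np.arange(24).reshape(2, 3, 4)
D = x.shape[-1]

via_dsplit = np.dsplit(x, D)
via_slice = [x[..., i:i + 1] for i in range(D)]   # length-1 slice keeps the axis
via_list_index = [x[..., [i]] for i in range(D)]  # list index also keeps the axis

# All three give D pieces of shape (2, 3, 1) with identical contents.
for d, s, l in zip(via_dsplit, via_slice, via_list_index):
    assert d.shape == s.shape == l.shape == (2, 3, 1)
    assert np.array_equal(d, s) and np.array_equal(d, l)

# Basic slicing shares memory with x; advanced (list) indexing does not.
print(np.shares_memory(via_slice[0], x))       # True
print(np.shares_memory(via_list_index[0], x))  # False
```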

So there's nothing wrong or inefficient about your

[x[...,i] for i in np.arange(x.shape[-1])]

Actually, in quick timing tests, any use of dsplit is comparatively slow; its generality comes at a cost. So adding squeeze on top of it is relatively cheap.
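A rough way to repeat such a timing comparison is sketched below. The array size, the repeat count and the helper names (`by_index`, `by_dsplit`, `by_dsplit_squeeze`, `by_transpose`) are all assumptions for illustration; absolute numbers depend on the machine and NumPy version, so none are quoted here.

```python
import timeit

import numpy as np

# Hypothetical test array; the shape is arbitrary.
x = np.random.rand(50, 60, 30)
D = x.shape[-1]


def by_index():
    return [x[..., i] for i in range(D)]


def by_dsplit():
    return np.dsplit(x, D)


def by_dsplit_squeeze():
    # axis=-1 squeezes only the split axis, even if N or M happens to be 1.
    return [np.squeeze(s, axis=-1) for s in np.dsplit(x, D)]


def by_transpose():
    return list(x.transpose(2, 0, 1))


number = 200  # calls per measurement
for fn in (by_index, by_dsplit, by_dsplit_squeeze, by_transpose):
    total = timeit.timeit(fn, number=number)
    print(f"{fn.__name__:>18}: {total / number * 1e6:8.2f} us per call")
```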

But since you accepted the other answer, it looks like you really want a single array of the correct shape rather than a list of arrays. For many operations that makes sense. split is more useful when the subarrays have more than one 'row' along the split axis, or even an uneven number of 'rows'.
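As a sketch of the cases where splitting is the better fit, the example below cuts the last axis into pieces that are wider than one element, and into pieces of unequal width. The array and the names `pairs`, `uneven` and `at_indices` are illustrative assumptions.

```python
import numpy as np

x = np.arange(24).reshape(2, 3, 4)

# Two equal pieces, each two 'rows' wide along the last axis.
pairs = np.split(x, 2, axis=-1)
print([p.shape for p in pairs])       # [(2, 3, 2), (2, 3, 2)]

# np.array_split tolerates a section count that does not divide the axis evenly.
uneven = np.array_split(x, 3, axis=-1)
print([p.shape for p in uneven])      # [(2, 3, 2), (2, 3, 1), (2, 3, 1)]

# Explicit split points give full control over the piece widths.
at_indices = np.split(x, [1, 3], axis=-1)
print([p.shape for p in at_indices])  # [(2, 3, 1), (2, 3, 2), (2, 3, 1)]
```

Note that `np.split(x, 3, axis=-1)` would raise a `ValueError` here, because 4 is not divisible by 3; `np.array_split` is the variant that accepts uneven divisions.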

answered Sep 30 '22 18:09

hpaulj
