Numpy multi-dimensional array indexing swaps axis order

Tags:

python

numpy

I am working with multi-dimensional NumPy arrays. I have noticed some inconsistent behavior when indexing these arrays with other index arrays. For example:

import numpy as np
start = np.zeros((7,5,3))
a     = start[:,:,np.arange(2)]
b     = start[0,:,np.arange(2)]
c     = start[0,:,:2]
print 'a:', a.shape
print 'b:', b.shape
print 'c:', c.shape

In this example, I get the result:

a: (7, 5, 2)
b: (2, 5)
c: (5, 2)

This confuses me. Why do "b" and "c" not have the same dimensions? Why does "b" swap the axis order, but not "a"?

I have been able to design my code around these inconsistencies thanks to lots of unit tests, but understanding what is going on would be appreciated.

For reference, I am using Python 2.7.3 and NumPy 1.6.2 via MacPorts.

asked Aug 13 '12 21:08

gbarter

1 Answer

Syntactically, this looks like an inconsistency, but semantically, you're doing two very different things here. In your definition of a and b, you're doing advanced indexing, sometimes called fancy indexing, which returns a copy of the data. In your definition of c, you're doing basic slicing, which returns a view of the data.
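The copy-versus-view distinction can be checked directly. The sketch below reuses the arrays from the question and assumes a NumPy version recent enough to provide np.shares_memory (and Python 3 style print calls); it is an illustration, not part of the original question.

```python
import numpy as np

start = np.zeros((7, 5, 3))

a = start[:, :, np.arange(2)]   # advanced indexing: an index array is present
b = start[0, :, np.arange(2)]   # advanced indexing: integer plus index array
c = start[0, :, :2]             # basic slicing: integer and slices only

# np.shares_memory reports whether two arrays overlap in memory.
print(np.shares_memory(start, a))   # a is a copy
print(np.shares_memory(start, b))   # b is a copy
print(np.shares_memory(start, c))   # c is a view

# Writing through the view changes the original array;
# writing to the copy leaves the original untouched.
c[0, 0] = 1.0
b[0, 0] = 2.0
print(start[0, 0, 0])
```

Because c is a view, the assignment to c[0, 0] is visible in start, while the assignment to b[0, 0] is not.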

To tell the difference, it helps to understand how indices are passed to Python objects. Here are some examples:

>>> class ShowIndex(object):
...     def __getitem__(self, index):
...         print index
... 
>>> ShowIndex()[:,:]
(slice(None, None, None), slice(None, None, None))
>>> ShowIndex()[...,:]
(Ellipsis, slice(None, None, None))
>>> ShowIndex()[0:5:2,::-1]
(slice(0, 5, 2), slice(None, None, -1))
>>> ShowIndex()[0:5:2,np.arange(3)]
(slice(0, 5, 2), array([0, 1, 2]))
>>> ShowIndex()[0:5:2]
slice(0, 5, 2)
>>> ShowIndex()[5, 5]
(5, 5)
>>> ShowIndex()[5]
5
>>> ShowIndex()[np.arange(3)]
[0 1 2]

As you can see, there are many different possible configurations. First, individual items may be passed, or tuples of items may be passed. Second, the tuples may contain slice objects, Ellipsis objects, plain integers, or numpy arrays.

Basic slicing is activated when the index consists only of int, slice, or Ellipsis objects, or None (which is the same as numpy.newaxis). These can be passed singly or in a tuple. Here's what the docs have to say about how basic slicing is activated:

Basic slicing occurs when obj is a slice object (constructed by start:stop:step notation inside of brackets), an integer, or a tuple of slice objects and integers. Ellipsis and newaxis objects can be interspersed with these as well. In order to remain backward compatible with a common usage in Numeric, basic slicing is also initiated if the selection object is any sequence (such as a list) containing slice objects, the Ellipsis object, or the newaxis object, but no integer arrays or other embedded sequences.

Advanced indexing is activated when you pass a numpy array, a non-tuple sequence containing only integers or containing subsequences of any kind, or a tuple containing an array or subsequence.
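As a rough way to see which kind of index triggers which mode, the following sketch applies several index objects to a small array and checks whether the result shares memory with it. The helper name is_view and the sample indices are made up for illustration, and the results assume a reasonably current NumPy.

```python
import numpy as np

x = np.arange(24).reshape(2, 3, 4)

def is_view(arr, index):
    """Hypothetical helper: True if arr[index] shares memory with arr."""
    return np.shares_memory(arr, arr[index])

# Indices built only from ints, slices, Ellipsis and None: basic slicing.
basic = [
    1,
    slice(0, 2),
    (0, slice(None)),
    (Ellipsis, 0),
    (None, slice(None)),
]

# Indices that contain an array or a list of integers: advanced indexing.
advanced = [
    np.arange(2),
    [0, 1],
    (0, slice(None), np.arange(2)),
    (slice(None), [0, 1]),
]

basic_results = [is_view(x, idx) for idx in basic]
advanced_results = [is_view(x, idx) for idx in advanced]
print(basic_results)     # every entry is True (views)
print(advanced_results)  # every entry is False (copies)
```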

For details on how advanced indexing and basic slicing differ, see the NumPy indexing documentation. In this particular case, though, the cause is clear. It has to do with the following behavior when using partial indexing:

The rule for partial indexing is that the shape of the result (or the interpreted shape of the object to be used in setting) is the shape of x with the indexed subspace replaced with the broadcasted indexing subspace. If the index subspaces are right next to each other, then the broadcasted indexing space directly replaces all of the indexed subspaces in x. If the indexing subspaces are separated (by slice objects), then the broadcasted indexing space is first, followed by the sliced subspace of x.
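A small sketch of the two cases described in that rule, using a made-up four-dimensional array so that the adjacent and separated placements can be compared side by side:

```python
import numpy as np

x = np.zeros((7, 5, 3, 4))
idx = np.arange(2)  # broadcast shape of the index arrays is (2,)

# Index arrays on neighbouring axes (1 and 2): the broadcast shape
# replaces those axes in place.
adjacent = x[:, idx, idx, :]
print(adjacent.shape)   # (7, 2, 4)

# Index arrays on axes 0 and 2, separated by a slice: the broadcast
# shape goes first, followed by the sliced axes.
separated = x[idx, :, idx, :]
print(separated.shape)  # (2, 5, 4)
```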

In your definition of a, which uses advanced indexing, you effectively pass the sequence [0, 1] in as the third item of the tuple, and since no broadcasting happens (because there is no other sequence), everything happens as expected.

In your definition of b, also using advanced indexing, you effectively pass two sequences: [0], the first item (the integer is converted into an intp array), and [0, 1], the third item. These two items are broadcast together, and the result has the same shape as the third item. However, since the two index arrays are separated by a slice, we're faced with a problem: where in the new shape tuple do we insert the broadcasted shape? As the docs say,

there is no unambiguous place to drop in the indexing subspace, thus it is tacked-on to the beginning.

So the 2 that results from broadcasting is moved to the beginning of the shape tuple, producing an apparent transposition.
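To make the apparent transposition concrete, here is a sketch with distinguishable values in place of zeros. The three alternatives at the end are possible ways to get the (5, 2) layout, offered as suggestions rather than as the only or recommended approach; the variable names are made up for illustration.

```python
import numpy as np

start = np.arange(7 * 5 * 3).reshape(7, 5, 3)
cols = np.arange(2)

b = start[0, :, cols]   # advanced indexing, shape (2, 5)
c = start[0, :, :2]     # basic slicing, shape (5, 2)

# The integer 0 and the index array broadcast to shape (2,).
print(np.broadcast(0, cols).shape)

# b holds the same values as c, with the axes swapped.
print(np.array_equal(b, c.T))

# Possible ways to keep the (5, 2) layout with an index array:
two_step = start[0][:, cols]         # index the first axis separately
kept_axis = start[0:1, :, cols][0]   # slice instead of integer, then drop the axis
transposed = b.T                     # transpose the advanced-indexing result
print(two_step.shape, kept_axis.shape, transposed.shape)
```

In the first two alternatives only one advanced index is present in each indexing operation, so its broadcast shape stays in place instead of moving to the front.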

answered Nov 09 '22 04:11

senderle
