I'm pretty sure I'm missing something with integer indexing and could use some help. Say that I create a 2D array: <pre class="prettyprint"><code>>>> import numpy as np >>> x=np.array(range(24)).reshape((4,6)) >>> x array([[ 0, 1, 2, 3, 4, 5], [ 6, 7, 8, 9, 10, 11], [12, 13, 14, 15, 16, 17], [18, 19, 20, 21, 22, 23]]) </code></pre> I can then select row 1 and 2 with: <pre class="prettyprint"><code>>>> x[[1,2],:] array([[ 6, 7, 8, 9, 10, 11], [12, 13, 14, 15, 16, 17]]) </code></pre> Or the column 1 of rows 2 and 3 with: <pre class="prettyprint"><code>>>> x[[1,2],1] array([ 7, 13]) </code></pre> So it would makes sense to me that I can select columns 3, 4 and 5 of rows 1 and 2 with this: <pre class="prettyprint"><code>>>> x[[1,2],[3,4,5]] Traceback (most recent call last): File "<stdin>", line 1, in <module> ValueError: shape mismatch: objects cannot be broadcast to a single shape </code></pre> And instead I need to do it in two steps: <pre class="prettyprint"><code>>>> a=x[[1,2],:] >>> a array([[ 6, 7, 8, 9, 10, 11], [12, 13, 14, 15, 16, 17]]) >>> a[:,[3,4,5]] array([[ 9, 10, 11], [15, 16, 17]]) </code></pre> Coming from R, my expectations seem to be wrong. Can you confirm that this is indeed not possible in one step, or suggest a better alternative? Thanks! EDIT: please note my choice of rows and columns in the example happen to be consecutive, but they don't have to be. In other words, slice indexing won't do for my case.

You also have the option of using broadcasting among the indexing arrays, which is what I would normally do, rather than indexing twice, which creates an intermediate copy of your data: <pre class="prettyprint"><code>>>> x[[[1], [2]],[[3, 4, 5]]] array([[ 9, 10, 11], [15, 16, 17]]) </code></pre> To see a little better what is going on and how to handle larger numbers of indices: <pre class="prettyprint"><code>>>> row_idx = np.array([1, 2]) >>> col_idx = np.array([3, 4, 5]) >>> x[row_idx.reshape(-1, 1), col_idx] array([[ 9, 10, 11], [15, 16, 17]]) </code></pre>

numpy array integer indexing in more than one dimension

Tags:

python

numpy

I'm pretty sure I'm missing something with integer indexing and could use some help. Say that I create a 2D array:

>>> import numpy as np
>>> x=np.array(range(24)).reshape((4,6))
>>> x
array([[ 0,  1,  2,  3,  4,  5],
       [ 6,  7,  8,  9, 10, 11],
       [12, 13, 14, 15, 16, 17],
       [18, 19, 20, 21, 22, 23]])

I can then select row 1 and 2 with:

>>> x[[1,2],:]
array([[ 6,  7,  8,  9, 10, 11],
       [12, 13, 14, 15, 16, 17]])

Or the column 1 of rows 2 and 3 with:

>>> x[[1,2],1]
array([ 7, 13])

So it would makes sense to me that I can select columns 3, 4 and 5 of rows 1 and 2 with this:

>>> x[[1,2],[3,4,5]]
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
ValueError: shape mismatch: objects cannot be broadcast to a single shape

And instead I need to do it in two steps:

>>> a=x[[1,2],:]
>>> a
array([[ 6,  7,  8,  9, 10, 11],
       [12, 13, 14, 15, 16, 17]])
>>> a[:,[3,4,5]]
array([[ 9, 10, 11],
       [15, 16, 17]])

Coming from R, my expectations seem to be wrong. Can you confirm that this is indeed not possible in one step, or suggest a better alternative? Thanks!

EDIT: please note my choice of rows and columns in the example happen to be consecutive, but they don't have to be. In other words, slice indexing won't do for my case.

895

asked Jan 25 '14 10:01

Miquel

1 Answers

You also have the option of using broadcasting among the indexing arrays, which is what I would normally do, rather than indexing twice, which creates an intermediate copy of your data:

>>> x[[[1], [2]],[[3, 4, 5]]]
array([[ 9, 10, 11],
       [15, 16, 17]])

To see a little better what is going on and how to handle larger numbers of indices:

>>> row_idx = np.array([1, 2])
>>> col_idx = np.array([3, 4, 5])
>>> x[row_idx.reshape(-1, 1), col_idx]
array([[ 9, 10, 11],
       [15, 16, 17]])

answered Oct 12 '22 15:10

Jaime

Related questions
                            
                                Setting the angle of a turtle in Python
                            
                                Gtk-Message: Failed to load module "canberra-gtk-module"
                            
                                using threading in pygame
                            
                                '_csv.writer' object has no attribute 'write'
                            
                                Python + GTK - How to suppress warnings
                            
                                Creating a view function without returning a response in Flask
                            
                                How to properly handle wrong urlsafe key provided? [duplicate]
                            
                                how to find source collections.deque?
                            
                                How to call a celery task delay function from non-python languages such as Java?
                            
                                "OSError: dlopen(libSystem.dylib, 6): image not found" (OS X + macports + Celery 3.1.7)
                            
                                get div from HTML with Python
                            
                                How to use the debugging tool in Spyder for python scripts?
                            
                                Finding a nonrecursive DOM subnode in Python using BeautifulSoup
                            
                                Why does X.dot(X.T) require so much memory in numpy?
                            
                                Flask: asynchronous response to client
                            
                                Speed up nested for loop with elements exponentiation
                            
                                BadStatusLine exception raised when returning reply from server in Python 3
                            
                                Creating custom string type in Python
                            
                                How to open a mp4 file with python?
                            
                                How can I store and print the top 20% feature names and scores?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With