Indexing with Masked Arrays in numpy

Tags:

python

indexing

numpy

I have a bit of code that attempts to find the contents of an array at indices specified by another array, which may include indices that are out of range for the first array.

import numpy as np
input = np.arange(0, 5)
indices = np.array([0, 1, 2, 99])

What I want to do is this: print input[indices] and get [0 1 2]

But this yields an exception (as expected):

IndexError: index 99 out of bounds 0<=index<5

So I thought I could use masked arrays to hide the out of bounds indices:

indices = np.ma.masked_greater_equal(indices, 5)

But still:

>print input[indices]
IndexError: index 99 out of bounds 0<=index<5

Even though:

>np.max(indices)
2

So I'm having to fill the masked array first, which is annoying, since I don't know what fill value I could use to not select any indices for those that are out of range:

print input[np.ma.filled(indices, 0)]

[0 1 2 0]

So my question is: how can you use numpy efficiently to select indices safely from an array without overstepping the bounds of the input array?


asked Oct 04 '10 11:10

Widjet

2 Answers

Without using masked arrays, you could remove the indices greater than or equal to 5 like this:

print input[indices[indices<5]]

Edit: note that if you also wanted to discard negative indices, you could write:

print input[indices[(0 <= indices) & (indices < 5)]]
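As a minimal, self-contained sketch of the filtering above (written for Python 3, with the array renamed to `values` so it does not shadow the built-in `input`, and with `len(values)` in place of the hard-coded 5; both are my choices, not part of the original answer):

```python
import numpy as np

values = np.arange(0, 5)                 # stand-in for `input` in the question
indices = np.array([0, 1, 2, 99, -1])    # -1 added here to exercise the lower bound

# Boolean mask: True where an index falls inside [0, len(values)).
in_bounds = (indices >= 0) & (indices < len(values))

# Drop the out-of-range indices first, then index as usual.
selected = values[indices[in_bounds]]

print(selected)  # [0 1 2]
```

Whether negative indices should be discarded is a design choice: NumPy normally treats them as counting from the end of the array, so keep the lower bound only if that wrap-around is unwanted.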

answered Sep 28 '22 06:09

François

It is a VERY BAD idea to index with masked arrays. There was a (very short) time when using MaskedArrays for indexing would have thrown an exception, but that was a bit too harsh...

In your test, you're filtering indices to find the entries matching a condition. What should you do with the missing entries of your MaskedArray? Is the condition False? True? Should you use a default? It's up to you, the user, to decide what to do.

Using indices.filled(0) means that when an item of indices is masked (as in, undefined), you want to take the first index (0) as default. Probably not what you wanted.

Here, I would have simply used input[indices.compressed()]: the compressed method flattens your MaskedArray, keeping only the unmasked entries.

But as you realized, you probably didn't need MaskedArrays in the first place.
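To make the difference between filling and compressing concrete, here is a small sketch in Python 3. The name `values` replaces `input` from the question to avoid shadowing the built-in, and `len(values)` replaces the hard-coded 5; those are assumptions made for the illustration.

```python
import numpy as np

values = np.arange(0, 5)                 # stand-in for `input` in the question
indices = np.array([0, 1, 2, 99])

# Mask every index that is too large for `values`.
masked = np.ma.masked_greater_equal(indices, len(values))

# filled(0) substitutes 0 for each masked entry, so element 0 is selected again.
via_filled = values[masked.filled(0)]

# compressed() returns a plain 1-D ndarray holding only the unmasked entries.
via_compressed = values[masked.compressed()]

print(via_filled)      # [0 1 2 0]
print(via_compressed)  # [0 1 2]
```

Note that the result of `compressed()` is shorter than `indices`, so positions in the output no longer line up one-to-one with positions in the original index array.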

answered Sep 28 '22 06:09

Pierre GM
