I would like to get the index of a 2 dimensional Numpy array that matches a row. For example, my array is this: <pre class="prettyprint"><code>vals = np.array([[0, 0], [1, 0], [2, 0], [0, 1], [1, 1], [2, 1], [0, 2], [1, 2], [2, 2], [0, 3], [1, 3], [2, 3], [0, 0], [1, 0], [2, 0], [0, 1], [1, 1], [2, 1], [0, 2], [1, 2], [2, 2], [0, 3], [1, 3], [2, 3]]) </code></pre> I would like to get the index that matches the row [0, 1] which is index 3 and 15. When I do something like <code>numpy.where(vals == [0 ,1])</code> I get... <pre class="prettyprint"><code>(array([ 0, 3, 3, 4, 5, 6, 9, 12, 15, 15, 16, 17, 18, 21]), array([0, 0, 1, 1, 1, 0, 0, 0, 0, 1, 1, 1, 0, 0])) </code></pre> I want index array([3, 15]).

You need the <code>np.where</code> function to get the indexes: <pre class="prettyprint"><code>>>> np.where((vals == (0, 1)).all(axis=1)) (array([ 3, 15]),) </code></pre> Or, as the documentation states: <blockquote> If only condition is given, return <code>condition.nonzero()</code> </blockquote> You could directly call <code>.nonzero()</code> on the array returned by <code>.all</code>: <pre class="prettyprint"><code>>>> (vals == (0, 1)).all(axis=1).nonzero() (array([ 3, 15]),) </code></pre> To dissassemble that: <pre class="prettyprint"><code>>>> vals == (0, 1) array([[ True, False], [False, False], ... [ True, False], [False, False], [False, False]], dtype=bool) </code></pre> and calling the <code>.all</code> method on that array (with <code>axis=1</code>) gives you <code>True</code> where both are True: <pre class="prettyprint"><code>>>> (vals == (0, 1)).all(axis=1) array([False, False, False, True, False, False, False, False, False, False, False, False, False, False, False, True, False, False, False, False, False, False, False, False], dtype=bool) </code></pre> and to get which indexes are <code>True</code>: <pre class="prettyprint"><code>>>> np.where((vals == (0, 1)).all(axis=1)) (array([ 3, 15]),) </code></pre> or <pre class="prettyprint"><code>>>> (vals == (0, 1)).all(axis=1).nonzero() (array([ 3, 15]),) </code></pre> <hr> I find my solution a bit more readable, but as unutbu points out, the following may be faster, and returns the same value as <code>(vals == (0, 1)).all(axis=1)</code>: <pre class="prettyprint"><code>>>> (vals[:, 0] == 0) & (vals[:, 1] == 1) </code></pre>

<pre class="prettyprint"><code>In [5]: np.where((vals[:,0] == 0) & (vals[:,1]==1))[0] Out[5]: array([ 3, 15]) </code></pre> <hr> I'm not sure why, but this is significantly faster than <code>np.where((vals == (0, 1)).all(axis=1))</code>: <pre class="prettyprint"><code>In [34]: vals2 = np.tile(vals, (1000,1)) In [35]: %timeit np.where((vals2 == (0, 1)).all(axis=1))[0] 1000 loops, best of 3: 808 µs per loop In [36]: %timeit np.where((vals2[:,0] == 0) & (vals2[:,1]==1))[0] 10000 loops, best of 3: 152 µs per loop </code></pre>

Using the numpy_indexed package, you can simply write: <pre class="prettyprint"><code>import numpy_indexed as npi print(np.flatnonzero(npi.contains([[0, 1]], vals))) </code></pre>

Find matching rows in 2 dimensional numpy array

Tags:

python

numpy

scipy

I would like to get the index of a 2 dimensional Numpy array that matches a row. For example, my array is this:

vals = np.array([[0, 0],
                 [1, 0],
                 [2, 0],
                 [0, 1],
                 [1, 1],
                 [2, 1],
                 [0, 2],
                 [1, 2],
                 [2, 2],
                 [0, 3],
                 [1, 3],
                 [2, 3],
                 [0, 0],
                 [1, 0],
                 [2, 0],
                 [0, 1],
                 [1, 1],
                 [2, 1],
                 [0, 2],
                 [1, 2],
                 [2, 2],
                 [0, 3],
                 [1, 3],
                 [2, 3]])

I would like to get the index that matches the row [0, 1] which is index 3 and 15. When I do something like numpy.where(vals == [0 ,1]) I get...

(array([ 0,  3,  3,  4,  5,  6,  9, 12, 15, 15, 16, 17, 18, 21]), array([0, 0, 1, 1, 1, 0, 0, 0, 0, 1, 1, 1, 0, 0]))

I want index array([3, 15]).

300

asked Sep 13 '14 13:09

b10hazard

3 Answers

You need the np.where function to get the indexes:

>>> np.where((vals == (0, 1)).all(axis=1))
(array([ 3, 15]),)

Or, as the documentation states:

If only condition is given, return condition.nonzero()

You could directly call .nonzero() on the array returned by .all:

>>> (vals == (0, 1)).all(axis=1).nonzero()
(array([ 3, 15]),)

To dissassemble that:

>>> vals == (0, 1)
array([[ True, False],
       [False, False],
       ...
       [ True, False],
       [False, False],
       [False, False]], dtype=bool)

and calling the .all method on that array (with axis=1) gives you True where both are True:

>>> (vals == (0, 1)).all(axis=1)
array([False, False, False,  True, False, False, False, False, False,
       False, False, False, False, False, False,  True, False, False,
       False, False, False, False, False, False], dtype=bool)

and to get which indexes are True:

>>> np.where((vals == (0, 1)).all(axis=1))
(array([ 3, 15]),)

>>> (vals == (0, 1)).all(axis=1).nonzero()
(array([ 3, 15]),)

I find my solution a bit more readable, but as unutbu points out, the following may be faster, and returns the same value as (vals == (0, 1)).all(axis=1):

>>> (vals[:, 0] == 0) & (vals[:, 1] == 1)

142

answered Oct 18 '22 05:10

Russia Must Remove Putin

In [5]: np.where((vals[:,0] == 0) & (vals[:,1]==1))[0]
Out[5]: array([ 3, 15])

I'm not sure why, but this is significantly faster than
np.where((vals == (0, 1)).all(axis=1)):

In [34]: vals2 = np.tile(vals, (1000,1))

In [35]: %timeit np.where((vals2 == (0, 1)).all(axis=1))[0]
1000 loops, best of 3: 808 µs per loop

In [36]: %timeit np.where((vals2[:,0] == 0) & (vals2[:,1]==1))[0]
10000 loops, best of 3: 152 µs per loop

answered Oct 18 '22 03:10

unutbu

Using the numpy_indexed package, you can simply write:

import numpy_indexed as npi
print(np.flatnonzero(npi.contains([[0, 1]], vals)))

answered Oct 18 '22 05:10

Eelco Hoogendoorn

Related questions
                            
                                About the changing id of an immutable string
                            
                                Can I import Python's 3.6's formatted string literals (f-strings) into older 3.x, 2.x Python?
                            
                                What's the best way to initialise and use constants across Python classes?
                            
                                Create hourly/minutely time range using pandas
                            
                                Django model inheritance: create sub-instance of existing instance (downcast)?
                            
                                How can I allow django admin to set a field to NULL?
                            
                                How do I tell if a column in a pandas dataframe is of type datetime? How do I tell if a column is numerical?
                            
                                Ignore case in Python strings [duplicate]
                            
                                In what situation should the built-in 'operator' module be used in python?
                            
                                Python optparse Values Instance
                            
                                Get the mean across multiple Pandas DataFrames
                            
                                Python Argparse conditionally required arguments
                            
                                List of Tuples to DataFrame Conversion [duplicate]
                            
                                Python's Multiple Inheritance: Picking which super() to call
                            
                                How do I bind the enter key to a function in tkinter?
                            
                                How to update a document using elasticsearch-py?
                            
                                list memory usage in ipython and jupyter
                            
                                Pandas DataFrames with NaNs equality comparison
                            
                                Matplotlib: How to plot images instead of points?
                            
                                Try-except clause with an empty except code [duplicate]

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With