I have a 2d boolean array from which I'm trying to extract the indices of the true values. Numpy's nonzero function decomposes my 2d array into a list of x's and y's of positions, which is problematic. Is it possible to find the column indices of the <code>true</code> elements while preserving the row order? Each of the true values in the columns are associated with each other in the same row so splitting them into (row index, column index) pairs isn't helpful. Is this possible? I was thinking that maybe <code>np.apply_along_axis</code> maybe useful.

I did not quite understand what you wanted (maybe an example would help), but two guesses: If you want to see if there are any Trues on a row, then: <pre class="prettyprint"><code>np.any(a, axis=1) </code></pre> will give you an array with boolean value for each row. Or if you want to get the indices for the <code>True</code>s row-by-row, then <pre class="prettyprint"><code>testarray = np.array([ [True, False, True], [True, True, False], [False, False, False], [False, True, False]]) collists = [ np.nonzero(t)[0] for t in testarray ] </code></pre> This gives: <pre class="prettyprint"><code>>>> collists [array([0, 2]), array([0, 1]), array([], dtype=int64), array([1])] </code></pre> If you want to know the indices of columns with a <code>True</code> on row 3, then: <pre class="prettyprint"><code>>>> collists[3] array([1]) </code></pre> There is no pure array-based way of accomplishing this because the number of items on each row varies. That is why we need the lists. On the other hand, the performance is decent, I tried it with a 10000 x 10000 random boolean array, and it took 774 ms to complete the task.

Apply numpy nonzero row-wise?

Tags:

numpy

I have a 2d boolean array from which I'm trying to extract the indices of the true values. Numpy's nonzero function decomposes my 2d array into a list of x's and y's of positions, which is problematic.

Is it possible to find the column indices of the true elements while preserving the row order?

Each of the true values in the columns are associated with each other in the same row so splitting them into (row index, column index) pairs isn't helpful. Is this possible?

I was thinking that maybe np.apply_along_axis maybe useful.

215

asked Jul 11 '14 17:07

tlnagy

1 Answers

I did not quite understand what you wanted (maybe an example would help), but two guesses:

If you want to see if there are any Trues on a row, then:

np.any(a, axis=1)

will give you an array with boolean value for each row.

Or if you want to get the indices for the Trues row-by-row, then

testarray = np.array([
    [True, False, True],
    [True, True, False],
    [False, False, False],
    [False, True, False]])

collists = [ np.nonzero(t)[0] for t in testarray ]

This gives:

>>> collists
[array([0, 2]), array([0, 1]), array([], dtype=int64), array([1])]

If you want to know the indices of columns with a True on row 3, then:

>>> collists[3]
array([1])

There is no pure array-based way of accomplishing this because the number of items on each row varies. That is why we need the lists. On the other hand, the performance is decent, I tried it with a 10000 x 10000 random boolean array, and it took 774 ms to complete the task.

145

answered Oct 18 '22 02:10

DrV

Related questions
                            
                                How to stack arrays and scalars in numpy?
                            
                                Pandas how to split dataframe by column by interval
                            
                                AttributeError list object has no attribute add
                            
                                How to do a cumulative "all"
                            
                                How to convert pandas dataframe columns to native python data types?
                            
                                Pandas DataFrame to lists of lists including headers
                            
                                How to show all the element names in a npz file without having to load the completely?
                            
                                NumPy 2D array: selecting indices in a circle
                            
                                Is there a scope for (numpy) random seeds?
                            
                                TypeError: can't convert np.ndarray of type numpy.object_
                            
                                speed of elementary mathematical operations in Numpy/Python: why is integer division slowest?
                            
                                Write multiple numpy arrays to file
                            
                                & vs * and | vs +
                            
                                How to organize values in a numpy array into bins that contain a certain range of values?
                            
                                Opencv Python display raw image
                            
                                Iterating over arrays in cython, is list faster than np.array?
                            
                                a simple, matlab-like way of finding the null space of a small matrix in numpy (and number formatting) [duplicate]
                            
                                Exe created with py2exe doesn't work and returns logfile with errors
                            
                                Returning groups of correlated columns in pandas data frame
                            
                                Numpy element wise division not working as expected

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With