Is it possible to get the length of the nonzero elements in a numpy array without iterating over the array or masking the array. Speed is the main goal of calculating the length. Essentially, something like <code>len(array).where(array != 0)</code>. If it changes the answer, each row will begin with zeros. The array is filled on the diagonal with zeros.

Assuming you mean total number of nonzero elements (and not total number of nonzero rows): <pre class="prettyprint"><code>In [12]: a = np.random.randint(0, 3, size=(100,100)) In [13]: timeit len(a.nonzero()[0]) 1000 loops, best of 3: 306 us per loop In [14]: timeit (a != 0).sum() 10000 loops, best of 3: 46 us per loop </code></pre> or even better: <pre class="prettyprint"><code>In [22]: timeit np.count_nonzero(a) 10000 loops, best of 3: 39 us per loop </code></pre> This last one, <code>count_nonzero</code>, seems to behave well when the array is small, too, whereas the <code>sum</code> trick not so much: <pre class="prettyprint"><code>In [33]: a = np.random.randint(0, 3, size=(10,10)) In [34]: timeit len(a.nonzero()[0]) 100000 loops, best of 3: 6.18 us per loop In [35]: timeit (a != 0).sum() 100000 loops, best of 3: 13.5 us per loop In [36]: timeit np.count_nonzero(a) 1000000 loops, best of 3: 686 ns per loop </code></pre>

<code>len(np.nonzero(array)[0])</code> ? <ul> <li> <code>np.nonzero</code> returns a tuple of indices, whose length is equal to the number of dimensions in the initial array</li> <li>we get just the indices along the first dimension with <code>[0]</code> </li> <li>compute its length with <code>len</code> </li> </ul>

Get the number of nonzero elements in a numpy array?

2 Answers

Assuming you mean total number of nonzero elements (and not total number of nonzero rows):

In [12]: a = np.random.randint(0, 3, size=(100,100))

In [13]: timeit len(a.nonzero()[0])
1000 loops, best of 3: 306 us per loop

In [14]: timeit (a != 0).sum()
10000 loops, best of 3: 46 us per loop

or even better:

In [22]: timeit np.count_nonzero(a)
10000 loops, best of 3: 39 us per loop

This last one, count_nonzero, seems to behave well when the array is small, too, whereas the sum trick not so much:

In [33]: a = np.random.randint(0, 3, size=(10,10))

In [34]: timeit len(a.nonzero()[0])
100000 loops, best of 3: 6.18 us per loop

In [35]: timeit (a != 0).sum()
100000 loops, best of 3: 13.5 us per loop

In [36]: timeit np.count_nonzero(a)
1000000 loops, best of 3: 686 ns per loop

176

answered Sep 20 '22 10:09

DSM

len(np.nonzero(array)[0]) ?

np.nonzero returns a tuple of indices, whose length is equal to the number of dimensions in the initial array
we get just the indices along the first dimension with [0]
compute its length with len

answered Sep 18 '22 10:09

Andrea Zonca

Related questions
                            
                                Produce PDF files, draw polygons with rounded corners
                            
                                Integer division in Python
                            
                                How to use QPrinter and QPrintPreviewDialog
                            
                                CodingBat sum67: why is this solution wrong?
                            
                                New line for input in Python
                            
                                binary to string, better than a dictionary?
                            
                                How to convert a string data to a JSON object in python?
                            
                                Removing duplicate interaction pairs in python sets
                            
                                An unusual Python syntax element frequently used in Matplotlib
                            
                                sorting values of python dict using sorted builtin function
                            
                                Howto use Django class based UpdateViews with FileFields
                            
                                Django with psycopg2 plugin
                            
                                Localhost Server Refusing Connection
                            
                                How Does Calling Work In Python?
                            
                                Why do NumPy and SciPy have a lot of the same functions? Which should I prefer? [duplicate]
                            
                                I don't understand the Node.js architecture [closed]
                            
                                Cannot get POST values with cgi.FieldStorage
                            
                                Why does len() not support iterators?
                            
                                Simple nested for loop not working correctly
                            
                                How to format pubDate with Python

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Get the number of nonzero elements in a numpy array?

Tags:

python

numpy

Jzl5325

People also ask

2 Answers

DSM

Andrea Zonca

Recent Activity

Donate For Us