I need to calculate the number of non-NaN elements in a numpy ndarray matrix. How would one efficiently do this in Python? Here is my simple code for achieving this:
```python
import numpy as np

def numberOfNonNans(data):
    count = 0
    for i in data:
        if not np.isnan(i):
            count += 1
    return count
```
Is there a built-in function for this in numpy? Efficiency is important because I'm doing Big Data analysis.
Thanks for any help!
To count the non-NaN entries in the dataset, call np.isnan to get a boolean mask that is True wherever a value is NaN, invert that mask with ~, and then pass the result to np.count_nonzero to sum up the total.
Use count_nonzero() to count True elements in a NumPy array. NumPy provides the function count_nonzero(arr, axis=None), which returns the count of non-zero values in the given array. When axis is None, it counts the non-zero values across the entire flattened array.
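As a minimal sketch of the above (the small array here is invented for illustration): inverting the NaN mask and passing it to count_nonzero counts non-NaN values, either over the whole array or per axis.

```python
import numpy as np

# Hypothetical 2x3 array with three NaNs, for illustration only.
arr = np.array([[1.0, np.nan, 3.0],
                [np.nan, np.nan, 6.0]])

mask = ~np.isnan(arr)                        # True where the value is NOT NaN

total = np.count_nonzero(mask)               # axis=None: count over the whole array
per_column = np.count_nonzero(mask, axis=0)  # count of non-NaN values per column

print(total)       # → 3
print(per_column)  # → [1 0 2]
```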
(Note that np.char.count is a different NumPy function: it counts occurrences of a substring within each element of a string array, which is not what is needed here.)
```python
np.count_nonzero(~np.isnan(data))
```

`~` inverts the boolean mask returned by np.isnan. np.count_nonzero counts values that are not 0/False. Calling `.sum()` on the inverted mask gives the same result, but count_nonzero may be the clearer choice.
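To make the equivalence concrete, here is a small demonstration (the sample array is made up for illustration) showing that the mask-based expressions and the size-minus-NaN-count approach all agree:

```python
import numpy as np

# Hypothetical data: five elements, two of them NaN.
data = np.array([1.0, np.nan, 2.5, np.nan, 0.0])

a = np.count_nonzero(~np.isnan(data))  # count True entries in the inverted mask
b = (~np.isnan(data)).sum()            # summing booleans gives the same count
c = data.size - np.isnan(data).sum()   # total size minus the NaN count

print(a, b, c)  # → 3 3 3  (note: the 0.0 entry is counted; it is not NaN)
```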
Testing speed:
```
In [23]: data = np.random.random((10000,10000))

In [24]: data[[np.random.random_integers(0,10000, 100)],:][:, [np.random.random_integers(0,99, 100)]] = np.nan

In [25]: %timeit data.size - np.count_nonzero(np.isnan(data))
1 loops, best of 3: 309 ms per loop

In [26]: %timeit np.count_nonzero(~np.isnan(data))
1 loops, best of 3: 345 ms per loop

In [27]: %timeit data.size - np.isnan(data).sum()
1 loops, best of 3: 339 ms per loop
```
```python
data.size - np.count_nonzero(np.isnan(data))
```

seems to be marginally the fastest here; other data might give different relative speed results.
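If you want to reproduce a setup like the one timed above on a current NumPy, note that np.random.random_integers has since been removed; a sketch using the modern Generator API instead (the seed, sizes, and index counts here are arbitrary choices, and np.ix_ is used so the assignment actually writes into the array rather than into a fancy-indexing copy):

```python
import numpy as np

rng = np.random.default_rng(0)
data = rng.random((1000, 1000))    # smaller than the original 10000x10000 for a quick run

rows = rng.integers(0, 1000, 50)   # rng.integers replaces the removed random_integers
cols = rng.integers(0, 1000, 50)
data[np.ix_(rows, cols)] = np.nan  # plant NaNs at the row/column crossings

count = data.size - np.count_nonzero(np.isnan(data))
```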