How to calculate mean value of an array (A) avoiding nan? <pre class="prettyprint"><code>import numpy as np A = [5 nan nan nan nan 10] M = np.mean(A[A!=nan]) does not work Any idea? </code></pre>

Use <code>numpy.isnan</code>: <pre class="prettyprint"><code>>>> import numpy as np >>> A = np.array([5, np.nan, np.nan, np.nan, np.nan, 10]) >>> np.isnan(A) array([False, True, True, True, True, False], dtype=bool) >>> ~np.isnan(A) array([ True, False, False, False, False, True], dtype=bool) >>> A[~np.isnan(A)] array([ 5., 10.]) >>> A[~np.isnan(A)].mean() 7.5 </code></pre> because you cannot compare <code>nan</code> with <code>nan</code>: <pre class="prettyprint"><code>>>> np.nan == np.nan False >>> np.nan != np.nan True >>> np.isnan(np.nan) True </code></pre>

Get mean value avoiding nan using numpy in python [duplicate]

Tags:

python

arrays

numpy

How to calculate mean value of an array (A) avoiding nan?

import numpy as np  A = [5    nan    nan    nan    nan  10] M = np.mean(A[A!=nan]) does not work Any idea?

590

asked Nov 08 '13 06:11

2964502

2 Answers

An other possibility is the following:

import numpy from scipy.stats import nanmean # nanmedian exists too, if you need it A = numpy.array([5, numpy.nan, numpy.nan, numpy.nan, numpy.nan, 10]) print nanmean(A) # gives 7.5 as expected

i guess this looks more elegant (and readable) than the other solution already given

edit: apparently (@Jaime) reports that this functionality already exists directly in the latest numpy (1.8) as well, so no need to import scipy.stats anymore if you have that version of numpy:

import numpy A = numpy.array([5, numpy.nan, numpy.nan, numpy.nan, numpy.nan, 10]) print numpy.nanmean(A)

the first solution works also for people who dont have the latest version of numpy (like me)

185

answered Sep 21 '22 00:09

usethedeathstar

Use numpy.isnan:

>>> import numpy as np  >>> A = np.array([5, np.nan, np.nan, np.nan, np.nan, 10]) >>> np.isnan(A) array([False,  True,  True,  True,  True, False], dtype=bool) >>> ~np.isnan(A) array([ True, False, False, False, False,  True], dtype=bool) >>> A[~np.isnan(A)] array([  5.,  10.]) >>> A[~np.isnan(A)].mean() 7.5

because you cannot compare nan with nan:

>>> np.nan == np.nan False >>> np.nan != np.nan True >>> np.isnan(np.nan) True

answered Sep 21 '22 00:09

falsetru

Related questions
                            
                                Turn off error bars in Seaborn Bar Plot
                            
                                Using anaconda environment in Atom
                            
                                Python extend with an empty list bug? [duplicate]
                            
                                In Python, how can I get the correctly-cased path for a file?
                            
                                No speed gains from Cython
                            
                                Flask-principal tutorial (auth + authr) [closed]
                            
                                How to change text/font color in reportlab.pdfgen
                            
                                Python Exception in thread Thread-1 (most likely raised during interpreter shutdown)?
                            
                                Install Anaconda on Ubuntu (or Linux) via command line
                            
                                Most efficient method to check if dictionary key exists and process its value if it does
                            
                                Summing list of counters in python
                            
                                How to get OR permissions instead of AND in REST framework
                            
                                Difference between tf.data.Dataset.map() and tf.data.Dataset.apply()
                            
                                How do I disable pylint unused import error messages in vs code
                            
                                Simple, Cross Platform MIDI Library for Python [closed]
                            
                                Grouping Python tuple list
                            
                                How do I sum the columns in 2D list?
                            
                                Matplotlib yaxis range display using absolute values rather than offset values?
                            
                                Multiple many-to-many relations to the same model in Django
                            
                                Removing objects whose counts are less than threshold in counter.

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With