Python\Numpy: Comparing arrays with NAN [duplicate]

Tags:

python

numpy

Why are the following two lists not equal?

a = [1.0, np.nan]
b = np.append(np.array(1.0), [np.nan]).tolist()

I am using the following to check whether they are equal:

((a == b) | (np.isnan(a) & np.isnan(b))).all(), np.in1d(a,b)

With np.in1d(a, b), the NaN values appear not to be equal, but I am not sure why. Can anyone shed some light on this?

asked May 22 '14 14:05

Black

2 Answers

NaN values never compare equal. That is, the test NaN==NaN is always False by definition of NaN.

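So [1.0, NaN] == [1.0, NaN] is also False whenever the two NaN values are distinct objects, as they are in your a and b. A list containing a NaN does not compare equal to another list that holds a different NaN object in the same position, even when every other element matches.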

To test whether a value is NaN in NumPy, use the numpy.isnan() function. For plain Python lists, I don't see any obvious way of obtaining the comparison semantics that you seem to want other than by "manually" iterating over the lists with a loop.

Consider the following:

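import math

def nan_eq(a, b):
    # zip() stops at the shorter input, so compare the lengths first.
    if len(a) != len(b):
        return False
    for i, j in zip(a, b):
        # Two elements match if they are equal or if both are NaN.
        if i != j and not (math.isnan(i) and math.isnan(j)):
            return False
    return True

a = [1.0, float('nan')]
b = [1.0, float('nan')]

print(float('nan') == float('nan'))
print(a == a)
print(a == b)
print(nan_eq(a, a))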

It will print:

False
True
False
True

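The test a==a succeeds because Python's list comparison checks each pair of elements for identity (is) before falling back to ==, so a NaN object is treated as equal to itself. In a==b the two NaN values are distinct objects, so the element-wise == comparison applies and returns False.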
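The identity shortcut can be seen in isolation with a small sketch. The variable names here are only illustrative, and the behaviour shown is that of CPython's built-in list and float types:

```python
nan = float("nan")

scalar_eq = (nan == nan)                              # False: NaN never equals NaN
same_object = ([nan] == [nan])                        # True: both lists hold the same object
distinct_objects = ([float("nan")] == [float("nan")]) # False: two separate NaN objects
membership = (nan in [nan])                           # True: `in` applies the same identity check

print(scalar_eq, same_object, distinct_objects, membership)
```

This is why a list built from a single NaN constant can compare equal to another list built from that same constant, while a list produced by .tolist() (which creates fresh float objects) does not.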

answered Oct 31 '22 03:10

Emmet

Since a and b are lists, a == b returns a single bool rather than an array, so your NumPy-style logic won't work:

>>> a == b
False
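To see what the original expression evaluates to while a and b are still lists, here is a small sketch that splits it into steps. The names list_eq, both_nan and combined are mine, and NaN is spelled np.nan because the np.NAN alias is not available in NumPy 2.0 and later:

```python
import numpy as np

a = [1.0, np.nan]
b = np.append(np.array(1.0), [np.nan]).tolist()

list_eq = (a == b)                    # a single Python bool, not an array
both_nan = np.isnan(a) & np.isnan(b)  # np.isnan converts the lists to arrays
combined = list_eq | both_nan         # the bool is broadcast against the array

print(list_eq)         # False
print(both_nan)        # [False  True]
print(combined.all())  # False
```

The single False from the list comparison is OR-ed with every element, so the first position (where the values really are equal) ends up False and .all() fails.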

The command you've quoted only works if they're arrays:

>>> a,b = np.asarray(a), np.asarray(b)
>>> a == b
array([ True, False], dtype=bool)
>>> (a == b) | (np.isnan(a) & np.isnan(b))
array([ True,  True], dtype=bool)
>>> ((a == b) | (np.isnan(a) & np.isnan(b))).all()
True

This works for comparing two arrays of the same shape: each pair of elements must either be equal or both be NaN.
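As a possible shortcut, and assuming NumPy 1.19 or newer (where np.array_equal accepts an equal_nan argument), the same check can be sketched with library calls. The names strict and lenient are only illustrative:

```python
import numpy as np

a = np.asarray([1.0, np.nan])
b = np.append(np.array(1.0), [np.nan])

strict = np.array_equal(a, b)                   # NaN positions count as unequal
lenient = np.array_equal(a, b, equal_nan=True)  # NaN positions count as equal

print(strict, lenient)  # False True

# In test code, this assertion also treats NaNs in matching positions as equal;
# it raises AssertionError if the arrays differ.
np.testing.assert_array_equal(a, b)
```

Both calls also accept plain lists, since they convert their inputs to arrays first.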

answered Oct 31 '22 05:10

DSM
