I have a numpy array <code>arr</code>. It's a <code>numpy.ndarray</code>, size is <code>(5553110,)</code>, <code>dtype=float32</code>. When I do: <pre class="prettyprint"><code>(arr > np.pi )[3154950] False (arr[3154950] > np.pi ) True </code></pre> Why is the first comparison getting it wrong? And how can I fix it? The values: <pre class="prettyprint"><code>arr[3154950]= 3.1415927 np.pi= 3.141592653589793 </code></pre> Is the problem with precision?

The problem is due to accuracy of <code>np.float32</code> vs <code>np.float64</code>. Use <code>np.float64</code> and you will not see a problem: <pre class="prettyprint"><code>import numpy as np arr = np.array([3.1415927], dtype=np.float64) print((arr > np.pi)[0]) # True print(arr[0] > np.pi) # True </code></pre> <hr> As @WarrenWeckesser comments: <blockquote> It involves how numpy decides to cast the arguments of its operations. Apparently, with <code>arr > scalar</code>, the scalar is converted to the same type as the array <code>arr</code>, which in this case is <code>np.float32</code>. On the other hand, with something like <code>arr > arr2</code>, with both arguments nonscalar arrays, they will use a common data type. That's why (<code>arr > np.array([np.pi]))[3154950]</code> returns <code>True</code>. </blockquote> Related github issue

Array comparison not matching elementwise comparison in numpy

Q: How do I check if two arrays are identical in Python?

Compare Two Arrays in Python Using the numpy. array_equiv() Method. The numpy. array_equiv(a1, a2) method takes array a1 and a2 as input and returns True if both arrays' shape and elements are the same; otherwise, returns False .

Q: How do you check if all elements in an array are equal NumPy?

The numpy. array_equiv() function can also be used to check whether two arrays are equal or not in Python. The numpy. array_equiv() function returns True if both arrays have the same shape and all the elements are equal, and returns False otherwise.

(arr > np.pi )[3154950]
False
(arr[3154950] > np.pi )
True

Why is the first comparison getting it wrong? And how can I fix it?

The values:

arr[3154950]= 3.1415927
np.pi= 3.141592653589793

Is the problem with precision?

378

asked Apr 26 '18 15:04

user7867665

1 Answers

The problem is due to accuracy of np.float32 vs np.float64.

Use np.float64 and you will not see a problem:

import numpy as np

arr = np.array([3.1415927], dtype=np.float64)

print((arr > np.pi)[0])  # True

print(arr[0] > np.pi)    # True

As @WarrenWeckesser comments:

It involves how numpy decides to cast the arguments of its operations. Apparently, with arr > scalar, the scalar is converted to the same type as the array arr, which in this case is np.float32. On the other hand, with something like arr > arr2, with both arguments nonscalar arrays, they will use a common data type. That's why (arr > np.array([np.pi]))[3154950] returns True.

Related github issue

108

answered Oct 02 '22 05:10

jpp

Related questions
                            
                                Keras ConvLSTM2D: ValueError on output layer
                            
                                ModuleNotFoundError issue for pytest
                            
                                Cryptacular is broken
                            
                                matplotlib 1.3.1 has requirement numpy>=1.5, but you'll have numpy 1.8.0rc1 which is incompatible
                            
                                Python: Remove duplicates for a specific item from list
                            
                                Why can a subprocess still write to stdout after it's been closed?
                            
                                python requests.get gets stuck
                            
                                Is tf.contrib.layers.fully_connected() behavior change between tensorflow 1.3 and 1.4 an issue?
                            
                                Updating an OpenCV tracker with a bounding box in python
                            
                                How to serialize numpy arrays?
                            
                                beautiful soup regex
                            
                                Check whether a DataFrame or ndrray contains digits
                            
                                How to pass global debug flag variable throughout my code; should I use argparse?
                            
                                worker_machine_type tag not working in Google Cloud Dataflow with python
                            
                                LSTM preprocessing: Build 3d arrays from pandas data frame based on ID
                            
                                How to update pip version installed by pyenv
                            
                                Upgrading SQLite3 version used in python3 on linux?
                            
                                Regarding GIL in python
                            
                                Python for .NET: How to explicitly create instances of C# classes using different versions of the same DLL?
                            
                                Containers communication with python requests

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Array comparison not matching elementwise comparison in numpy

Tags:

python

arrays

floating-point

floating-accuracy

numpy

user7867665

People also ask

1 Answers

jpp

Recent Activity

Donate For Us