I can compare two Pandas series for exact equality using <code>pandas.Series.equals</code>. Is there a corresponding function or parameter that will check if the elements are equal to some ε of precision?

You can use <code>numpy.allclose</code>: <blockquote> <pre class="prettyprint"><code>numpy.allclose(a, b, rtol=1e-05, atol=1e-08, equal_nan=False) </code></pre> Returns <code>True</code> if two arrays are element-wise equal within a tolerance. The tolerance values are positive, typically very small numbers. The relative difference (<code>rtol * abs(b)</code>) and the absolute difference <code>atol</code> are added together to compare against the absolute difference between <code>a</code> and <code>b</code>. </blockquote> <code>numpy</code> works well with <code>pandas.Series</code> objects, so if you have two of them - <code>s1</code> and <code>s2</code>, you can simply do: <pre class="prettyprint"><code>np.allclose(s1, s2, atol=...) </code></pre> Where <code>atol</code> is your tolerance value.

Comparing two pandas series for floating point near-equality?

2 Answers

You can use numpy.allclose:

numpy.allclose(a, b, rtol=1e-05, atol=1e-08, equal_nan=False)
Returns True if two arrays are element-wise equal within a tolerance.

The tolerance values are positive, typically very small numbers. The relative difference (rtol * abs(b)) and the absolute difference atol are added together to compare against the absolute difference between a and b.

numpy works well with pandas.Series objects, so if you have two of them - s1 and s2, you can simply do:

np.allclose(s1, s2, atol=...)

Where atol is your tolerance value.

161

answered Oct 10 '22 10:10

cs95

Numpy works well with pandas Series. However one has to be careful with the order of indices (or columns and indices for pandas DataFrame)

For example

series_1 = pd.Series(data=[0,1], index=['a','b'])
series_2 = pd.Series(data=[1,0], index=['b','a']) 
np.allclose(series_1,series_2)

will return False

A workaround is to use the index of one pandas series

np.allclose(series_1, series_2.loc[series_1.index])

answered Oct 10 '22 11:10

bolirev

Related questions
                            
                                ImportError: cannot import name DependencyWarning
                            
                                How can I use KNN /K-means to clustering time series in a dataframe
                            
                                How to print FF (form feed) character?
                            
                                RuntimeError: The init_func must return a sequence of Artist objects
                            
                                Dotted lines instead of a missing value in matplotlib
                            
                                Firebase database data to R
                            
                                Access Rows by integers and Columns by labels Pandas
                            
                                Resampling and filling missing data in pandas
                            
                                Deep set python dictionary
                            
                                Python Argparse - Set default value of a parameter to another parameter
                            
                                How do install packages from a local python package index?
                            
                                Default Argument decorator python
                            
                                Pandas SQL equivalent for 'not equal' clause
                            
                                O(n) solution for finding maximum sum of differences python 3.x?
                            
                                Keras Extremely High Loss
                            
                                How to know from python if Windows path limit has been removed
                            
                                Python exit from all running threads on truthy condition
                            
                                Splitting list of dictionary into sublists after the occurence of particular key of dictionary
                            
                                Data Normalization with tensorflow tf-transform
                            
                                hog() got an unexpected keyword argument 'visualize'

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Comparing two pandas series for floating point near-equality?

Tags:

python

equality

floating-point

pandas

numpy

Mark Harrison

People also ask

2 Answers

cs95

bolirev

Recent Activity

Donate For Us