<pre class="prettyprint"><code>> import numpy as np > min(50, np.NaN) 50 > min(np.NaN, 50) nan </code></pre> (Same behaviour occurs with <code>max</code>) I know that I can avoid this behaviour by using <code>numpy.nanmin</code>. But what causes the change when the order is reversed? Is <code>min</code> sensitive to input order?

Yes <code>nan</code> breaks proper ordering, because it always compares as <code>False</code>. A lot of things with <code>nan</code> are inconsistent: <pre class="prettyprint"><code>In [2]: 3.0 < float('nan') Out[2]: False In [3]: float('nan') < 3.0 Out[3]: False In [4]: float('nan') == 3.0 Out[4]: False </code></pre> <code>min</code> and <code>max</code> can only give you consistent results of you are working with well-defined orderings, which numeric types are not if you can have <code>nan</code>

Why do NaN values make min and max sensitive to order? [duplicate]

Tags:

python

nan

numpy

> import numpy as np

> min(50, np.NaN)
50   
> min(np.NaN, 50)
nan

(Same behaviour occurs with max)

I know that I can avoid this behaviour by using numpy.nanmin. But what causes the change when the order is reversed? Is min sensitive to input order?

910

asked Jun 29 '20 11:06

Josh Friedlander

2 Answers

Yes nan breaks proper ordering, because it always compares as False. A lot of things with nan are inconsistent:

In [2]: 3.0 < float('nan')
Out[2]: False

In [3]: float('nan') < 3.0
Out[3]: False

In [4]: float('nan') == 3.0
Out[4]: False

min and max can only give you consistent results of you are working with well-defined orderings, which numeric types are not if you can have nan

151

answered Oct 12 '22 09:10

juanpa.arrivillaga

Is min sensitive to input order?

Yes.

https://docs.python.org/3/library/functions.html#min

"If multiple items are minimal, the function returns the first one encountered."

The documentation does not specify exactly how "minimal" is defined in the face of items that don't have a consistent order, but it's likely that min is based on looping over the elements and using the < operator to determine if the new element is smaller than the smallest item found so-far.

To confirm this hypothesis we can read the source code (search for builtin_min and min_max in https://github.com/python/cpython/blob/c96d00e88ead8f99bb6aa1357928ac4545d9287c/Python/bltinmodule.c ), it's slightly confusing because the implementations for min and max are combined and the variable names seem to be based on it being a max function but it's not too hard to follow.

And it does indeed loop through the elements in order and performs the comparison with a call to PyObject_RichCompareBool with an "opid" of Py_LT which is the C API equivalent of the python < operator.

Comparisons between NaN and numbers return false, so in a list containing numbers and NaNs if there is a NaN in the first position it will be considered the minimum as no number will be "less than" it. On the other hand, if the NaN is not in the first position then it will be effectively skipped over as it is not "less than" any number.

answered Oct 12 '22 09:10

plugwash

Related questions
                            
                                How to disregard the NaN data point in numpy array and generate the normalized data in Python?
                            
                                Django; AWS Elastic Beanstalk ERROR: Your WSGIPath refers to a file that does not exist
                            
                                Django select_related filter
                            
                                Using SQLAlchemy models in and out of Flask
                            
                                Scipy.optimize Inequality Constraint - Which side of the inequality is considered?
                            
                                pip install gives me this error "can't open file 'pip': [Errno 2] No such file or directory"
                            
                                Show grayscale OpenCV image with matplotlib
                            
                                How to extract the title of a PDF document from within a script for renaming?
                            
                                Seaborn: annotate the linear regression equation
                            
                                How to plot two pandas time series on same plot with legends and secondary y-axis?
                            
                                Indicating multiple value in a Dict[] for type hints
                            
                                Python — check if a string contains Cyrillic characters
                            
                                How to crop or remove white background from an image
                            
                                pandas: dataframe to_csv, how to set column names
                            
                                How to run a coroutine outside of an event loop?
                            
                                Send message using Django Channels from outside Consumer class
                            
                                how overwrite Response class in django rest framework ( DRF )?
                            
                                How to hide secret keys in Google Colaboratory from users having the sharing link?
                            
                                Spacy nlp = spacy.load("en_core_web_lg")
                            
                                How do you express a Python Callable with no arguments?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With