pandas quantiles in series containing infinity?

Tags: python, pandas

I have the following dataframe:

   calc_value
0         NaN
1    0.000000
2    0.100000
3    0.500000
4    2.333333
5         inf

Now I want to calculate some quantiles:

print(df.quantile(.1)['calc_value'])
print(df.quantile(.25)['calc_value'])
print(df.quantile(.5)['calc_value'])
print(df.quantile(.75)['calc_value'])
print(df.quantile(.9)['calc_value'])

But this returns:

0.04
0.1
0.5
nan
inf

I don't understand why the 0.75 quantile comes back as nan. Shouldn't it be infinity?


asked Apr 12 '16 09:04

Richard

1 Answer

I think it may be a bug in numpy:

np.percentile([0,1,np.inf], 50)
Out[63]: nan

while

np.median([0, 1, np.inf])
Out[65]: 1.0

Instead of simply taking the value at index 1, np.percentile interpolates between the values at indices 1 and 2 with weights 1 and 0. The second term is 0 * inf, which evaluates to nan, so the whole result is nan.

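In your case the 0.75 quantile should be 2.333333: NaN is dropped, leaving five values, and 0.75 * (5 - 1) = 3 lands exactly on index 3. You can check this by replacing inf with a large finite number, for example df.iloc[5, 0] = 1e10.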
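To make the weights-1-and-0 explanation concrete, here is a minimal pure-Python sketch of linear-interpolation quantiles. It is not numpy's actual implementation; naive_quantile and guarded_quantile are hypothetical names made up for this illustration, and the input list assumes the NaN row has already been dropped, as pandas does by default.

```python
import math

def naive_quantile(values, q):
    """Linear interpolation that always blends two neighbouring values."""
    data = sorted(values)
    pos = q * (len(data) - 1)        # fractional index of the quantile
    lo = int(math.floor(pos))
    hi = min(lo + 1, len(data) - 1)
    frac = pos - lo                  # weight given to the upper neighbour
    # If frac == 0 and data[hi] is inf, the second term is 0 * inf = nan.
    return data[lo] * (1 - frac) + data[hi] * frac

def guarded_quantile(values, q):
    """Same interpolation, but skips the upper neighbour when its weight is 0."""
    data = sorted(values)
    pos = q * (len(data) - 1)
    lo = int(math.floor(pos))
    frac = pos - lo
    if frac == 0:
        return data[lo]              # exact hit: no interpolation needed
    return data[lo] * (1 - frac) + data[lo + 1] * frac

# The question's column with the NaN row removed.
calc_value = [0.0, 0.1, 0.5, 2.333333, float("inf")]

for q in (0.1, 0.25, 0.5, 0.75, 0.9):
    print(q, naive_quantile(calc_value, q), guarded_quantile(calc_value, q))
```

The naive version reproduces the pattern from the question (nan at 0.75, inf at 0.9), while the guarded version returns 2.333333 at 0.75. Whether np.percentile itself still behaves like the naive version depends on your numpy version.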


answered Oct 20 '22 14:10

ptrj
