Using a two-sample Kolmogorov-Smirnov test, I am getting a p-value of 0.0.
>>> scipy.stats.ks_2samp(dataset1, dataset2)
(0.65296076312083573, 0.0)
Looking at the histograms of the two datasets, I am quite confident they come from two different distributions. But, really, p = 0.0? That doesn't seem to make sense. Shouldn't it be a very small but positive number?
I know the return value is of type numpy.float64. Does that have something to do with it?
EDIT: data here: https://www.dropbox.com/s/jpixhz0pcybyh1t/data4stack.csv
scipy.version.full_version
'0.13.2'
If you observe a sample that's impossible under the null hypothesis (and the statistic is able to detect that), you can get a p-value of exactly zero. More often, though, the true p-value is simply smaller than the smallest positive float64, so it underflows to 0.0.
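A minimal sketch of the floating-point side of this: the returned p-value is an IEEE-754 double (`numpy.float64`), and the smallest positive subnormal double is about 5e-324. Any probability below that floor cannot be represented and becomes exactly 0.0:

```python
import sys

# Smallest positive *normal* double (~2.2e-308).
print(sys.float_info.min)

tiny = 5e-324       # smallest positive subnormal double
print(tiny > 0.0)   # True: still representable
print(tiny / 2)     # 0.0: anything smaller underflows to exactly zero
```

So a returned 0.0 does not mean "impossible"; it usually means "smaller than ~5e-324".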
Yes, the probability is very small:
>>> from pprint import pprint
>>> pprint([(i, scipy.stats.ks_2samp(dataset1, dataset2[:i])[1])
...         for i in range(200, len(dataset2), 200)])
[(200, 3.1281733251275881e-63),
(400, 3.5780609056448825e-157),
(600, 9.2884803664366062e-225),
(800, 7.1429666685167604e-293),
(1000, 0.0),
(1200, 0.0),
(1400, 0.0),
(1600, 0.0),
(1800, 0.0),
(2000, 0.0),
(2200, 0.0),
(2400, 0.0)]
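For intuition about where the exact zero comes from, here is a sketch of a two-sample KS computation with the asymptotic Smirnov p-value (the Numerical Recipes-style approximation; `ks_2samp_p` and the generated samples are my own illustration, not scipy's actual implementation or the OP's data). For large samples from clearly different distributions, every `exp()` term in the series underflows, so the sum is exactly 0.0:

```python
import numpy as np

def ks_2samp_p(x, y):
    """Two-sample KS statistic D plus the asymptotic (Smirnov) p-value.

    Illustrative sketch only -- not scipy's implementation.
    """
    x, y = np.sort(x), np.sort(y)
    data = np.concatenate([x, y])
    # Empirical CDFs of both samples evaluated at every data point.
    cdf_x = np.searchsorted(x, data, side="right") / len(x)
    cdf_y = np.searchsorted(y, data, side="right") / len(y)
    d = np.max(np.abs(cdf_x - cdf_y))
    en = np.sqrt(len(x) * len(y) / (len(x) + len(y)))
    lam = (en + 0.12 + 0.11 / en) * d
    # Each exp() term is a float64; for large lam all of them underflow to 0.0.
    p = 2 * sum((-1) ** (k - 1) * np.exp(-2 * k**2 * lam**2)
                for k in range(1, 101))
    return d, float(min(max(p, 0.0), 1.0))

rng = np.random.default_rng(0)
a, b = rng.normal(0, 1, 500), rng.normal(2, 1, 500)  # clearly different
d, p = ks_2samp_p(a, b)
print(d, p)            # p is tiny but still nonzero at this sample size
d_big, p_big = ks_2samp_p(rng.normal(0, 1, 5000), rng.normal(2, 1, 5000))
print(d_big, p_big)    # p_big is exactly 0.0: every series term underflowed
```

If the magnitude matters to you, report the D statistic (the first return value) rather than the p-value; once the true p-value drops below ~5e-324, a float64 simply cannot hold it.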