How to interpret scipy.stats.probplot results?

Tags:

I wanted to use scipy.stats.probplot() to perform some gaussianity test on mydata.

from scipy import stats
_,fit=stats.probplot(mydata, dist=stats.norm,plot=ax)
goodness_fit="%.2f" %fit[2]

The documentation says:

Generates a probability plot of sample data against the quantiles of a specified theoretical distribution (the normal distribution by default). probplot optionally calculates a best-fit line for the data and plots the results using Matplotlib or a given plot function. probplot generates a probability plot, which should not be confused with a Q-Q or a P-P plot. Statsmodels has more extensive functionality of this type, see statsmodels.api.ProbPlot.

But if google probability plot, it is a common name for P-P plot, while the documentation says not to confuse the two things.

Now I am confused, what is this function doing?

616

asked Jan 05 '18 06:01

000000

1 Answers

I looked since hours for an answer to this question, and this can be found in the Scipy/Statsmodel code comments.

In Scipy, comment at https://github.com/scipy/scipy/blob/abdab61d65dda1591f9d742230f0d1459fd7c0fa/scipy/stats/morestats.py#L523 says:

probplot generates a probability plot, which should not be confused with a Q-Q or a P-P plot. Statsmodels has more extensive functionality of this type, see statsmodels.api.ProbPlot.

So, now, let's look at Statsmodels, where comment at https://github.com/statsmodels/statsmodels/blob/66fc298c51dc323ce8ab8564b07b1b3797108dad/statsmodels/graphics/gofplots.py#L58 says:

ppplot : Probability-Probability plot Compares the sample and theoretical probabilities (percentiles).

qqplot : Quantile-Quantile plot Compares the sample and theoretical quantiles

probplot : Probability plot Same as a Q-Q plot, however probabilities are shown in the scale of the theoretical distribution (x-axis) and the y-axis contains unscaled quantiles of the sample data.

So, difference between QQ plot and Probability plot, in these modules, is related to the scales.

150

answered Sep 23 '22 18:09

mike123

Related questions
                            
                                send email with a pandas dataframe as attachment
                            
                                Difficulty with python while installing YouCompleteMe in vim
                            
                                Elegant way to delete items in a list which do not has substrings that appear in another list
                            
                                How exactly does random.random() work in python?
                            
                                PyQt4 to PyQt5 -> mainFrame() deprecated, need fix to load web pages
                            
                                Representing voxels with matplotlib
                            
                                Fastest way to cast all dataframe columns to float - pandas astype slow
                            
                                How to get the symmetric difference of two dictionaries
                            
                                Keras training only specific outputs
                            
                                TypeError: run() missing 1 required positional argument: 'fetches' on Session.run()
                            
                                How to split one column into multiple columns in Pandas using regular expression?
                            
                                Incremental training of random forest model using python sklearn
                            
                                Checking if an environment variable exists and is set to True [closed]
                            
                                Scrapy: Save response.body as html file?
                            
                                Flatten multi-index pandas dataframe where column names become values
                            
                                How to find first local maximum for every group?
                            
                                Python installer: "0x80070642 - User cancelled installation"
                            
                                psycopg2 import error when ubuntu upgraded to 17.10 (from 17.04)
                            
                                import matplotlib.pyplot as plt, ImportError: libGL.so.1: cannot open shared object file: No such file or directory
                            
                                Using multiple conditions in Django's Case When expressions

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How to interpret scipy.stats.probplot results?

Tags:

python

matplotlib

plot

numpy

statistics

000000

People also ask

1 Answers

mike123

Recent Activity

Donate For Us

How to interpret scipy.stats.probplot results?

Tags:

python

matplotlib

plot

numpy

statistics

00__00__00

People also ask

1 Answers

mike123

Related questions

Recent Activity

Donate For Us

000000