In scipy.stats rv_continuous has a fit method to find MLEs, but rv_discrete does not. Why?

Tags: python, statistics, scipy

I would like to find the Maximum Likelihood Estimator for some data that may be governed by a discrete distribution. But in scipy.stats only classes representing continuous distributions have a fit function to do that. What is the reason that the classes representing discrete distributions do not?

asked May 08 '13 22:05

Keith Braithwaite

1 Answer

Short answer: because nobody wrote the code for it, or even tried, as far as I know.

Longer answer: I don't know how far we can get for the discrete models with a generic maximum likelihood method like the one there is for the continuous distributions, which itself works for many but not all of them.

Most discrete distributions have strong restrictions on their parameters, and most of them will most likely need a fit method specific to the distribution. The discrete distributions and their shape parameters are:

>>> from scipy import stats
>>> [(f, getattr(stats, f).shapes) for f in dir(stats) if isinstance(getattr(stats, f), stats.distributions.rv_discrete)]
[('bernoulli', 'pr'), ('binom', 'n, pr'), ('boltzmann', 'lamda, N'), 
 ('dlaplace', 'a'), ('geom', 'pr'), ('hypergeom', 'M, n, N'), 
 ('logser', 'pr'), ('nbinom', 'n, pr'), ('planck', 'lamda'), 
 ('poisson', 'mu'), ('randint', 'min, max'), ('skellam', 'mu1,mu2'), 
 ('zipf', 'a')]

statsmodels provides a few of the discrete models, where the parameters can also depend on explanatory variables. Most of those, like generalized linear models, need a link function to restrict the parameter values to the valid range, for example the interval (0, 1) for probabilities, or values greater than zero for parameters in count models.
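To illustrate the link function idea, here is a minimal sketch that fits a Poisson mean by optimizing over theta = log(mu), so the optimizer can search the whole real line while mu stays positive. The helper name `fit_poisson_log_link` and the sample data are made up for illustration; this is not how statsmodels implements it.

```python
import numpy as np
from scipy import optimize, stats


def fit_poisson_log_link(data):
    """Hypothetical helper: MLE of the Poisson mean via a log link."""
    data = np.asarray(data)

    def neg_loglike(theta):
        mu = np.exp(theta)  # inverse link: any real theta gives mu > 0
        return -stats.poisson.logpmf(data, mu).sum()

    # Unconstrained one-dimensional minimization over theta.
    result = optimize.minimize_scalar(neg_loglike)
    return float(np.exp(result.x))


counts = [2, 0, 3, 1, 4, 2, 1, 0, 2, 5]  # made-up count data
mu_hat = fit_poisson_log_link(counts)
print(mu_hat, np.mean(counts))
```

For the Poisson distribution the MLE is known to be the sample mean, so the two printed values should agree up to the optimizer's tolerance. A probability parameter could be handled the same way with a logit link instead of the log link.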

Also, the "n" parameter in the binomial and some of the other parameters are required to be integers, which makes it impossible to use the usual continuous minimizers from scipy.optimize directly.
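One possible workaround for an integer parameter, sketched here as an assumption rather than anything scipy provides, is to search a grid of candidate integers and use the closed-form estimate of the remaining parameter for each candidate. For the binomial with n fixed, the MLE of the probability is the sample mean divided by n. The function name `fit_binom_grid`, the upper bound `n_max`, and the data are hypothetical.

```python
import numpy as np
from scipy import stats


def fit_binom_grid(data, n_max):
    """Hypothetical helper: profile the binomial likelihood over integer n."""
    data = np.asarray(data)
    mean = data.mean()
    best = None
    # n cannot be smaller than the largest observation (and must be >= 1).
    for n in range(max(int(data.max()), 1), n_max + 1):
        p = mean / n  # MLE of the probability for this fixed n
        loglike = stats.binom.logpmf(data, n, p).sum()
        if best is None or loglike > best[0]:
            best = (loglike, n, p)
    _, n_hat, p_hat = best
    return n_hat, p_hat


observations = [3, 5, 2, 4, 6, 3, 4, 2, 5, 3]  # made-up data
n_hat, p_hat = fit_binom_grid(observations, n_max=50)
print(n_hat, p_hat)
```

The estimate of n obtained this way can be unstable for small samples, and the result depends on the chosen upper bound, so this is only a starting point.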

A good solution would be for someone to add distribution-specific fit methods, so that we have at least the easier ones available.
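As a rough idea of what the "easier ones" could look like, the sketch below uses the standard closed-form MLEs for two one-parameter distributions: the sample mean for the Poisson `mu`, and one over the sample mean for the geometric probability (with support starting at 1, as in `scipy.stats.geom`). The function names `fit_poisson` and `fit_geom` are hypothetical and not part of scipy.

```python
from statistics import fmean


def fit_poisson(data):
    """Hypothetical helper: closed-form MLE of the Poisson mean."""
    return fmean(data)


def fit_geom(data):
    """Hypothetical helper: closed-form MLE of the geometric probability.

    Assumes every observation is an integer >= 1.
    """
    if min(data) < 1:
        raise ValueError("geometric observations must be >= 1")
    return 1.0 / fmean(data)


print(fit_poisson([2, 0, 3, 1, 4]))
print(fit_geom([1, 2, 3, 2]))
```

No numerical optimizer is needed in these cases, which is why they would be the natural first candidates.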

answered Oct 16 '22 16:10

Josef
