I want to fit lognormal distribution to my data, using python <code>scipy.stats.lognormal.fit</code>. According to the manual, <code>fit</code> returns shape, loc, scale parameters. But, lognormal distribution normally needs only two parameters: mean and standard deviation. How to interpret the results from scipy <code>fit</code> function? How to get mean and std.dev.?

I just spent some time working this out and wanted to document it here: If you want to get the probability density (at point <code>x</code>) from the three return values of <code>lognorm.fit</code> (lets call them <code>(shape, loc, scale)</code>), you need to use this formula: <pre class="prettyprint"><code>x = 1 / (shape*((x-loc)/scale)*sqrt(2*pi)) * exp(-1/2*(log((x-loc)/scale)/shape)**2) / scale </code></pre> So as an equation that is (<code>loc</code> is <code>µ</code>, <code>shape</code> is <code>σ</code> and <code>scale</code> is <code>α</code>): <img src="https://i.stack.imgur.com/w0BMf.png" alt="x = \frac{1}{(x-\mu)\cdot\sqrt{2\pi\sigma^2}} \cdot e^{-\frac{log(\frac{x-\mu}{\alpha})^2}{2\sigma^2}} ">

scipy, lognormal distribution - parameters

Tags:

python

statistics

scipy

I want to fit lognormal distribution to my data, using python scipy.stats.lognormal.fit. According to the manual, fit returns shape, loc, scale parameters. But, lognormal distribution normally needs only two parameters: mean and standard deviation.

How to interpret the results from scipy fit function? How to get mean and std.dev.?

239

asked Jan 05 '12 18:01

Jakub M.

2 Answers

The distributions in scipy are coded in a generic way wrt two parameter location and scale so that location is the parameter (loc) which shifts the distribution to the left or right, while scale is the parameter which compresses or stretches the distribution.

For the two parameter lognormal distribution, the "mean" and "std dev" correspond to log(scale) and shape (you can let loc=0).

The following illustrates how to fit a lognormal distribution to find the two parameters of interest:

In [56]: import numpy as np  In [57]: from scipy import stats  In [58]: logsample = stats.norm.rvs(loc=10, scale=3, size=1000) # logsample ~ N(mu=10, sigma=3)  In [59]: sample = np.exp(logsample) # sample ~ lognormal(10, 3)  In [60]: shape, loc, scale = stats.lognorm.fit(sample, floc=0) # hold location to 0 while fitting  In [61]: shape, loc, scale Out[61]: (2.9212650122639419, 0, 21318.029350592606)  In [62]: np.log(scale), shape  # mu, sigma Out[62]: (9.9673084420467362, 2.9212650122639419)

140

answered Sep 17 '22 14:09

ars

I just spent some time working this out and wanted to document it here: If you want to get the probability density (at point x) from the three return values of lognorm.fit (lets call them (shape, loc, scale)), you need to use this formula:

x = 1 / (shape*((x-loc)/scale)*sqrt(2*pi)) * exp(-1/2*(log((x-loc)/scale)/shape)**2) / scale

So as an equation that is (loc is µ, shape is σ and scale is α):

$x = \frac{1}{(x-\mu)\cdot\sqrt{2\pi\sigma^2}} \cdot e^{-\frac{log(\frac{x-\mu}{\alpha})^2}{2\sigma^2}}$

answered Sep 16 '22 14:09

Chronial

Related questions
                            
                                Is it ever useful to use Python's input over raw_input?
                            
                                error extracting element from an array. python
                            
                                Pythonic way for `return (value == 'ok') ? 'ok' : 'nok' ` [duplicate]
                            
                                In python, how to do unit test on a function without return value?
                            
                                Pandas: Checking if a date is a holiday and assigning boolean value
                            
                                SQLalchemy AttributeError: 'str' object has no attribute '_sa_instance_state'
                            
                                CrawlerProcess vs CrawlerRunner
                            
                                Using the class as a type hint for arguments in its methods [duplicate]
                            
                                Python webdriver to handle pop up browser windows which is not an alert
                            
                                Retrieve XY data from matplotlib figure [duplicate]
                            
                                Opening a pdf and reading in tables with python pandas
                            
                                Why does my use of click.argument produce "got an unexpected keyword argument 'help'?
                            
                                Python Requests getting ('Connection aborted.', BadStatusLine("''",)) error
                            
                                Insert or delete a step in scikit-learn Pipeline
                            
                                replace part of the string in pandas data frame
                            
                                How to execute two "aggregate" functions (like sum) concurrently, feeding them from the same iterator?
                            
                                Draw a line at specific position/annotate a Facetgrid in seaborn
                            
                                Dynamically importing Python module
                            
                                How to display picture and get mouse click coordinate on it [closed]
                            
                                Python multiprocessing - How to release memory when a process is done?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With