Scipy: lognormal fitting

A)

Whats the problem in creating a lognorm directly:

# lognorm(mu=10,sigma=3)
# so shape=3, loc=0, scale=np.exp(10) ?
x=np.linspace(0.01,20,200)
sample_dist = sp.stats.lognorm.pdf(x, 3, loc=0, scale=np.exp(10))
shape, loc, scale = sp.stats.lognorm.fit(sample_dist, floc=0)
print shape, loc, scale
print np.log(scale), shape # mu and sigma
# last line: -7.63285693379 0.140259699945  # not 10 and 3

B)

I use the return values of a fit to create a fitted distribution. But again im doing something wrong apparently:

samp=sp.stats.lognorm(0.5,loc=0,scale=1).rvs(size=2000) # sample
param=sp.stats.lognorm.fit(samp) # fit the sample data
print param # does not coincide  with shape, loc, scale above!
x=np.linspace(0,4,100)
pdf_fitted = sp.stats.lognorm.pdf(x, param[0], loc=param[1], scale=param[2]) # fitted distribution
pdf = sp.stats.lognorm.pdf(x, 0.5, loc=0, scale=1) # original distribution
plt.plot(x,pdf_fitted,'r-',x,pdf,'g-')
plt.hist(samp,bins=30,normed=True,alpha=.3)

lognorm

373

asked Aug 30 '13 13:08

bioslime

1 Answers

I made the same observations: a free fit of all parameters fails most of the time. You can help by providing a better initial guess, fixing the parameter is not necessary.

samp = stats.lognorm(0.5,loc=0,scale=1).rvs(size=2000)

# this is where the fit gets it initial guess from
print stats.lognorm._fitstart(samp)

(1.0, 0.66628696413404565, 0.28031095750445462)

print stats.lognorm.fit(samp)
# note that the fit failed completely as the parameters did not change at all

(1.0, 0.66628696413404565, 0.28031095750445462)

# fit again with a better initial guess for loc
print stats.lognorm.fit(samp, loc=0)

(0.50146296628099118, 0.0011019321419653122, 0.99361128537912125)

You can also make up your own function to calculate the initial guess, e.g.:

def your_func(sample):
    # do some magic here
    return guess

stats.lognorm._fitstart = your_func

195

answered Nov 09 '22 12:11

Christian K.

Related questions
                            
                                How do I use python as a server-side language?
                            
                                In-place sort_values in pandas what does it exactly mean?
                            
                                How to test functions cdef'd in Cython?
                            
                                How to convert a list of strings into a tensor in pytorch?
                            
                                What's the advantage of using yield in __iter__()?
                            
                                Python Pandas: How to merge based on an "OR" condition?
                            
                                How to run Airflow PythonOperator in a virtual environment
                            
                                In Python, how to drop into the debugger in an except block and have access to the exception instance?
                            
                                Seaborn Lineplot Module Object Has No Attribute 'Lineplot'
                            
                                Call Hierarchy in Visual Studio Code
                            
                                Python not able to connect to grpc channel -> "failed to connect to all addresses" "grpc_status":14
                            
                                PyQt: How to update progress without freezing the GUI?
                            
                                Is python's shutil.move() atomic on linux?
                            
                                Can I get a view of a numpy array at specified indexes? (a view from "fancy indexing")
                            
                                Convert HTML string to an image in Python [closed]
                            
                                Library or tool to download multiple files in parallel [closed]
                            
                                How do I apply a DCT to an image in Python?
                            
                                Django's Double Underscore
                            
                                SqlAlchemy won't accept datetime.datetime.now value in a DateTime column
                            
                                How to train a neural network to supervised data set using pybrain black-box optimization?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Scipy: lognormal fitting

Tags:

python

statistics

scipy

A)

B)

bioslime

People also ask

1 Answers

Christian K.

Recent Activity

Donate For Us