I've been surfing but haven't found the correct method to do the following. I have a histogram done with matplotlib: <pre class="prettyprint"><code>hist, bins, patches = plt.hist(distance, bins=100, normed='True') </code></pre> From the plot, I can see that the distribution is more or less an exponential (Poisson distribution). How can I do the best fitting, taking into account my hist and bins arrays? UPDATE I am using the following approach: <pre class="prettyprint"><code>x = np.float64(bins) # Had some troubles with data types float128 and float64 hist = np.float64(hist) myexp=lambda x,l,A:A*np.exp(-l*x) popt,pcov=opt.curve_fit(myexp,(x[1:]+x[:-1])/2,hist) </code></pre> But I get <pre class="prettyprint"><code>---> 41 plt.plot(stats.expon.pdf(np.arange(len(hist)),popt),'-') ValueError: operands could not be broadcast together with shapes (100,) (2,) </code></pre>

What you described is a form of exponential distribution, and you want to estimate the parameters of the exponential distribution, given the probability density observed in your data. Instead of using non-linear regression method (which assumes the residue errors are Gaussian distributed), one correct way is arguably a MLE (maximum likelihood estimation). <code>scipy</code> provides a large number of continuous distributions in its <code>stats</code> library, and the MLE is implemented with the <code>.fit()</code> method. Of course, exponential distribution is there: <pre class="prettyprint"><code>In [1]: import numpy as np import scipy.stats as ss import matplotlib.pyplot as plt %matplotlib inline In [2]: #generate data X = ss.expon.rvs(loc=0.5, scale=1.2, size=1000) #MLE P = ss.expon.fit(X) print P (0.50046056920696858, 1.1442947648425439) #not exactly 0.5 and 1.2, due to being a finite sample In [3]: #plotting rX = np.linspace(0,10, 100) rP = ss.expon.pdf(rX, *P) #Yup, just unpack P with *P, instead of scale=XX and shape=XX, etc. In [4]: #need to plot the normalized histogram with `normed=True` plt.hist(X, normed=True) plt.plot(rX, rP) Out[4]: </code></pre> <img src="https://i.stack.imgur.com/94u4y.png" alt="enter image description here"> Your <code>distance</code> will replace <code>X</code> here.

Histogram fitting with python

Tags:

python

pandas

matplotlib

scipy

data-analysis

I've been surfing but haven't found the correct method to do the following.

I have a histogram done with matplotlib:

hist, bins, patches = plt.hist(distance, bins=100, normed='True')

From the plot, I can see that the distribution is more or less an exponential (Poisson distribution). How can I do the best fitting, taking into account my hist and bins arrays?

UPDATE

I am using the following approach:

x = np.float64(bins) # Had some troubles with data types float128 and float64
hist = np.float64(hist)
myexp=lambda x,l,A:A*np.exp(-l*x)
popt,pcov=opt.curve_fit(myexp,(x[1:]+x[:-1])/2,hist)

But I get

---> 41 plt.plot(stats.expon.pdf(np.arange(len(hist)),popt),'-')

ValueError: operands could not be broadcast together with shapes (100,) (2,)

218

asked Nov 19 '15 18:11

user2820579

1 Answers

What you described is a form of exponential distribution, and you want to estimate the parameters of the exponential distribution, given the probability density observed in your data. Instead of using non-linear regression method (which assumes the residue errors are Gaussian distributed), one correct way is arguably a MLE (maximum likelihood estimation).

scipy provides a large number of continuous distributions in its stats library, and the MLE is implemented with the .fit() method. Of course, exponential distribution is there:

In [1]:

import numpy as np
import scipy.stats as ss
import matplotlib.pyplot as plt
%matplotlib inline
In [2]:
#generate data 
X = ss.expon.rvs(loc=0.5, scale=1.2, size=1000)

#MLE
P = ss.expon.fit(X)
print P
(0.50046056920696858, 1.1442947648425439)
#not exactly 0.5 and 1.2, due to being a finite sample

In [3]:
#plotting
rX = np.linspace(0,10, 100)
rP = ss.expon.pdf(rX, *P)
#Yup, just unpack P with *P, instead of scale=XX and shape=XX, etc.
In [4]:

#need to plot the normalized histogram with `normed=True`
plt.hist(X, normed=True)
plt.plot(rX, rP)
Out[4]:

enter image description here

Your distance will replace X here.

199

answered Sep 21 '22 16:09

CT Zhu

Related questions
                            
                                Get and Post methods in Python (Flask)
                            
                                Python : generating random numbers from a power law distribution [duplicate]
                            
                                Lambda with nested if else is not working
                            
                                Using Selenium in Python to click through all elements with the same class name
                            
                                how to select specific json element in python [duplicate]
                            
                                How can I load an image using Python Pillow?
                            
                                Install psycopg2 for Anaconda Python
                            
                                SQLite 3 Database with Django
                            
                                Get consistent Key error: \n [duplicate]
                            
                                Load JPEG from URL to skimage without temporary file
                            
                                Is it possible to overload logical and in Python?
                            
                                Changing a class attribute within __init__
                            
                                Sorting and auto filtering Excel with openpyxl
                            
                                Building a Bootstrap table with dynamic elements in Flask
                            
                                NumPy sum along disjoint indices
                            
                                Plots are not visible using matplotlib plt.show()
                            
                                Embed "Bokeh created html file" into Flask "template.html" file
                            
                                How do I list contents of a gz file without extracting it in python?
                            
                                Matplotlib: how to plot with a specific hex color and a specific marker?
                            
                                pandas - check for non unique values in dataframe groupby

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With