Im generating a random sample of data and plotting its pdf using scipy.stats.norm.fit to generate my loc and scale parameters. I wanted to see how different my pdf would look like if I just calculated the mean and std using numpy without any actual fitting. To my surprise when I plot both pdfs and print both sets of mu and std the results I get are exactly the same. So my question is, what is the point of norm.fit if I can just calculate the mean and std of my sample and still get the same results? This is my code: <pre class="prettyprint"><code>import numpy as np from scipy.stats import norm import matplotlib.pyplot as plt data = norm.rvs(loc=0,scale=1,size=200) mu1 = np.mean(data) std1 = np.std(data) print(mu1) print(std1) mu, std = norm.fit(data) plt.hist(data, bins=25, density=True, alpha=0.6, color='g') xmin, xmax = plt.xlim() x = np.linspace(xmin, xmax, 100) p = norm.pdf(x, mu, std) q = norm.pdf(x, mu1, std1) plt.plot(x, p, 'k', linewidth=2) plt.plot(x, q, 'r', linewidth=1) title = "Fit results: mu = %.5f, std = %.5f" % (mu, std) plt.title(title) plt.show() </code></pre> And this is the results I got: Pdf of a random set of values mu1 = 0.034824979915482716 std1 = 0.9945453455908072

The point is that there are several other distributions out there besides the normal distribution. Scipy provides a consistent API for learning the parameters of these distributions from data. (Want an exponential distribution instead of a normal distribution? It’s <code>scipy.stats.expon.fit</code>.) So sure, your way also works because the parameters of the normal distribution happen to be the mean and standard deviation. But this is about providing a consistent interface across distributions, including ones where that’s not true.

What is the point of norm.fit in scipy?

Tags:

python

statistics

scipy

scipy.stats

Im generating a random sample of data and plotting its pdf using scipy.stats.norm.fit to generate my loc and scale parameters.

I wanted to see how different my pdf would look like if I just calculated the mean and std using numpy without any actual fitting. To my surprise when I plot both pdfs and print both sets of mu and std the results I get are exactly the same. So my question is, what is the point of norm.fit if I can just calculate the mean and std of my sample and still get the same results?

This is my code:

import numpy as np
from scipy.stats import norm
import matplotlib.pyplot as plt

data = norm.rvs(loc=0,scale=1,size=200)

mu1 = np.mean(data)

std1 = np.std(data)

print(mu1)
print(std1)

mu, std = norm.fit(data)

plt.hist(data, bins=25, density=True, alpha=0.6, color='g')

xmin, xmax = plt.xlim()
x = np.linspace(xmin, xmax, 100)
p = norm.pdf(x, mu, std)
q = norm.pdf(x, mu1, std1)
plt.plot(x, p, 'k', linewidth=2)
plt.plot(x, q, 'r', linewidth=1)
title = "Fit results: mu = %.5f,  std = %.5f" % (mu, std)
plt.title(title)

plt.show()

And this is the results I got:

Pdf of a random set of values

mu1 = 0.034824979915482716

std1 = 0.9945453455908072

325

asked Mar 26 '20 06:03

José Manuel Valladares

1 Answers

The point is that there are several other distributions out there besides the normal distribution. Scipy provides a consistent API for learning the parameters of these distributions from data. (Want an exponential distribution instead of a normal distribution? It’s scipy.stats.expon.fit.)

So sure, your way also works because the parameters of the normal distribution happen to be the mean and standard deviation. But this is about providing a consistent interface across distributions, including ones where that’s not true.

144

answered Sep 18 '22 15:09

Arya McCarthy

Related questions
                            
                                Using the full PyTorch Transformer Module
                            
                                TypeError: Tensor is unhashable if Tensor equality is enabled. Instead, use tensor.experimental_ref() as the key
                            
                                How do I print values only when they appear more than once in a list in python
                            
                                Calculating weighted average by GroupBy.agg and a named aggregation
                            
                                How to make Dash app run faster if its slowed by large data imported
                            
                                detect dates in spacy
                            
                                Is ray thread safe?
                            
                                Improve real-life results of neural network trained with mnist dataset
                            
                                Django collectstatic not working on production with S3, but same settings work locally
                            
                                can not install psycopg2 on macOS Catalina
                            
                                Plot FFT as a set of sine waves in python?
                            
                                How to use multiprocessing to drop duplicates in a very big list?
                            
                                Access deprecated attribute "validation_data" in tf.keras.callbacks.Callback
                            
                                Where does zappa upload environment variables to?
                            
                                Is it possible to change the pydantic error messages in fastAPI?
                            
                                Flask: orjson instead of json module for decoding
                            
                                How to use type hinting with dictionaries and google protobuf enum?
                            
                                Count number of the blues lines on white background in the image
                            
                                how to use python -c on windows?
                            
                                FastAPI - mocking path function has no effect

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With