I'm trying to do a linear fit to some data in numpy. Ex (where w is the number of samples I have for that value, i.e. for the point <code>(x=0, y=0)</code> I only have 1 measurement and the value of that measurement is <code>2.2</code>, but for the point <code>(1,1)</code> I have 2 measurements with a value of <code>3.5</code>. <pre class="prettyprint"><code>x = np.array([0, 1, 2, 3]) y = np.array([2.2, 3.5, 4.6, 5.2]) w = np.array([1, 2, 2, 1]) z = np.polyfit(x, y, 1, w = w) </code></pre> So, now the question is: is it correct to use <code>w=w</code> in polyfit for these cases or should I use <code>w = sqrt(w)</code> of what should I use? Also, how can I get the fit error from polyfit?

If you have normally distributed measurements, then your uncertainty in each value would be proportional to <code>1/sqrt(n)</code> where <code>n</code> is the number of measurements. You want to weigh your fit by the inverse of your uncertainty, so your second guess is best: <code>w=np.sqrt(n)</code> To get the covariance on your parameters, also give <code>cov=True</code>. <pre class="prettyprint"><code>x = np.array([0, 1, 2, 3]) y = np.array([2.2, 3.5, 4.6, 5.2]) n = np.array([1, 2, 2, 1]) p, c = np.polyfit(x, y, 1, w=np.sqrt(n), cov=True) </code></pre> The diagonals of your <code>cov</code> matrix are the individual variances on each parameter, and of course the off-diagonals are the covariances. So most likely what you want for "fit error" is the square root of these diagonals: <pre class="prettyprint"><code>e = np.sqrt(np.diag(c)) </code></pre>

What are the weight values to use in numpy polyfit and what is the error of the fit

Tags:

python

numpy

statistics

curve-fitting

I'm trying to do a linear fit to some data in numpy.

Ex (where w is the number of samples I have for that value, i.e. for the point (x=0, y=0) I only have 1 measurement and the value of that measurement is 2.2, but for the point (1,1) I have 2 measurements with a value of 3.5.

x = np.array([0, 1, 2, 3])
y = np.array([2.2, 3.5, 4.6, 5.2])
w = np.array([1, 2, 2, 1])

z = np.polyfit(x, y, 1, w = w)

So, now the question is: is it correct to use w=w in polyfit for these cases or should I use w = sqrt(w) of what should I use?

Also, how can I get the fit error from polyfit?

292

asked Oct 29 '13 19:10

jbssm

1 Answers

If you have normally distributed measurements, then your uncertainty in each value would be proportional to 1/sqrt(n) where n is the number of measurements. You want to weigh your fit by the inverse of your uncertainty, so your second guess is best: w=np.sqrt(n)

To get the covariance on your parameters, also give cov=True.

x = np.array([0, 1, 2, 3])
y = np.array([2.2, 3.5, 4.6, 5.2])
n = np.array([1, 2, 2, 1])

p, c = np.polyfit(x, y, 1, w=np.sqrt(n), cov=True)

The diagonals of your cov matrix are the individual variances on each parameter, and of course the off-diagonals are the covariances. So most likely what you want for "fit error" is the square root of these diagonals:

e = np.sqrt(np.diag(c))

167

answered Sep 17 '22 17:09

askewchan

Related questions
                            
                                How to connect two computers on the same network using python
                            
                                How to inherit mathematical operations?
                            
                                In a new Django project, should I use Class-based or Function-based views? [closed]
                            
                                Replace one python object with another everywhere
                            
                                Python Create a VPN connection for just a host
                            
                                Python having trouble accessing usb microphone using Gstreamer to perform speech recognition with Pocketsphinx on a Raspberry Pi
                            
                                Stop selenium from opening a new window when clicking on a link
                            
                                How to use griddata from scipy.interpolate
                            
                                TypeError: must be string without null bytes, not str
                            
                                May I omit .pyo and .pyc files in an RPM?
                            
                                How to correctly catch and process RQ timeouts in Python?
                            
                                "ImportError: No module named pwd" but it exists
                            
                                OpenERP module xml ValidateError
                            
                                Angularjs routing with django's urls
                            
                                Is there a way to get the name of the 'parent' module from an imported module?
                            
                                How to create multiple signup pages with django-allauth?
                            
                                Create a complete binary search tree from list
                            
                                Matplotlib: How to adjust linewidth in colorbar for contour plot?
                            
                                python StringIO doesn't work as file with subrpocess.call()
                            
                                Acquiring a regular reference from a weakref proxy in python

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With