Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

Using the absolute_sigma parameter in scipy.optimize.curve_fit

To perform a fit, I am currently using the curve_fit from scipy.optimize.

I have calculated the error associated with each of my ydata and I would like to add the calculated sigma = y_errors present in the data to the fit,

i.e. minimising sum( ((f(xdata, *popt) - ydata) / sigma)**2 ) instead of just sum( (f(xdata, *popt) - ydata)).

I can see that it can be done assigning the parameter sigma in the documentation. What I don't clearly understand is the absolute_sigma parameter. The explanation given in the documentation is quite confusing to me.

Should I set absolute_sigma= = True ? Or should it be set to False if I need to consider the y_errors associated with each of my ydata?

like image 508
Srivatsan Avatar asked Jul 29 '15 15:07

Srivatsan


1 Answers

If you have absolute uncertainties in your data, i.e. if the units of your y_errors are the same as units of your ydata, then you should set absolute_sigma= = True. However it is often the case that the units of y_errors aren't known precisely, only the relative magnitude is known. An example of the latter case could be where some of the y values come from repeated measurements at the same x value. Then it would make sense to weight the repeated y values twice as much as the non-repeated y values, but the units of this weight (2) are not the same as whatever the units of y are.

Here's some code to illustrate the difference:

import numpy as np
from scipy.optimize import curve_fit
from scipy.stats import norm

# defining a model
def model(x, a, b):
    return a * np.exp(-b * x)

# defining the x vector and the real value of some parameters
x_vector = np.arange(100)
a_real, b_real = 1, 0.05

# some toy data with multiplicative uncertainty
y_vector = model(x_vector, a_real, b_real) * (1 + norm.rvs(scale=0.08, size=100))

# fit the parameters, equal weighting on all data points
params, cov = curve_fit(model, x_vector, y_vector )
print params
print cov


# fit the parameters, weighting each data point by its inverse value
params, cov = curve_fit(model, x_vector, y_vector, 
                        sigma=1/y_vector, absolute_sigma=False)
print params
print cov

# with absolute_sigma=False:
## multiplicative transformations of y_data don't matter
params, cov = curve_fit(model, x_vector, y_vector, 
                        sigma=100/y_vector, absolute_sigma=False)
print params
print cov

# but absolute_sigma=True:
## multiplicative transformations of sigma carry through to pcov
params, cov = curve_fit(model, x_vector, y_vector,
                        sigma=100/y_vector, absolute_sigma=True)
print params
print cov

[ 1.03190409  0.05093425]
[[  1.15344847e-03   5.70001955e-05]
 [  5.70001955e-05   5.92595318e-06]]

[ 1.0134898   0.04872328]
[[  1.57940876e-04   1.56490218e-05]
 [  1.56490218e-05   3.56159680e-06]]

[ 1.0134898   0.04872328]
[[  1.57940878e-04   1.56490220e-05]
 [  1.56490220e-05   3.56159682e-06]]

[ 1.0134898   0.04872328]
[[ 2978.10865352   295.07552766]
 [  295.07552766    67.15691613]]
like image 178
Curt F. Avatar answered Sep 18 '22 14:09

Curt F.