Calculate confidence band of least-square fit

Tags:

I got a question that I fight around for days with now.

How do I calculate the (95%) confidence band of a fit?

Fitting curves to data is the every day job of every physicist -- so I think this should be implemented somewhere -- but I can't find an implementation for this neither do I know how to do this mathematically.

The only thing I found is seaborn that does a nice job for linear least-square.

import numpy as np                                                                                                                                                                                                                         
from matplotlib import pyplot as plt
import seaborn as sns
import pandas as pd

x = np.linspace(0,10)
y = 3*np.random.randn(50) + x

data = {'x':x, 'y':y}
frame = pd.DataFrame(data, columns=['x', 'y'])
sns.lmplot('x', 'y', frame, ci=95)

plt.savefig("confidence_band.pdf")

linear-least-square

But this is just linear least-square. When I want to fit e.g. a saturation curve like saturation-eqn , I'm screwed.

Sure, I can calculate the t-distribution from the std-error of a least-square method like scipy.optimize.curve_fit but that is not what I'm searching for.

Thanks for any help!!

863

asked Nov 25 '14 00:11

Suuuehgi

2 Answers

You can achieve this easily using StatsModels module.

Also see this example and this answer.

Here is an answer for your question:

import numpy as np
from matplotlib import pyplot as plt

import statsmodels.api as sm
from statsmodels.stats.outliers_influence import summary_table

x = np.linspace(0,10)
y = 3*np.random.randn(50) + x
X = sm.add_constant(x)
res = sm.OLS(y, X).fit()

st, data, ss2 = summary_table(res, alpha=0.05)
fittedvalues = data[:,2]
predict_mean_se  = data[:,3]
predict_mean_ci_low, predict_mean_ci_upp = data[:,4:6].T
predict_ci_low, predict_ci_upp = data[:,6:8].T

fig, ax = plt.subplots(figsize=(8,6))
ax.plot(x, y, 'o', label="data")
ax.plot(X, fittedvalues, 'r-', label='OLS')
ax.plot(X, predict_ci_low, 'b--')
ax.plot(X, predict_ci_upp, 'b--')
ax.plot(X, predict_mean_ci_low, 'g--')
ax.plot(X, predict_mean_ci_upp, 'g--')
ax.legend(loc='best');
plt.show()

Example

answered Oct 21 '22 06:10

Seyeong Jeong

kmpfit's confidence_band() calculates the confidence band for non-linear least squares. Here for your saturation curve:

from pylab import *
from kapteyn import kmpfit

def model(p, x):
  a, b = p
  return a*(1-np.exp(b*x))

x = np.linspace(0, 10, 100)
y = .1*np.random.randn(x.size) + model([1, -.4], x)

fit = kmpfit.simplefit(model, [.1, -.1], x, y)
a, b = fit.params
dfdp = [1-np.exp(b*x), -a*x*np.exp(b*x)]
yhat, upper, lower = fit.confidence_band(x, dfdp, 0.95, model)

scatter(x, y, marker='.', color='#0000ba')
for i, l in enumerate((upper, lower, yhat)):
  plot(x, l, c='g' if i == 2 else 'r', lw=2)
savefig('kmpfit confidence bands.png', bbox_inches='tight')

The dfdp are the partial derivatives ∂f/∂p of the model f = a*(1-e^(b*x)) with respect to each parameter p (i.e., a and b), see my answer to a similar question for background links. And here the output:

kmpfit confidence bands

answered Oct 21 '22 05:10

Ulrich Stern

Related questions
                            
                                Python: Detect number separator symbols and parse into a float without locale
                            
                                Reading Single Line CSV using numpy.genfromtxt
                            
                                Microsecond accurate timestamp in python?
                            
                                2D Interpolation with periodic boundary conditions
                            
                                Python: calling stop on mock patch class decorator
                            
                                TypeError: list indices must be integers, not str,while parsing json
                            
                                Get a permutation as a function of a unique given index in O(n)
                            
                                Tkinter looks different on different computers
                            
                                Security of regular expressions [duplicate]
                            
                                How do you pass a numpy array to openCV without saving the file as a png or jpeg first?
                            
                                Creating a column from another column in SQLAlchemy
                            
                                How to remove a module using Anaconda in Python
                            
                                How to apply the output of numpy.argpartition for 2-D Arrays?
                            
                                Select2 field implementation in flask/flask-admin
                            
                                django admin list_filter "or" condition
                            
                                How can I access a low-level client from a Boto 3 resource instance?
                            
                                Django 1.7 multisite User model
                            
                                Pandas: plotting two histograms on the same plot
                            
                                execute robot keyword from python using robotframework api
                            
                                ImportError: No module named pynotify. While the module is installed

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Calculate confidence band of least-square fit

Tags:

python

statistics

regression

confidence-interval

Suuuehgi

People also ask

2 Answers

Seyeong Jeong

Ulrich Stern

Recent Activity

Donate For Us