Exponential decay curve fitting in numpy and scipy

Tags:

I'm having a bit of trouble with fitting a curve to some data, but can't work out where I am going wrong.

In the past I have done this with numpy.linalg.lstsq for exponential functions and scipy.optimize.curve_fit for sigmoid functions. This time I wished to create a script that would let me specify various functions, determine parameters and test their fit against the data. While doing this I noticed that Scipy leastsq and Numpy lstsq seem to provide different answers for the same set of data and the same function. The function is simply y = e^(l*x) and is constrained such that y=1 at x=0.

Excel trend line agrees with the Numpy lstsq result, but as Scipy leastsq is able to take any function, it would be good to work out what the problem is.

import scipy.optimize as optimize
import numpy as np
import matplotlib.pyplot as plt

## Sampled data
x = np.array([0, 14, 37, 975, 2013, 2095, 2147])
y = np.array([1.0, 0.764317544, 0.647136491, 0.070803763, 0.003630962,     0.001485394,     0.000495131])

# function
fp = lambda p, x: np.exp(p*x)

# error function
e = lambda p, x, y: (fp(p, x) - y)

# using scipy least squares
l1, s =  optimize.leastsq(e, -0.004, args=(x,y))
print l1
# [-0.0132281]


# using numpy least squares
l2 = np.linalg.lstsq(np.vstack([x, np.zeros(len(x))]).T,np.log(y))[0][0]
print l2
# -0.00313461628963 (same answer as Excel trend line)

# smooth x for plotting
x_ = np.arange(0, x[-1], 0.2)

plt.figure()
plt.plot(x, y, 'rx', x_, fp(l1, x_), 'b-', x_, fp(l2, x_), 'g-')
plt.show()

Edit - additional information

The MWE above includes a small sample of the dataset. When fitting the actual data the scipy.optimize.curve_fit curve presents an R^2 of 0.82, while the numpy.linalg.lstsq curve, which is the same as that calculated by Excel, has an R^2 of 0.41.

452

asked Jan 16 '13 00:01

StacyR

2 Answers

You are minimizing different error functions.

When you use numpy.linalg.lstsq, the error function being minimized is

np.sum((np.log(y) - p * x)**2)

while scipy.optimize.leastsq minimizes the function

np.sum((y - np.exp(p * x))**2)

The first case requires a linear dependency between the dependent and independent variables, but the solution is known analitically, while the second can handle any dependency, but relies on an iterative method.

On a separate note, ~~I cannot test it right now, but~~ when using numpy.linalg.lstsq, I you don't need to vstack a row of zeros, the following works as well:

l2 = np.linalg.lstsq(x[:, None], np.log(y))[0][0]

199

answered Sep 29 '22 10:09

Jaime

To expound a bit on Jaime's point, any non-linear transformation of the data will lead to a different error function and hence to different solutions. These will lead to different confidence intervals for the fitting parameters. So you have three possible criteria to use to make a decision: which error you want to minimize, which parameters you want more confidence in, and finally, if you are using the fitting to predict some value, which method yields less error in the interesting predicted value. Playing around a bit analytically and in Excel suggests that different kinds of noise in the data (e.g. if the noise function scales the amplitude, affects the time-constant or is additive) leads to different choices of solution.

I'll also add that while this trick "works" for exponential decay to 0, it can't be used in the more general (and common) case of damped exponentials (rising or falling) to values that cannot be assumed to be 0.

answered Sep 29 '22 10:09

user3117404

Related questions
                            
                                Whats the difference between `y = x` and `y = x[:]` with x a numpy-ndarray?
                            
                                Pythonic way to split 3D array in smaller blocks of fixed dimension
                            
                                Shift rows in array independently
                            
                                Add an attribute to a Numpy array in runtime
                            
                                Python Numpy Structured Array (recarray) assigning values into slices
                            
                                Plotting directly to movie with numpy and mencoder
                            
                                python matrix multiplication: how to handle very large matrices?
                            
                                NumPy: array assignment issue when using custom dtype
                            
                                OpenCV Python and Histogram of Oriented Gradient
                            
                                How to vectorize the evaluation of bilinear & quadratic forms?
                            
                                Creating package installer in OS X - install Python, NumPy and other dependencies
                            
                                Specify lag in numpy.correlate
                            
                                Get only "valid" points in 2D interpolation of cloud point using Scipy/Numpy
                            
                                Calculating pdf of Dirichlet distribution in python
                            
                                Replace subarrays in numpy
                            
                                Finding first instance of one list in a second list
                            
                                Numpy Matrix class: Default constructor attributes for inherited class
                            
                                carving 2D numpy array by index
                            
                                efficiently computing parafac / CP product in numpy
                            
                                numpy binary raster image to polygon transformation

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Exponential decay curve fitting in numpy and scipy

Tags:

numpy

scipy

curve-fitting

exponential

least-squares