I have these values: <pre class="prettyprint"><code>T_values = (222, 284, 308.5, 333, 358, 411, 477, 518, 880, 1080, 1259) (x values) </code></pre> <pre class="prettyprint"><code>C/(3Nk)_values = (0.1282, 0.2308, 0.2650, 0.3120 , 0.3547, 0.4530, 0.5556, 0.6154, 0.8932, 0.9103, 0.9316) (y values) </code></pre> I know they follow the model: <pre class="prettyprint"><code>C/(3Nk)=(h*w/(k*T))**2*(exp(h*w/(k*T)))/(exp(h*w/(k*T)-1))**2 </code></pre> I also know that <code>k=1.38*10**(-23)</code> and <code>h=6.626*10**(-34)</code>. I have to find the w that best describes the measurement data. I'd like to solve this using the least square method in python, however I don't really understand how this works. Can anyone help me?

This answer provides a walk-through on using Python to determine fitting parameters for a general exponential pattern. See also a related posts on linearization techniques and using the <code>lmfit</code> library. <h3>Data Cleaning</h3> First, let's input and organize the sampling data as numpy arrays, which will later help with computation and clarity. <pre class="prettyprint"><code>import matplotlib.pyplot as plt import scipy.optimize as opt import numpy as np #% matplotlib inline # DATA ------------------------------------------------------------------------ T_values = np.array([222, 284, 308.5, 333, 358, 411, 477, 518, 880, 1080, 1259]) C_values = np.array([0.1282, 0.2308, 0.2650, 0.3120 , 0.3547, 0.4530, 0.5556, 0.6154, 0.8932, 0.9103, 0.9316]) x_samp = T_values y_samp = C_values </code></pre> There are many curve fitting functions in scipy and numpy and each is used differently, e.g. <code>scipy.optimize.leastsq</code> and <code>scipy.optimize.least_squares</code>. For simplicity, we will use <code>scipy.optimize.curve_fit</code>, but it is difficult to find an optimized regression curve without selecting reasonable starting parameters. A simple technique will later be demonstrated on selecting starting parameters. <h3>Review</h3> First, although the OP provided an expected fitting equation, we will approach the problem of using Python to curve fit by reviewing the general equation for an exponential function: <img src="https://i.stack.imgur.com/6uHMm.png" alt="enter image description here"> Now we build this general function, which will be used a few times: <pre class="prettyprint"><code># GENERAL EQUATION ------------------------------------------------------------ def func(x, A, c, d): return A*np.exp(c*x) + d </code></pre> Trends <ul> <li> amplitude: a small <code>A</code> gives a small amplitude</li> <li> shape: a small <code>c</code> controls the shape by flattening the "knee" of the curve</li> <li> position: <code>d</code> sets the y-intercept</li> <li> orientation: a negative <code>A</code> flips the curve across a horizontal axis; a negative <code>c</code> flips the curve across a vertical axis</li> </ul> The latter trends are illustrated below, highlighting the control (black line) compared to a line with a varied parameter (red line): <img src="https://i.stack.imgur.com/Kj76r.png" alt="enter image description here"> <img src="https://i.stack.imgur.com/5Rc8D.png" alt="enter image description here"> <h3>Selecting Initial Parameters</h3> Using the latter trends, let us next look at the data and try to emulate the curve by adjusting these parameters. For demonstration, we plot several trial equations against our data: <pre class="prettyprint"><code># SURVEY ---------------------------------------------------------------------- # Plotting Sampling Data plt.plot(x_samp, y_samp, "ko", label="Data") x_lin = np.linspace(0, x_samp.max(), 50) # a number line, 50 evenly spaced digits between 0 and max # Trials A, c, d = -1, -1e-2, 1 y_trial1 = func(x_lin, A, c, d) y_trial2 = func(x_lin, -1, -1e-3, 1) y_trial3 = func(x_lin, -1, -3e-3, 1) plt.plot(x_lin, y_trial1, "--", label="Trial 1") plt.plot(x_lin, y_trial2, "--", label="Trial 2") plt.plot(x_lin, y_trial3, "--", label="Trial 3") plt.legend() </code></pre> <img src="https://i.stack.imgur.com/0VRnX.png" alt="enter image description here"> From simple trial and error, we can approximate the shape, amplitude, position and orientation of the curve better. For instance, we know the first two parameters (<code>A</code> and <code>c</code>) must be negative. We also have a reasonable guess for the order of magnitude for <code>c</code>. <h3>Computing Estimated Parameters</h3> We will now use the parameters of the best trial for our initial guesses: <pre class="prettyprint"><code># REGRESSION ------------------------------------------------------------------ p0 = [-1, -3e-3, 1] # guessed params w, _ = opt.curve_fit(func, x_samp, y_samp, p0=p0) print("Estimated Parameters", w) # Model y_model = func(x_lin, *w) # PLOT ------------------------------------------------------------------------ # Visualize data and fitted curves plt.plot(x_samp, y_samp, "ko", label="Data") plt.plot(x_lin, y_model, "k--", label="Fit") plt.title("Least squares regression") plt.legend(loc="upper left") # Estimated Parameters [-1.66301087 -0.0026884 1.00995394] </code></pre> <img src="https://i.stack.imgur.com/D3HYY.png" alt="enter image description here"> <h3>How Does this Work?</h3> <code>curve_fit</code> is one of many optimization functions offered by scipy. Given an initial value, the resulting estimated parameters are iteratively refined so that the resulting curve minimizes the residual error, or difference between the fitted line and sampling data. A better guess reduces the number of iterations and speeds up the result. With these estimated parameters for the fitted curve, one can now calculate the specific coefficients for a particular equation (a final exercise left to the OP).

Least square method in python?

Tags:

python

data-fitting

least-squares

I have these values:

Click to copy

T_values = (222, 284, 308.5, 333, 358, 411, 477, 518, 880, 1080, 1259) (x values)

Click to copy

C/(3Nk)_values = (0.1282, 0.2308, 0.2650, 0.3120 , 0.3547, 0.4530, 0.5556, 0.6154, 0.8932, 0.9103, 0.9316) (y values)

I know they follow the model:

Click to copy

C/(3Nk)=(h*w/(k*T))**2*(exp(h*w/(k*T)))/(exp(h*w/(k*T)-1))**2

I also know that k=1.38*10**(-23) and h=6.626*10**(-34). I have to find the w that best describes the measurement data. I'd like to solve this using the least square method in python, however I don't really understand how this works. Can anyone help me?

205

asked Apr 25 '17 17:04

Philipp

2 Answers

This answer provides a walk-through on using Python to determine fitting parameters for a general exponential pattern. See also a related posts on linearization techniques and using the lmfit library.

Data Cleaning

First, let's input and organize the sampling data as numpy arrays, which will later help with computation and clarity.

Click to copy

import matplotlib.pyplot as plt
import scipy.optimize as opt
import numpy as np


#% matplotlib inline

# DATA ------------------------------------------------------------------------
T_values = np.array([222, 284, 308.5, 333, 358, 411, 477, 518, 880, 1080, 1259])
C_values = np.array([0.1282, 0.2308, 0.2650, 0.3120 , 0.3547, 0.4530, 0.5556, 0.6154, 0.8932, 0.9103, 0.9316])

x_samp = T_values
y_samp = C_values

There are many curve fitting functions in scipy and numpy and each is used differently, e.g. scipy.optimize.leastsq and scipy.optimize.least_squares. For simplicity, we will use scipy.optimize.curve_fit, but it is difficult to find an optimized regression curve without selecting reasonable starting parameters. A simple technique will later be demonstrated on selecting starting parameters.

Review

First, although the OP provided an expected fitting equation, we will approach the problem of using Python to curve fit by reviewing the general equation for an exponential function:

enter image description here

Now we build this general function, which will be used a few times:

Click to copy

# GENERAL EQUATION ------------------------------------------------------------
def func(x, A, c, d):
    return A*np.exp(c*x) + d

Trends

amplitude: a small A gives a small amplitude
shape: a small c controls the shape by flattening the "knee" of the curve
position: d sets the y-intercept
orientation: a negative A flips the curve across a horizontal axis; a negative c flips the curve across a vertical axis

The latter trends are illustrated below, highlighting the control (black line) compared to a line with a varied parameter (red line):

enter image description here

Selecting Initial Parameters

Using the latter trends, let us next look at the data and try to emulate the curve by adjusting these parameters. For demonstration, we plot several trial equations against our data:

Click to copy

# SURVEY ----------------------------------------------------------------------
# Plotting Sampling Data
plt.plot(x_samp, y_samp, "ko", label="Data")

x_lin = np.linspace(0, x_samp.max(), 50)                   # a number line, 50 evenly spaced digits between 0 and max

# Trials
A, c, d = -1, -1e-2, 1
y_trial1 = func(x_lin,  A,     c, d)
y_trial2 = func(x_lin, -1, -1e-3, 1)
y_trial3 = func(x_lin, -1, -3e-3, 1)

plt.plot(x_lin, y_trial1, "--", label="Trial 1")
plt.plot(x_lin, y_trial2, "--", label="Trial 2")
plt.plot(x_lin, y_trial3, "--", label="Trial 3")
plt.legend()

enter image description here

From simple trial and error, we can approximate the shape, amplitude, position and orientation of the curve better. For instance, we know the first two parameters (A and c) must be negative. We also have a reasonable guess for the order of magnitude for c.

Computing Estimated Parameters

We will now use the parameters of the best trial for our initial guesses:

Click to copy

# REGRESSION ------------------------------------------------------------------
p0 = [-1, -3e-3, 1]                                        # guessed params
w, _ = opt.curve_fit(func, x_samp, y_samp, p0=p0)     
print("Estimated Parameters", w)  

# Model
y_model = func(x_lin, *w)

# PLOT ------------------------------------------------------------------------
# Visualize data and fitted curves
plt.plot(x_samp, y_samp, "ko", label="Data")
plt.plot(x_lin, y_model, "k--", label="Fit")
plt.title("Least squares regression")
plt.legend(loc="upper left")

# Estimated Parameters [-1.66301087 -0.0026884   1.00995394]

enter image description here

How Does this Work?

curve_fit is one of many optimization functions offered by scipy. Given an initial value, the resulting estimated parameters are iteratively refined so that the resulting curve minimizes the residual error, or difference between the fitted line and sampling data. A better guess reduces the number of iterations and speeds up the result. With these estimated parameters for the fitted curve, one can now calculate the specific coefficients for a particular equation (a final exercise left to the OP).

answered Oct 17 '22 12:10

pylang

You want to use scipy:

Click to copy

import scipy.optimize.curve_fit

def my_model(T,w):
    return (hw/(kT))**2*(exp(hw/(kT)))/(exp(hw/(kT)-1))**2
w= 0 #initial guess
popt, pcov = curve_fit(my_model, T_values, C_values,p0=[w])

answered Oct 17 '22 12:10

Mohammad Athar

Related questions
                            
                                Set y-axis scale for pandas Dataframe Boxplot(), 3 Deviations?
                            
                                How to remove hyphens from a list of strings [duplicate]
                            
                                How to create a dictionary with new KEY with data from list? [duplicate]
                            
                                Python custom module name not defined
                            
                                what is the shortcut for "replace all" in pycharm
                            
                                Convert JSON list of integers to String in Django Rest Framework
                            
                                Charting Candlestick_OHLC one minute bars with Pandas and Matplotlib
                            
                                Install scipy for both python 2 and python 3
                            
                                python - replace the boolean value of a list with the values from two different lists [duplicate]
                            
                                Get Type in Robot Framework
                            
                                Regex split string by last occurrence of pattern
                            
                                Convert array of integers into dictionary of indices
                            
                                Django - authenticate() A user with that username already exists
                            
                                Python - Create a List Starting at a Given Value and End at Given Length
                            
                                insert item to list without insert() or append() Python
                            
                                S3: ExpiredToken error for S3 pre-signed url within expiry period
                            
                                NameError: name 'tree' is not defined
                            
                                "No module named scipy" on Windows
                            
                                TypeError: '<' not supported between instances of 'State' and 'State' PYTHON 3
                            
                                Warning using Scipy with Pandas

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Least square method in python?

Tags:

python

data-fitting

least-squares

Philipp

People also ask

2 Answers

Data Cleaning

Review

Selecting Initial Parameters

Computing Estimated Parameters

How Does this Work?

pylang

Mohammad Athar

Recent Activity

Donate For Us