Fitting curve: why small numbers are better?

Tags: python, numpy

I've spent some time on a problem these past few days. I have a set of data:

y = f(t), where y is a very small concentration (on the order of 10^-7) and t is time in seconds. t varies from 0 to around 12000.

The measurements follow an established model:

y = Vs * t - ((Vs - Vi) * (1 - np.exp(-k * t)) / k)

I need to find Vs, Vi, and k. So I used curve_fit, which returns the best-fitting parameters, and I plotted the resulting curve.
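
(A minimal sketch of what this first fit can look like; the array names t_data and y_data and the use of the default initial guess are assumptions on my part, not details from the post.)

import numpy as np
from scipy.optimize import curve_fit

def model(t, Vs, Vi, k):
    # y = Vs*t - (Vs - Vi)*(1 - exp(-k*t))/k, with t in seconds
    return Vs * t - (Vs - Vi) * (1 - np.exp(-k * t)) / k

# t_data: times in seconds (0 to ~12000); y_data: measured concentrations (~1e-7)
popt, pcov = curve_fit(model, t_data, y_data)
Vs_fit, Vi_fit, k_fit = popt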

And then I used a similar model:

y = (Vs * t/3600 - ((Vs - Vi) * (1 - np.exp(-k * t/3600)) / k)) * 10**7

With this version, t is a number of hours and y is a number between 0 and about 10. The returned parameters are of course different. But when I plot each curve, here is what I get:

http://i.imgur.com/XLa4LtL.png

The green curve is the fit from the first model, the blue one is from the "normalized" model, and the red dots are the experimental values.

The fitted curves are different. I don't think this is expected, and I don't understand why. Are the calculations more accurate when the numbers are "reasonable"?
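
(For reference, a sketch of how the second, rescaled fit described above might be set up; fitting against y multiplied by 10**7 is one reading of the description, not something stated explicitly.)

def model_scaled(t, Vs, Vi, k):
    # Same model, but time is converted to hours inside and the output is scaled by 1e7
    th = t / 3600.0
    return (Vs * th - (Vs - Vi) * (1 - np.exp(-k * th)) / k) * 10**7

popt2, pcov2 = curve_fit(model_scaled, t_data, y_data * 10**7)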

Asked Sep 06 '13 by JPFrancoia


1 Answer

The docstring for optimize.curve_fit says,

p0 : None, scalar, or M-length sequence
    Initial guess for the parameters.  If None, then the initial
    values will all be 1 (if the number of parameters for the function
    can be determined using introspection, otherwise a ValueError
    is raised).

Thus, to begin with, the initial guess for the parameters is by default 1.
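
So when the true parameters are nowhere near 1, it usually helps to supply an explicit p0 of roughly the right order of magnitude. A sketch, reusing the model and data names from the question above; the guess values are placeholders, not fitted results:

# Rough order-of-magnitude guesses for Vs, Vi, k -- placeholders only
p0 = [1e-11, 1e-11, 1e-3]
popt, pcov = curve_fit(model, t_data, y_data, p0=p0)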

Moreover, curve-fitting algorithms have to sample the function for various values of the parameters. The "various values" are initially chosen with a step size on the order of 1. The algorithm works better if your data varies reasonably smoothly with parameter changes on the order of 1.

If the function varies wildly with parameter changes on the order of 1, then the algorithm may tend to miss the optimum parameter values.

Note that even if the algorithm uses an adaptive step size when it tweaks the parameter values, if the initial tweak is so far off the mark as to produce a big residual, and if tweaking in some other direction happens to produce a smaller residual, then the algorithm may wander off in the wrong direction and miss the local minimum. It may find some other (undesired) local minimum, or simply fail to converge. So using an algorithm with an adaptive step size won't necessarily save you.

The moral of the story is that scaling your data can improve the algorithm's chances of finding the desired minimum.


Numerical algorithms in general tend to work better when applied to data whose magnitude is on the order of 1. This bias enters the algorithm in numerous ways. For instance, optimize.curve_fit relies on optimize.leastsq, and the call signature for optimize.leastsq is:

def leastsq(func, x0, args=(), Dfun=None, full_output=0,
            col_deriv=0, ftol=1.49012e-8, xtol=1.49012e-8,
            gtol=0.0, maxfev=0, epsfcn=None, factor=100, diag=None):

Thus, by default, the tolerances ftol and xtol are on the order of 1e-8. If finding the optimum parameter values requires much smaller tolerances, then these hard-coded defaults will cause optimize.curve_fit to miss the optimum parameter values.

To make this more concrete, suppose you were trying to minimize f(x) = 1e-100*x**2. The factor of 1e-100 squashes the y-values so much that a wide range of x-values (the parameter values mentioned above) will fit within the tolerance of 1e-8. So, with such poorly scaled values, leastsq will not do a good job of finding the minimum.
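
A toy illustration of that squashing (the numbers are mine, not from the question):

import numpy as np

x = np.array([1.0, 1e20, 1e40])
print(1e-100 * x**2)   # 1e-100, 1e-60, 1e-20 -- all far below the default ftol of ~1.5e-8

Since curve_fit forwards extra keyword arguments to leastsq, tolerances such as ftol and xtol can be tightened per call, but rescaling the data is usually the more robust fix.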


Another reason to use floats on the order of 1 is that there are many more (IEEE 754) floats in the interval [-1, 1] than there are far away from it. For example,

import struct

def floats_between(x, y):
    """
    Count the IEEE 754 doubles between the non-negative floats x and y.
    http://stackoverflow.com/a/3587987/190597 (jsbueno)
    """
    # Reinterpret the bit patterns of the two doubles as 64-bit integers;
    # for non-negative floats, the difference of those integers is the
    # number of representable doubles between x and y.
    a = struct.pack("<dd", x, y)
    b = struct.unpack("<qq", a)
    return b[1] - b[0]

In [26]: floats_between(0,1) / float(floats_between(1e6,1e7))
Out[26]: 311.4397707054894

This shows there are over 300 times as many floats representing numbers between 0 and 1 as there are in the interval [1e6, 1e7]. Thus, all else being equal, you'll typically get a more accurate answer when working with small numbers rather than with very large ones.
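
The same effect shows up in the spacing between adjacent representable doubles, which grows with magnitude:

import numpy as np

print(np.spacing(1.0))   # ~2.22e-16: gap to the next representable double above 1.0
print(np.spacing(1e7))   # ~1.86e-09: gap to the next representable double above 1e7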

Answered Oct 10 '22 by unutbu