I am using the nlsLM function from the minpack.lm package to find the values of the parameters a, e, and c that give the best fit to the data out.
Here is my code:
n <- seq(0, 70000, by = 1)
TR <- 0.946
b <- 2000
k <- 50000
nr <- 25
na <- 4000
nd <- 3200
d <- 0.05775
y <- d + ((TR*b)/k)*(nr/(na + nd + nr))*n
## summary(y)
out <- data.frame(n = n, y = y)
plot(out$n, out$y)
## Estimate the parameters of a nonlinear model
library(minpack.lm)
k1 <- 50000
k2 <- 5000
fit_r <- nlsLM(y ~ a*(e*n + k1*k2 + c), data = out,
               start = list(a = 2e-10, e = 6e+05, c = 1e+07),
               lower = c(0, 0, 0), algorithm = "port")
print(fit_r)
## summary(fit_r)
df_fit <- data.frame(n = seq(0, 70000, by = 1))
df_fit$y <- predict(fit_r, newdata = df_fit)
plot(out$n, out$y, type = "l", col = "red", ylim = c(0,10))
lines(df_fit$n, df_fit$y, col="green")
legend(0,ceiling(max(out$y)),legend=c("observed","predicted"), col=c("red","green"), lty=c(1,1), ncol=1)
The fit seems to be very sensitive to the initial conditions. For example, with start = list(a = 2e-10, e = 6e+05, c = 1e+07) I get a good fit:

Nonlinear regression model
  model: y ~ a * (e * n + k1 * k2 + c)
   data: out
        a         e         c
2.221e-10 5.895e+05 9.996e+06
 residual sum-of-squares: 3.225e-26

Algorithm "port", convergence message: Relative error between `par' and the solution is at most `ptol'.

With start = list(a = 2e-01, e = 100, c = 2) I get a bad fit:

Nonlinear regression model
  model: y ~ a * (e * n + k1 * k2 + c)
   data: out
        a         e         c
1.839e-08 1.000e+02 0.000e+00
 residual sum-of-squares: 476410

Algorithm "port", convergence message: Relative error in the sum of squares is at most `ftol'.
So, is there an efficient way to find initial conditions that give a good fit to the data?
EDIT:
I added the following code to explain the problem better. It finds the values of a, e, and c that give the best fit for several data sets; each line in Y corresponds to one data set. Running it raises an error for the 3rd data set (the 3rd line in Y): singular gradient matrix at initial parameter estimates.
Here is the code:
TR <- 0.946
b <- 2000
k <- 50000
nr <- 25
na <- 4000
nd <- 3200
d <- 0.05775
Y <- data.frame(k1 = c(114000, 72000, 2000, 100000),
                k2 = c(47356, 30697, 214, 3568),
                n  = c(114000, 72000, 2000, 100000),
                na = c(3936, 9245, 6834, 2967),
                nd = c(191, 2409, 2668, 2776),
                nr = c(57, 36, 1, 50),
                a = NA, e = NA, c = NA)
## Create a function to round a value down to one significant digit
roundDown <- function(x) {
  k <- floor(log10(x))
  out <- floor(x*10^(-k))*10^k
  return(out)
}
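## For example (illustrative values, not from the original post):
##   roundDown(0.0347)  # 0.03
##   roundDown(2.7e-10) # 2e-10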
ID_line_NA <- which(is.na(Y[,c("a")]), arr.ind=TRUE)
## print(ID_line_NA)
for(i in ID_line_NA){
  print(i)
  ## Define the variable y
  seq_n <- seq(0, Y[i, c("n")], by = 1)
  y <- d + (((TR*b)/(Y[i, c("k1")]))*(Y[i, c("nr")]/(Y[i, c("na")] + Y[i, c("nd")] + Y[i, c("nr")])))*seq_n
  ## summary(y)
  out <- data.frame(n = seq_n, y = y)
  ## plot(out$n, out$y)
  ## Build the linear model to find the values of parameters that give the best fit
  mod <- lm(y ~ n, data = out)
  ## print(mod)
  ## Define initial conditions
  test_a <- roundDown(as.vector(coefficients(mod)[1])/(Y[i, c("k1")]*Y[i, c("k2")]))
  test_e <- as.vector(coefficients(mod)[2])/test_a
  test_c <- (as.vector(coefficients(mod)[1])/test_a) - (Y[i, c("k1")]*Y[i, c("k2")])
  ## Build the nonlinear model
  fit <- tryCatch(nlsLM(y ~ a*(e*n + Y[i, c("k1")]*Y[i, c("k2")] + c), data = out,
                        start = list(a = test_a, e = test_e, c = test_c),
                        lower = c(0, 0, 0)),
                  warning = function(w) return(1),
                  error = function(e) return(2))
  ## print(fit)
  if(is(fit, "nls")){
    ## Plot
    tiff(paste("F:/Sources/Test_", i, ".tiff", sep=""), width = 10, height = 8, units = 'in', res = 300)
    par(mfrow = c(1, 2), oma = c(0, 0, 2, 0))
    df_fit <- data.frame(n = seq_n)
    df_fit$y <- predict(fit, newdata = df_fit)
    plot(out$n, out$y, type = "l", col = "red", ylim = c(0, ceiling(max(out$y))))
    lines(df_fit$n, df_fit$y, col = "green")
    dev.off()
    ## Add the parameters a, e and c in the data frame
    Y[i, c("a")] <- as.vector(coef(fit)[c("a")])
    Y[i, c("e")] <- as.vector(coef(fit)[c("e")])
    Y[i, c("c")] <- as.vector(coef(fit)[c("c")])
  } else{
    print("Error in the NLM")
  }
}
So, given the constraints a > 0, e > 0, and c > 0, is there an efficient way to find initial conditions for the nlsLM function that give a good fit to the data and avoid these error messages?
I added some conditions to define initial conditions for the parameters a, e, and c, using the result of the linear model lm(y ~ n):

c = intercept/a - k1*k2 > 0
e = slope/a > 0
0 < a < intercept/(k1*k2)

where intercept and slope are the intercept and slope of lm(y ~ n), respectively.
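A minimal sketch of how these conditions can be turned into starting values (the helper pick_start is hypothetical, not part of my original code; it assumes out, k1, and k2 as defined above). Any a in (0, intercept/(k1*k2)) satisfies all three constraints, so we can pick one and derive e and c from it:

pick_start <- function(out, k1, k2, frac = 0.5) {
  co <- coef(lm(y ~ n, data = out))
  intercept <- co[[1]]
  slope <- co[[2]]
  ## any a0 in (0, intercept/(k1*k2)) keeps c > 0; frac = 0.5 takes the midpoint
  a0 <- frac * intercept / (k1 * k2)
  list(a = a0,
       e = slope / a0,                 # e = slope/a > 0
       c = intercept / a0 - k1 * k2)   # c = intercept/a - k1*k2 > 0
}
## e.g. for the data of the first code block:
## start_vals <- pick_start(out, k1 = 50000, k2 = 5000)
## nlsLM(y ~ a*(e*n + 50000*5000 + c), data = out, start = start_vals, lower = c(0, 0, 0))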
As the R documentation for nls warns, nls uses a relative-offset convergence criterion that compares the numerical imprecision at the current parameter estimates to the residual sum-of-squares. This performs well on data of the form y = f(x, θ) + ε (with Var(ε) > 0), but it fails on artificial zero-residual data such as the y generated above, because the criterion then amounts to comparing two components of round-off error.
The problem is not how to find initial values for the parameters. The problem is that this is a re-parameterized linear model with constraints: expanding the formula gives y = a*(e*n + k1*k2 + c) = (a*e)*n + a*(k1*k2 + c), a straight line in n. Its slope is a*e and its intercept is a*(k1*k2 + c), so there can only be two identifiable parameters, such as slope and intercept, whereas the formula in the question attempts to define three: a, c and e.
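To see this concretely, fit a straight line to the data from the first code block (fit_lin is just an illustrative name; the values in the comments follow from the constants in the question):

fit_lin <- lm(y ~ n, data = out)
coef(fit_lin)[[1]]  # intercept a*(k1*k2 + c); equals d = 0.05775 for this data
coef(fit_lin)[[2]]  # slope a*e; approximately 1.309e-04 for this data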
We will need to fix one of the variables or, in general, add an additional constraint. Now if co is a vector whose first element is the intercept and second element is the slope from the linear model

fm <- lm(y ~ n)
co <- coef(fm)

then we have the equations:

co[[1]] = a*(k1*k2 + c)
co[[2]] = a*e
co, k1 and k2 are known, and if we consider c as fixed then we can solve for a and e to give:

a = co[[1]] / (k1*k2 + c)
e = (k1*k2 + c) * co[[2]] / co[[1]]
Since both co[[1]] and co[[2]] are positive, and c must be too, a and e are necessarily positive as well, giving us a solution once we arbitrarily fix c. This gives an infinite number of (a, e) pairs that minimize the residual sum of squares, one for each non-negative value of c. Note that we do not need to invoke nlsLM for this.
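A small check of this invariance (not in the original answer; it assumes out, k1 and k2 from the first code block): two very different fixed values of c produce identical fitted curves.

co <- coef(lm(y ~ n, data = out))
## helper: fitted values of the model for a given fixed c
fitted_for <- function(cc) {
  a <- co[[1]] / (k1*k2 + cc)
  e <- (k1*k2 + cc) * co[[2]] / co[[1]]
  a * (e * out$n + k1*k2 + cc)
}
max(abs(fitted_for(1e-10) - fitted_for(1e+03)))  # essentially zero (floating-point noise)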
For example, for c = 1e-10 we have:
fm <- lm(y ~ n)
co <- coef(fm)
c <- 1e-10
a <- co[[1]] / (k1*k2 + c)
e <- (k1 * k2 + c) * co[[2]] / co[[1]]
a; e
## [1] 2.31e-10
## [1] 566815
Note that numerical problems may exist due to the large difference in magnitude among the coefficients; moreover, increasing c increases e and decreases a, which makes the scaling even worse, so the parameterization given in the question seems to have inherently bad numerical scaling.
Note that none of this requires running nlsLM to get the optimal coefficients; however, due to the bad scaling, running it from these starting values might still improve the answer somewhat:
co <- coef(lm(y ~ n))
c <- 1e-10
a <- co[[1]] / (k1*k2 + c)
e <- (k1 * k2 + c) * co[[2]] / co[[1]]
nlsLM(y ~ a * (e * n + k1 * k2 + c), start = list(a = a, e = e), lower = c(0, 0))
which gives:
Nonlinear regression model
model: y ~ a * (e * n + k1 * k2 + c)
data: parent.frame()
a e
2.310e-10 5.668e+05
residual sum-of-squares: 1.673e-26
Number of iterations to convergence: 12
Achieved convergence tolerance: 1.49e-08
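Put differently, once c is fixed the model is linear in the remaining parameters, so the lm coefficients already give the least-squares optimum; nlsLM can at most polish them numerically.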