I am trying to implement segmented regression as per this example Segmented Regression, Breakpoint analysis. Now, how can i implement it in such a way the second part will be quadratic polynomial and remaining other things same. I tried the same by changing <code>Z= ~poly(DistanceMeters, 2)</code> however it didn't work. Also, How can I get equations like <pre class="prettyprint"><code>part 1: a1*x+b1 part 2: a2*x2**2 + b2*x + c1 part 3 :a3*x + b3 </code></pre> There are similar questions like this however they din't explain using segmented function.

I have two ideas, both with drawbacks. Maybe you can adjust one of them to your needs. Unfortunately cannot access Drive at the moment, so some artificial data used. 1. "Manually" fit polynomial models Here you can specify whichever models you like, some segments can be lm's, some polynomials etc. Code: <pre class="prettyprint"><code>library(segmented) library(ggplot2) library(data.table) # Data set.seed(12) xx <- 1:100 yy <- 2 + 1.5 * pmax(xx-35, 0) - 1.5 * pmax(xx-70, 0) + 15 * pmax(runif(100) - 0.5, 0) + rnorm(100, 0, 2) dt <- data.table(x = xx, y = yy, type = 'act') dt_all <- copy(dt) # lm lm_lin <- lm(y ~ x, data = dt) summary(lm_lin) # Find segments lm_seg <- segmented( lm_lin, seg.Z = ~ x, psi = list(x = c(30, 80))) # "Manual" lm's breaks <- unname(lm_seg$psi[, 'Est.']) lm_poly1 <- lm(y ~ poly(x, 4), data = dt[x < breaks[1], ]) lm_2 <- lm(y ~ x, data = dt[x > breaks[1] & x < breaks[2], ]) lm_poly3 <- lm(y ~ poly(x, 4), data = dt[x > breaks[2], ]) dt_all <- rbind( dt_all, data.table(x = xx, y = c( predict(lm_poly1), predict(lm_2), predict(lm_poly3)), type = 'lm_poly' ) ) </code></pre> 2. Fit a gam model using breaks from <code>segmented</code> and some splines Here you will get a smooth transition between segments, but you have much less control on what is happening. <pre class="prettyprint"><code># Using splines for smooth segments library(mgcv) spl <- gam(y ~ s(x, bs = "cc", k = 12), data = dt, knots = list(xx = breaks)) # Plot dt_all <- rbind(dt_all, data.table(x = xx, y = predict(spl), type = 'spl')) ggplot(dt_all, aes(x = x, y = y)) + geom_point(size = 1) + facet_grid(. ~ type) + theme_minimal() </code></pre> <img src="https://i.stack.imgur.com/tgSs2.jpg" alt="enter image description here"> Both can be done using e.g. <code>list()</code> and <code>lapply()</code> to automate a bit (for varying number of breaks etc.). Edit: By changing arguments of <code>poly</code> and <code>s</code> you can get slightly "better" fitting models, but for <code>gam</code> errors on the edges are quite big, see for <code>degree = 6</code> and <code>k = 30</code>: <img src="https://i.stack.imgur.com/90HF0.jpg" alt="enter image description here">

Segmented regression with quadratic polynomial and a strightline

Tags:

r

regression

curve-fitting

I am trying to implement segmented regression as per this example Segmented Regression, Breakpoint analysis.

Now, how can i implement it in such a way the second part will be quadratic polynomial and remaining other things same.

I tried the same by changing Z= ~poly(DistanceMeters, 2) however it didn't work.

Also, How can I get equations like

part 1: a1*x+b1
part 2: a2*x2**2 + b2*x + c1
part 3 :a3*x + b3

There are similar questions like this however they din't explain using segmented function.

845

asked Mar 07 '17 04:03

Shankar Pandala

1 Answers

I have two ideas, both with drawbacks. Maybe you can adjust one of them to your needs. Unfortunately cannot access Drive at the moment, so some artificial data used.

1. "Manually" fit polynomial models

Here you can specify whichever models you like, some segments can be lm's, some polynomials etc.

Code:

library(segmented)
library(ggplot2)
library(data.table)

# Data
set.seed(12)
xx <- 1:100
yy <- 2 + 1.5 * pmax(xx-35, 0) - 1.5 * pmax(xx-70, 0) + 15 * pmax(runif(100) - 0.5, 0) + rnorm(100, 0, 2)

dt <- data.table(x = xx, y = yy, type = 'act')
dt_all <- copy(dt)

# lm
lm_lin <- lm(y ~ x, data = dt)
summary(lm_lin)

# Find segments
lm_seg <- segmented(
  lm_lin, seg.Z = ~ x, psi = list(x = c(30, 80)))

# "Manual" lm's
breaks <- unname(lm_seg$psi[, 'Est.'])
lm_poly1 <- lm(y ~ poly(x, 4), data = dt[x < breaks[1], ])
lm_2 <- lm(y ~ x, data = dt[x > breaks[1] & x < breaks[2], ])
lm_poly3 <- lm(y ~ poly(x, 4), data = dt[x > breaks[2], ])

dt_all <- rbind(
  dt_all,
  data.table(x = xx, y = c(
    predict(lm_poly1),
    predict(lm_2),
    predict(lm_poly3)),
    type = 'lm_poly'
  )
)

2. Fit a gam model using breaks from segmented and some splines

Here you will get a smooth transition between segments, but you have much less control on what is happening.

# Using splines for smooth segments
library(mgcv)

spl <- gam(y ~ s(x, bs = "cc", k = 12), data = dt, knots = list(xx = breaks))

# Plot
dt_all <- rbind(dt_all, data.table(x = xx, y = predict(spl), type = 'spl'))
ggplot(dt_all, aes(x = x, y = y)) + geom_point(size = 1) +
  facet_grid(. ~ type) + theme_minimal()

enter image description here

Both can be done using e.g. list() and lapply() to automate a bit (for varying number of breaks etc.).

Edit:

By changing arguments of poly and s you can get slightly "better" fitting models, but for gam errors on the edges are quite big, see for degree = 6 and k = 30:

enter image description here

177

answered Oct 20 '22 05:10

m-dz

Related questions
                            
                                Shiny DataTable: Disable row selection for certain rows
                            
                                How to use R's testthat to unit test individual files?
                            
                                How does ggplot2 density differ from the density function?
                            
                                imageOutput click within conditionalPanel
                            
                                Rmarkdown Chunk Name from Variable
                            
                                when is R's `ByteCompile` counter-productive?
                            
                                How can I set R.HOME() and/or R_HOME correctly?
                            
                                Rmarkdown of Stargazer: LaTeX Error if align is set to TRUE
                            
                                Efficiently construct GRanges/IRanges from Rle vector
                            
                                R function for position of sun giving unexpected results
                            
                                Combine data frame rows in R based on multiple columns
                            
                                Is there a reliable way to detect POSIXlt objects representing a time which does not exist due to DST?
                            
                                lme4::glmer.nb function produces "Error in family$family : $ operator not defined for this S4 class" depending on the order I run models
                            
                                R plotly version 4.5.2 scatterplot legend bubble size settings
                            
                                modelr: Fitting multiple models with resampled data
                            
                                How to change column names in stargazer when printing data frames?
                            
                                buffer areas around lines ggplot2
                            
                                How to show error location in tryCatch?
                            
                                Data.Table non-equi join with arithmetic operations
                            
                                Source function in R, error cannot find file - do I have to change working directory?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With