mgcv: How to set number and / or locations of knots for splines

Tags: r, regression, spline, mgcv, gam

I want to use the gam function from the mgcv package:

 library(mgcv)
 x <- seq(0, 60, length.out = 600)
 y <- seq(0, 1, length.out = 600)
 prova <- gam(y ~ s(x, bs = 'cr'))

Can I set the number of knots in s()? And how can I find out where the knots used by the spline are located? Thanks!


asked Oct 15 '16 07:10

memy

1 Answer

While setting k (the basis dimension, passed to s()) is the correct way to go, fx = TRUE is definitely not right: it forces a pure regression spline without any penalization.
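To see what fx = TRUE changes, here is a minimal sketch that fits the same basis with and without the penalty and compares the effective degrees of freedom. The simulated data and the names pen_fit and unpen_fit are assumptions made for illustration, not part of the question.

```r
library(mgcv)

## simulated data, for illustration only
set.seed(0)
x <- sort(rnorm(400, 0, pi))
y <- sin(x) + 0.2 * x + cos(abs(x)) + rnorm(400, 0, 0.4)

## penalized fit: smoothness is estimated, so the smooth's edf can drop below k - 1
pen_fit <- gam(y ~ s(x, bs = "cr", k = 20))

## fx = TRUE: no penalty, so the smooth uses all k - 1 = 19 degrees of freedom
unpen_fit <- gam(y ~ s(x, bs = "cr", k = 20, fx = TRUE))

## total edf (intercept + smooth) of each fit
c(penalized = sum(pen_fit$edf), unpenalized = sum(unpen_fit$edf))
```

With the penalty in place, k acts as an upper bound on flexibility rather than a fixed choice, which is why a generous k is usually harmless.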

Locations of knots

For a penalized regression spline, the exact knot locations are not important, as long as:

k is adequately large;
the knots are spread so that they cover the data reasonably well.
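One way to judge whether k is adequately large is mgcv's k.check(), which sets the basis dimension against the effective degrees of freedom actually used. A minimal sketch on simulated data (the data and the names fit and kc are assumptions for illustration):

```r
library(mgcv)

## simulated data, for illustration only
set.seed(0)
x <- sort(rnorm(400, 0, pi))
y <- sin(x) + 0.2 * x + cos(abs(x)) + rnorm(400, 0, 0.4)

fit <- gam(y ~ s(x, bs = "cr", k = 20))

## one row per smooth: k' (maximum df), edf, k-index and a p-value;
## an edf close to k' together with a low p-value hints that k may be too small
kc <- k.check(fit)
kc
```

If the check looks doubtful, the usual remedy is to refit with a larger k and see whether the edf changes much.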

By default:

the natural cubic regression spline (bs = 'cr') places knots at quantiles of the covariate;
the B-spline family (bs = 'bs', bs = 'ps', bs = 'ad') places knots evenly.

Compare the following:

library(mgcv)

## toy data
set.seed(0); x <- sort(rnorm(400, 0, pi))  ## note, my x are not uniformly sampled
set.seed(1); e <- rnorm(400, 0, 0.4)
y0 <- sin(x) + 0.2 * x + cos(abs(x))
y <- y0 + e

## fitting natural cubic spline
cr_fit <- gam(y ~ s(x, bs = 'cr', k = 20))
cr_knots <- cr_fit$smooth[[1]]$xp  ## extract knots locations

## fitting B-spline
bs_fit <- gam(y ~ s(x, bs = 'bs', k = 20))
bs_knots <- bs_fit$smooth[[1]]$knots  ## extract knots locations

## summary plot
par(mfrow = c(1, 2))
plot(x, y, col = "grey", main = "natural cubic spline")
lines(x, cr_fit$linear.predictors, col = 2, lwd = 2)
abline(v = cr_knots, lty = 2)
plot(x, y, col = "grey", main = "B-spline")
lines(x, bs_fit$linear.predictors, col = 2, lwd = 2)
abline(v = bs_knots, lty = 2)

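[Figure: data (grey) with the fitted curve and knot locations (dashed vertical lines); left panel "natural cubic spline", right panel "B-spline"]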

You can see the difference in knot placement: the 'cr' knots crowd where the data are dense, while the B-spline knots are equally spaced.
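The same difference can be checked numerically from the spacing between consecutive knots. This sketch refits the two models on the toy data so that it runs on its own; the names cr_gaps and bs_gaps are introduced here for illustration.

```r
library(mgcv)

## same toy data as above
set.seed(0); x <- sort(rnorm(400, 0, pi))
set.seed(1); e <- rnorm(400, 0, 0.4)
y <- sin(x) + 0.2 * x + cos(abs(x)) + e

cr_knots <- gam(y ~ s(x, bs = 'cr', k = 20))$smooth[[1]]$xp
bs_knots <- gam(y ~ s(x, bs = 'bs', k = 20))$smooth[[1]]$knots

## gaps between consecutive knots
cr_gaps <- diff(cr_knots)
bs_gaps <- diff(bs_knots)

range(cr_gaps)  ## uneven: narrow where x is dense, wide in the tails
range(bs_gaps)  ## essentially constant

## the B-spline knot vector also extends beyond the range of the data
range(x)
range(bs_knots)
```

Note that the B-spline knot vector is longer than k, because it includes the outer knots needed to define the basis at the boundaries.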

Setting your own knot locations:

You can also supply your own knot locations via the knots argument of gam() (note that knots are passed to gam(), not to s()). For example, you can use evenly spaced knots for cr:

xlim <- range(x)  ## get range of x
myfit <- gam(y ~ s(x, bs = 'cr', k = 20),
             knots = list(x = seq(xlim[1], xlim[2], length.out = 20)))

Plotting the fit with its knots shows that they are now evenly spaced:

my_knots <- myfit$smooth[[1]]$xp  ## extract knot locations
plot(x, y, col = "grey", main = "my knots")
lines(x, myfit$linear.predictors, col = 2, lwd = 2)
abline(v = my_knots, lty = 2)

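[Figure: "my knots", data (grey) with the fitted curve and the user-supplied, evenly spaced knot locations (dashed vertical lines)]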

However, there is usually no need to set the knots yourself. If you do want to, be clear about what you are doing. Also, for bs = 'cr' the number of knots you provide must equal k in s(); other bases may expect a different number, so check the help page of the basis you use.
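As a sketch of the knot-count requirement for bs = 'cr' (simulated data; my_knots, ok_fit and bad_fit are illustrative names): supplying exactly k knots works and the knots are stored as given, while a mismatched count should be rejected with an error.

```r
library(mgcv)

## simulated data, for illustration only
set.seed(0)
x <- sort(rnorm(400, 0, pi))
y <- sin(x) + 0.2 * x + cos(abs(x)) + rnorm(400, 0, 0.4)

my_knots <- seq(min(x), max(x), length.out = 20)

## 20 knots with k = 20: accepted, and stored unchanged in $xp
ok_fit <- gam(y ~ s(x, bs = "cr", k = 20), knots = list(x = my_knots))
all.equal(as.numeric(ok_fit$smooth[[1]]$xp), my_knots)

## 15 knots with k = 20: the error is caught here rather than stopping the script
bad_fit <- tryCatch(
  gam(y ~ s(x, bs = "cr", k = 20),
      knots = list(x = seq(min(x), max(x), length.out = 15))),
  error = function(err) err
)
inherits(bad_fit, "error")
```

The name used in the knots list (here x) has to match the covariate name in s(), otherwise the supplied knots are not picked up.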


answered Sep 19 '22 12:09

Zheyuan Li
