Plotting a 95% confidence interval for a lm object

Tags:

How can I calculate and plot a confidence interval for my regression in r? So far I have two numerical vectors of equal length (x,y) and a regression object(lm.out). I have made a scatterplot of y given x and added the regression line to this plot. I am looking for a way to add a 95% prediction confidence band for lm.out to the plot. I've tried using the predict function, but I don't even know where to start with that :/. Here is my code at the moment:

x=c(1,2,3,4,5,6,7,8,9,0)
y=c(13,28,43,35,96,84,101,110,108,13)

lm.out <- lm(y ~ x)

plot(x,y)

regression.data = summary(lm.out) #save regression summary as variable
names(regression.data) #get names so we can index this data
a= regression.data$coefficients["(Intercept)","Estimate"] #grab values
b= regression.data$coefficients["x","Estimate"]
abline(a,b) #add the regression line

Thank you!

Edit: I've taken a look at the proposed duplicate and can't quite get to the bottom of it.

781

asked Sep 28 '17 01:09

Max Lester

2 Answers

You have yo use predict for a new vector of data, here newx.

x=c(1,2,3,4,5,6,7,8,9,0)

y=c(13,28,43,35,96,84,101,110,108,13)

lm.out <- lm(y ~ x)
newx = seq(min(x),max(x),by = 0.05)
conf_interval <- predict(lm.out, newdata=data.frame(x=newx), interval="confidence",
                         level = 0.95)
plot(x, y, xlab="x", ylab="y", main="Regression")
abline(lm.out, col="lightblue")
lines(newx, conf_interval[,2], col="blue", lty=2)
lines(newx, conf_interval[,3], col="blue", lty=2)

EDIT

as it is mention in the coments by Ben this can be done with matlines as follow:

plot(x, y, xlab="x", ylab="y", main="Regression")
abline(lm.out, col="lightblue")
matlines(newx, conf_interval[,2:3], col = "blue", lty=2)

answered Oct 08 '22 12:10

Alejandro Andrade

I'm going to add a tip that would have saved me a lot of frustration when trying the method given by @Alejandro Andrade: If your data are in a data frame, then when you build your model with lm(), use the data= argument rather than $ notation. E.g., use

lm.out <- lm(y ~ x, data = mydata)

rather than

lm.out <- lm(mydata$y ~ mydata$x)

If you do the latter, then this statement

predict(lm.out, newdata=data.frame(x=newx), interval="confidence", level = 0.95)

seems to either ignore the new values passed using newdata= or there's a silent error. Either way, the output is the predictions from the original data, not the new data.

Also, be sure your x variable gets the same name in the new data frame that it had in the original. That's easier to figure out because you do get an error, but knowing it ahead of time might save you a round of debugging.

Note: Tried to add this as a comment, but don't have enough reputation points.

answered Oct 08 '22 11:10

acullum

Related questions
                            
                                Efficient coding to make time increments 'finer' from minutes to seconds
                            
                                Turning a data frame and a list into long format with dplyr
                            
                                matching row values (text) with column names and return value
                            
                                Overall Title for Plotting Window
                            
                                How to partition a set of values (vector) in R
                            
                                Easily input a correlation matrix in R
                            
                                cut() - include lowest values
                            
                                Splitting a number in R
                            
                                More efficient strategy for which() or match()
                            
                                get filename from url path in R
                            
                                Efficient use of functions on long data.frames in R
                            
                                Add new row to matrix one by one
                            
                                matching and counting strings (k-mer of DNA) in R
                            
                                Replace a set of pattern matches with corresponding replacement strings in R
                            
                                R get rows based on multiple conditions - use dplyr and reshape2
                            
                                Stratified sampling on factor
                            
                                Cannot install devtools package after upgrading R
                            
                                How to remove first N rows in a data set in R? [duplicate]
                            
                                Passing reactive values to conditionalPanel condition
                            
                                Distinct enclosing environment, function environment, etc. in R

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Plotting a 95% confidence interval for a lm object

Tags:

plot

r

regression

intervals