
How to obtain the RMSE from an lm result?

I know there is a small difference between $sigma and the concept of root mean squared error, so I am wondering: what is the easiest way to obtain the RMSE from the result of the lm function in R?

res <- lm(price ~ carat + cut + color + clarity +
            depth + table + x + y + z,
          data = randomData)

length(coefficients(res))

returns 24 coefficients (the factor predictors cut, color and clarity each expand into several dummy coefficients), so I can no longer compute the model's predictions by hand. How can I evaluate the RMSE of the model fitted by lm?

asked Mar 30 '17 by Jeff


1 Answer

Residual sum of squares:

RSS <- c(crossprod(res$residuals))

Mean squared error:

MSE <- RSS / length(res$residuals)

Root MSE:

RMSE <- sqrt(MSE)

Pearson estimate of the residual variance (summary.lm reports its square root as the residual standard error, sigma):

sig2 <- RSS / res$df.residual

Statistically, the MSE is the maximum likelihood estimator of the residual variance, but it is biased (downward). The Pearson estimator is the restricted maximum likelihood (REML) estimator of the residual variance, which is unbiased.
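Putting the steps together: since randomData is not shown, here is a minimal end-to-end sketch using the built-in mtcars data as a stand-in (substitute your own data and formula).

# stand-in model -- mtcars is purely illustrative
res <- lm(mpg ~ wt + hp + disp, data = mtcars)

RSS  <- c(crossprod(res$residuals))   # residual sum of squares
MSE  <- RSS / length(res$residuals)   # ML estimate of residual variance
RMSE <- sqrt(MSE)                     # root mean squared error
sig2 <- RSS / res$df.residual         # unbiased (Pearson) estimate

# cross-check: the square of summary.lm's sigma equals sig2
all.equal(sig2, summary(res)$sigma ^ 2)   # TRUE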


Remark

  • Given two vectors x and y, c(crossprod(x, y)) is equivalent to sum(x * y) but much faster. c(crossprod(x)) is likewise faster than sum(x ^ 2). (A quick timing sketch follows below.)
  • sum(x) / length(x) is also faster than mean(x).
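To check the speed claim on your own machine, a rough timing sketch (numbers will vary with your BLAS library and hardware):

x <- rnorm(1e7)
all.equal(sum(x ^ 2), c(crossprod(x)))          # same result
system.time(for (i in 1:20) sum(x ^ 2))         # plain R version
system.time(for (i in 1:20) c(crossprod(x)))    # BLAS-backed version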
answered Oct 05 '22 by Zheyuan Li