How to reverse PCA in prcomp to get original data

Tags:

r

I want to reverse the PCA calculated from prcomp to get back to my original data.

I thought something like the following would work:

pca$x %*% t(pca$rotation)

but it doesn't.

The following link shows how to get back the original data from PCs, but explains it only for PCA using eigen on the covariance matrix http://www.di.fc.ul.pt/~jpn/r/pca/pca.html

prcomp doesn't calcluate PCs that way.

"The calculation is done by a singular value decomposition of the (centered and possibly scaled) data matrix, not by using eigen on the covariance matrix." -prcomp

424

asked Apr 21 '15 21:04

Jase Villam

1 Answers

prcomp will center the variables so you need to add the subtracted means back

t(t(pca$x %*% t(pca$rotation)) + pca$center)

If pca$scale is TRUE you will also need to re-scale

t(t(pca$x %*% t(pca$rotation)) * pca$scale + pca$center)

169

answered Sep 27 '22 23:09

konvas

Related questions
                            
                                Replace the spaces between multiple (3+) capital letters
                            
                                Explaining the forecasts from an ARIMA model
                            
                                Efficiently compute the row sums of a 3d array in R
                            
                                Order data frame by two columns in R
                            
                                How to set up an R based service on a web page [closed]
                            
                                wide to long multiple measures each time
                            
                                combining two plots in r
                            
                                Circular plot with vectors in R
                            
                                Creating line plot with time scale and labels in r
                            
                                Trying to get tf-idf weighting working in R
                            
                                Extract only coefficients whose p values are significant from a logistic model
                            
                                Getting driving distance between two points (lat, lon) using R and Google Map API
                            
                                Vary colors of axis labels in R based on another variable
                            
                                Is there an expression in `R` for "output of the last command"? [duplicate]
                            
                                Plotting points with color and shape based on data variables
                            
                                Labeling center of map polygons in R ggplot
                            
                                Merging two data.frames by key column
                            
                                Weighted sampling in R
                            
                                ggplot2 error : Discrete value supplied to continuous scale
                            
                                Freezing header and first column using data.table in Shiny

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With