While using the <code>princomp()</code> function in R, I encounter the following error: <code>"covariance matrix is not non-negative definite"</code>. I think this is because some values in the covariance matrix are zero (actually close to zero, but they become zero during rounding). Is there a workaround that lets me proceed with PCA when the covariance matrix contains zeros? [FYI: obtaining the covariance matrix is an intermediate step within the <code>princomp()</code> call. A data file to reproduce this error can be downloaded from http://tinyurl.com/6rtxrc3]


How to use the princomp() function in R when the covariance matrix has zeros?

1 Answer

The first strategy might be to decrease the tolerance argument. It looks to me as though princomp does not pass on a tolerance argument, but prcomp does accept a 'tol' argument. If that is not effective, this should identify pairs of variables which have nearly-zero covariance:

nr0 <- 0.001
which(abs(cov(M)) < nr0, arr.ind = TRUE)

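And this would identify negative eigenvalues of the covariance matrix (eigen() needs a square matrix, so it is applied to cov(M) rather than to the data M itself):

which(eigen(cov(M))$values < 0)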
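As a rough illustration of both checks, here is a minimal sketch on made-up data. The matrix M, its dimensions, and the tol value are assumptions for the example, not the asker's data. It builds one column as an exact linear combination of two others, inspects the eigenvalues of the covariance matrix, and then runs prcomp with a 'tol' argument:

```r
set.seed(1)

# Hypothetical data: 3 independent columns plus one exact linear combination
X <- matrix(rnorm(150), nrow = 50, ncol = 3)
M <- cbind(X, X[, 1] + X[, 2])

# Eigenvalues of the covariance matrix; the smallest is numerically zero
# and can come out as a tiny negative number because of rounding
ev <- eigen(cov(M), symmetric = TRUE)$values
print(ev)
print(which(ev < 0))

# prcomp() accepts 'tol': components whose standard deviation is at most
# tol times that of the first component are omitted from the rotation
pc <- prcomp(M, tol = 1e-8)
print(dim(pc$rotation))
```

With an exactly dependent column, the rotation matrix should have one column fewer than M, which is a quick way to see how many components survive the tolerance.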

Using the h9 example on the help(qr) page (h9 is the 9 x 9 Hilbert matrix defined there):

> which(abs(cov(h9)) < .001, arr.ind=TRUE)
      row col
 [1,]   9   4
 [2,]   8   5
 [3,]   9   5
 [4,]   7   6
 [5,]   8   6
 [6,]   9   6
 [7,]   6   7
 [8,]   7   7
 [9,]   8   7
[10,]   9   7
[11,]   5   8
[12,]   6   8
[13,]   7   8
[14,]   8   8
[15,]   9   8
[16,]   4   9
[17,]   5   9
[18,]   6   9
[19,]   7   9
[20,]   8   9
[21,]   9   9
> qr(h9[-9,-9])$rank  
[1] 7                  # rank deficient, at least at the default tolerance
> qr(h9[-(8:9),-(8:9)])$rank  # take out only the vectors with the most dependencies
[1] 6                   # still rank deficient
> qr(h9[-(7:9),-(7:9)])$rank
[1] 6
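To reproduce the transcript above, h9 has to be defined first. A minimal sketch, assuming the hilbert() helper as it appears in the examples on the help(qr) page; the loop and the variable names r_full and k are only illustrative:

```r
# Hilbert matrix helper, as in the examples on the help(qr) page
hilbert <- function(n) {
  i <- 1:n
  1 / outer(i - 1, i, "+")
}
h9 <- hilbert(9)

# Rank of the full matrix at the default tolerance
r_full <- qr(h9)$rank
print(r_full)

# Drop trailing rows/columns one at a time and report the rank each time
for (k in 9:7) {
  sub <- h9[-(k:9), -(k:9)]
  cat("dropping", k, "to 9: dim", nrow(sub), "rank", qr(sub)$rank, "\n")
}
```

Comparing each reported rank with the dimension of the submatrix shows at which point the remaining columns are no longer flagged as dependent.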

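Another approach might be to use the alias function, which reports linear dependencies among the columns (assuming dfrm is a data frame of the variables):

alias(lm(rnorm(NROW(dfrm)) ~ ., data = dfrm))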
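A small worked sketch of the alias idea, on a hypothetical data frame (the column names a, b, c and the dependency c = a + 2*b are invented for illustration). It assumes dfrm is a data frame, so the formula is written as `~ .` with `data = dfrm`:

```r
set.seed(42)

# Hypothetical data frame: column c is an exact linear combination of a and b
dfrm <- data.frame(a = rnorm(20), b = rnorm(20))
dfrm$c <- dfrm$a + 2 * dfrm$b

# Regress pure noise on all columns; the response is irrelevant,
# only the dependency structure among the predictors matters
fit <- lm(rnorm(NROW(dfrm)) ~ ., data = dfrm)

# alias() lists predictors that are linear combinations of earlier ones
al <- alias(fit)
print(al$Complete)
```

Columns that show up as rows of the `Complete` component are candidates to drop before running the PCA.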

answered Oct 10 '22 08:10

IRTFM


Tags:

r

statistics

pca

princomp

eigenvector
