how do i calculate correlation between corresponding columns of two matrices and not getting other correlations as output

Tags:

3 Answers

The first answer above calculates all pairwise correlations, which is fine unless the matrices are large, and the second one doesn't work. As far as I can tell, efficient computation must be done directly, such as this code borrowed from borrowed from the arrayMagic Bioconductor package, works efficiently for large matrices:

> colCors = function(x, y) { 
+   sqr = function(x) x*x
+   if(!is.matrix(x)||!is.matrix(y)||any(dim(x)!=dim(y)))
+     stop("Please supply two matrices of equal size.")
+   x   = sweep(x, 2, colMeans(x))
+   y   = sweep(y, 2, colMeans(y))
+   cor = colSums(x*y) /  sqrt(colSums(sqr(x))*colSums(sqr(y)))
+   return(cor)
+ }

> set.seed(1)
> a=matrix(rnorm(15),nrow=5)
> b=matrix(rnorm(15),nrow=5)
> diag(cor(a,b))
[1]  0.2491625 -0.5313192  0.5594564
> mapply(cor,a,b)
 [1] NA NA NA NA NA NA NA NA NA NA NA NA NA NA NA
> colCors(a,b)
[1]  0.2491625 -0.5313192  0.5594564

answered Oct 11 '22 12:10

user1048410

I would probably personally just use diag:

> diag(cor(a,b))
[1]  1.0000000 -1.0000000 -0.6964286

But you could also use mapply:

> mapply(cor,a,b)
         a          b          c 
 1.0000000 -1.0000000 -0.6964286

answered Oct 11 '22 12:10

Joshua Ulrich

mapply works with data frames but not matrices. That is because in data frames each column is an element, while in matrices each entry is an element.

In the answer above mapply(cor,as.data.frame(a),as.data.frame(b)) works just fine.

set.seed(1)
a=matrix(rnorm(15),nrow=5)
b=matrix(rnorm(15),nrow=5)
diag(cor(a,b))
[1]  0.2491625 -0.5313192  0.5594564
mapply(cor,as.data.frame(a),as.data.frame(b))
    V1         V2         V3 
 0.2491625 -0.5313192  0.5594564

This is much more efficient for large matrices.

answered Oct 11 '22 11:10

Cão

Related questions
                            
                                dplyr mutate_at and case_when
                            
                                How to change the order of ggplot2 facets [duplicate]
                            
                                tryCatch inside dplyr's mutate?
                            
                                Set interval between breaks on time axis
                            
                                programmatically rename columns in dplyr
                            
                                How can i specify encode in fwrite() for export csv file R?
                            
                                How do I summarise all columns except one(s) I specify?
                            
                                Custom shape in ggplot (geom_point)
                            
                                subsetting a data.table based on a named list
                            
                                Write a file using `saveRDS()` so that it is backwards compatible with old versions of R
                            
                                regex for replacement of specific character outside parenthesis only
                            
                                Recursive sum over two variables using dplyr
                            
                                Assigning group ID with ddply
                            
                                Drop lines from actual to modeled points in R
                            
                                R Plyr - Ordering results from DDPLY?
                            
                                Functions not executing before Sys.sleep()
                            
                                Linear regression in R (normal and logarithmic data)
                            
                                What's similar to an #ifdef DEBUG in R?
                            
                                Converting an ftable (contingency table) to a dataframe in R
                            
                                Error in frame() : figure margins too large

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

how do i calculate correlation between corresponding columns of two matrices and not getting other correlations as output

Tags:

r

correlation

rder

People also ask

3 Answers

user1048410

Joshua Ulrich

Cão

Recent Activity

Donate For Us