Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

Add p-values in corrplot matrix

I calculated the Spearman correlation between two matrices and I'm plotting the r values using corrplot. How can I plot only the significant correlations (so only those correlations having p value lower than 0.00 and delete those having higher p value, even if are strong correlations - high value of r). I generated the correlation matrix using corr.test in psych package, so I already have the p values in cor.matrix$p

This is the code I'm using:

library(corrplot)
library(psych)
corr.test(mydata_t1, mydata_t2, method="spearman")
M <- corrplot(cor.matrix$r, method="square",type="lower",col=col1(100),is.corr=T,mar=c(1,1,1,1),tl.cex=0.5)

How can I modify it to plot only significant corelations?

like image 612
Francesca de Filippis Avatar asked Jan 15 '14 20:01

Francesca de Filippis


People also ask

How do you find the p-value in a correlation matrix?

The correlation matrix with p-values for an R data frame can be found by using the function rcorr of Hmisc package and read the output as matrix. For example, if we have a data frame called df then the correlation matrix with p-values can be found by using rcorr(as. matrix(df)).

What should be included in a correlation matrix?

A correlation matrix consists of rows and columns that show the variables. Each cell in a table contains the correlation coefficient. In addition, the correlation matrix is frequently utilized in conjunction with other types of statistical analysis.

How do you make a Corrplot bigger in R?

The correlation coefficient value size in correlation matrix plot created by using corrplot function ranges from 0 to 1, 0 referring to the smallest and 1 referring to the largest, by default it is 1. To change this size, we need to use number. cex argument.


2 Answers

Take a look at the examples of corrplot. do ?corrplot. It has options for doing what you want. You can plot the p-values on the graph itself, which I think is better than putting stars, as people not familiar with that terminology have one more thing to look up. to put p-values on graph do this corrplot(cor.matrix$r, p.mat = cor.matrix$p, insig = "p-value") where cor.matrix is object holding the result of cor.test. The insig option can put:

  • p-values (as shown above)
  • blank out insignificant correlations with corrplot(cor.matrix$r, p.mat = cor.matrix$p, insig = "blank")`
  • Cross out (put a X on) insignificant correlations) with option corrplot(cor.matrix$r, p.mat = cor.matrix$p, insig = "pch") (DEFAULT)
  • Do nothing with to the plot, with corrplot(cor.matrix$r, p.mat = cor.matrix$p, insig = "n")

If you do want stars, p-value on the correlation matrix plot - take a look at this thread Correlation Corrplot Configuration

Though I have to say I really like @sven hohenstein's elegant subset solution.

like image 192
infominer Avatar answered Sep 22 '22 18:09

infominer


Create a copy of cor.mat and replace the corresponding correlation coefficients with zero:

cor.matrix2 <- cor.matrix

# find cells with p-values > 0.05 and replace corresponding
# correlations coefficients with zero
cor.matrix2$r[cor.matrix2$p > 0.05] <- 0

# use this matrix for corrplot
M <- corrplot(cor.matrix2$r, method="square",type="lower",col=col1(100),
              is.corr=T,mar=c(1,1,1,1),tl.cex=0.5)

The replaced values will appear as a white cell.

like image 43
Sven Hohenstein Avatar answered Sep 19 '22 18:09

Sven Hohenstein