Why Pearson correlation output is NaN?

Tags:

I'm trying to get the Pearson correlation coefficient between to variables in R. This is the scatterplot of the variables:

ggplot(results_summary, aes(x =D_in, y = D_ex)) + geom_point(col=ifelse(results_summary$FDR < 0.05, ifelse(results_summary$logF>0, "red", "green" ), "black"))

enter image description here

As you can see, the variables correlate pretty well, so I'm expecting a high correlation coefficient. However when I try to get the Pearson correlation coefficient I'm getting a NaN!

> cor(results_summary$D_in, results_summary$D_ex, method="spearman")
[1] 0.868079
> cor(results_summary$D_in, results_summary$D_ex, method="kendall")
[1] 0.6973086
> cor(results_summary$D_in, results_summary$D_ex, method="pearson")
[1] NaN

I checked if my data contains any NaN:

> nrow(subset(results_summary, is.nan(results_summary$D_ex)==TRUE)) 
[1] 0
> nrow(subset(results_summary, is.nan(results_summary$D_in)==TRUE)) 
[1] 0
> cor(results_summary$D_in, results_summary$D_ex, method="pearson", use="complete.obs")
[1] NaN

But it's seems that is not the reason of the resulting NaN. Can some one give any clue about what is might happening here?

Thanks for your time!

705

asked Aug 06 '15 11:08

Geparada

1 Answers

That seems odd. My guess is that there is some problem with the input data (which was not revealed by the check you mentioned). I suggest you running:

any(!is.finite(results_summary$D_in))

any(!is.finite(results_summary$D_ex))

You could also try calculating Pearson's correlation by hand, to try to get some insight on where the problem is (in the numerator and/or denominator?):

pearson_num = cov(results_summary$D_in, results_summary$D_ex, use="complete.obs")

pearson_den = c(sd(results_summary$D_in), sd(results_summary$D_ex))

answered Oct 29 '22 03:10

tguzella

Related questions
                            
                                Nested Model in STAN?
                            
                                How to programmatically provide a list of filters to apply via dplyr and filter_
                            
                                R: Adding row to a dataframe with multiple classes
                            
                                Using packages with multi-threading in R
                            
                                Does stargazer interpreting data.frame data as latex code constitute a bug or is this intended?
                            
                                How to set legend height to be the same as the height of the plot area?
                            
                                Combining chaining and assignment by reference in a data.table
                            
                                glmnet error for logistic regression/binomial
                            
                                Create a Triangular Matrix from a Vector performing sequential operations
                            
                                Error with dplyr group_by
                            
                                Reading aligned column data with fread
                            
                                Keep hitting the error ""loop_apply" not resolved from current namespace (plyr)" in ggplot2 with example codes
                            
                                r - tryCatch error handling within Shiny
                            
                                All paths in directed tree graph from root to leaves in igraph R
                            
                                How to compute Voronoi tesselation based on manhattan distance in R
                            
                                ggplot2, line stacking order for aesthetic mapping of variable
                            
                                Caret Neural Network Error: "missing values in resampled performance measures"
                            
                                Extremely high probability of being alive BTYD R
                            
                                Translate a vector of values using a key value mapping in R (equivalent to a HashMap)
                            
                                R inline markdown

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Why Pearson correlation output is NaN?

Tags:

r

statistics

pearson

Geparada

People also ask

1 Answers

tguzella

Recent Activity

Donate For Us