Correlation matrix in NumPy with NaN's

Tags: python, matrix, numpy, correlation

I have an n x m matrix in which row i represents the time series of the variable V_i. I would like to compute the n x n correlation matrix M, where M_{i,j} contains the correlation coefficient (Pearson's r) between V_i and V_j.

However, when I try the following in numpy:

numpy.corrcoef(numpy.matrix('5 6 7; 1 1 1'))

I get the following output:

array([[  1., nan],
       [ nan, nan]])

It seems that numpy.corrcoef doesn't like constant rows such as 1 1 1, because if I change the second row to 7 6 5, I get the expected result:

array([[  1., -1.],
       [ -1.,  1.]])

What is the reason for this kind of behavior of numpy.corrcoef?

889

asked Dec 05 '13 14:12

John Manak

1 Answer

leewangzhong (in the comments) is correct: Pearson's r is not defined for a constant time series, because its standard deviation is zero and the formula for r divides by that standard deviation. Thanks!
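To make the division by zero visible, here is a minimal sketch using the matrix from the question. The helper name pearson_r and the variable constant_rows are my own illustrative names, not part of NumPy; the helper just writes out the textbook definition of r and returns NaN when the denominator is zero, which is my choice for mirroring what numpy.corrcoef reports.

```python
import numpy as np

def pearson_r(x, y):
    """Illustrative helper (not a NumPy function): Pearson's r from its definition."""
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    dx = x - x.mean()          # deviations from the mean
    dy = y - y.mean()
    # Denominator is proportional to std(x) * std(y); it is zero for a constant series.
    denom = np.sqrt((dx ** 2).sum() * (dy ** 2).sum())
    if denom == 0.0:
        return np.nan          # r is undefined here, so report NaN
    return (dx * dy).sum() / denom

data = np.array([[5.0, 6.0, 7.0],
                 [1.0, 1.0, 1.0]])

# A row is constant when every entry equals its first entry.
constant_rows = (data == data[:, [0]]).all(axis=1)

# corrcoef performs the same 0/0 division internally; silence the
# floating-point warnings so only the NaN entries remain as evidence.
with np.errstate(invalid="ignore", divide="ignore"):
    corr = np.corrcoef(data)

print(constant_rows)                       # which rows have zero variance
print(corr)                                # NaN wherever a constant row is involved
print(pearson_r([5, 6, 7], [7, 6, 5]))     # well defined for non-constant rows
```

So the NaN entries are not a bug in numpy.corrcoef. If you need a usable matrix, one option is to check constant_rows first and drop or flag those variables before computing the correlations.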

107

answered Oct 28 '22 15:10

John Manak
