How to get correlation of two vectors in python [duplicate]

Tags:

python

numpy

In matlab I use

a=[1,4,6]
b=[1,2,3]
corr(a,b)

which returns .9934. I've tried numpy.correlate but it returns something completely different. What is the simplest way to get the correlation of two vectors?

829

asked Oct 17 '13 13:10

Luke Makk

1 Answers

The docs indicate that numpy.correlate is not what you are looking for:

numpy.correlate(a, v, mode='valid', old_behavior=False)[source]
  Cross-correlation of two 1-dimensional sequences.
  This function computes the correlation as generally defined in signal processing texts:
     z[k] = sum_n a[n] * conj(v[n+k])
  with a and v sequences being zero-padded where necessary and conj being the conjugate.

Instead, as the other comments suggested, you are looking for a Pearson correlation coefficient. To do this with scipy try:

from scipy.stats.stats import pearsonr   
a = [1,4,6]
b = [1,2,3]   
print(pearsonr(a,b))

This gives

(0.99339926779878274, 0.073186395040328034)

You can also use numpy.corrcoef:

import numpy
print(numpy.corrcoef(a,b))

This gives:

[[ 1.          0.99339927]
 [ 0.99339927  1.        ]]

190

answered Oct 19 '22 20:10

Hooked

Related questions
                            
                                Import arbitrary python source file. (Python 3.3+)
                            
                                How do I sum values in a column that match a given condition using pandas?
                            
                                pip cannot uninstall <package>: "It is a distutils installed project"
                            
                                How do I set browser width and height in Selenium WebDriver?
                            
                                Python equivalent of Java StringBuffer?
                            
                                Specify extras_require with pip install -e
                            
                                Pandas DataFrame performance
                            
                                How do you find the IQR in Numpy?
                            
                                Is it pythonic for a function to return multiple values?
                            
                                How to check if a deque is empty
                            
                                pandas: find percentile stats of a given column
                            
                                Does Python have an argc argument?
                            
                                matplotlib savefig() plots different from show()
                            
                                Connecting to a host listed in ~/.ssh/config when using Fabric
                            
                                How to open and convert sqlite database to pandas dataframe
                            
                                python pep8 class in init imported but not used
                            
                                Adding a scrollbar to a group of widgets in Tkinter
                            
                                How do I concatenate a boolean to a string in Python?
                            
                                python time + timedelta equivalent
                            
                                Confused with python lists: are they or are they not iterators?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With