I am trying to figure out how to calculate covariance with the Python Numpy function cov. When I pass it two one-dimentional arrays, I get back a 2x2 matrix of results. I don't know what to do with that. I'm not great at statistics, but I believe covariance in such a situation should be a single number. This is what I am looking for. I wrote my own: <pre class="prettyprint"><code>def cov(a, b): if len(a) != len(b): return a_mean = np.mean(a) b_mean = np.mean(b) sum = 0 for i in range(0, len(a)): sum += ((a[i] - a_mean) * (b[i] - b_mean)) return sum/(len(a)-1) </code></pre> That works, but I figure the Numpy version is much more efficient, if I could figure out how to use it. Does anybody know how to make the Numpy cov function perform like the one I wrote? Thanks, Dave

When <code>a</code> and <code>b</code> are 1-dimensional sequences, <code>numpy.cov(a,b)[0][1]</code> is equivalent to your <code>cov(a,b)</code>. The 2x2 array returned by <code>np.cov(a,b)</code> has elements equal to <pre class="prettyprint"><code>cov(a,a) cov(a,b) cov(a,b) cov(b,b) </code></pre> (where, again, <code>cov</code> is the function you defined above.)

Calculating Covariance with Python and Numpy

Tags:

python

numpy

covariance

I am trying to figure out how to calculate covariance with the Python Numpy function cov. When I pass it two one-dimentional arrays, I get back a 2x2 matrix of results. I don't know what to do with that. I'm not great at statistics, but I believe covariance in such a situation should be a single number. This is what I am looking for. I wrote my own:

Click to copy

def cov(a, b):      if len(a) != len(b):         return      a_mean = np.mean(a)     b_mean = np.mean(b)      sum = 0      for i in range(0, len(a)):         sum += ((a[i] - a_mean) * (b[i] - b_mean))      return sum/(len(a)-1)

That works, but I figure the Numpy version is much more efficient, if I could figure out how to use it.

Does anybody know how to make the Numpy cov function perform like the one I wrote?

Thanks,

Dave

560

asked Mar 10 '13 01:03

Dave

1 Answers

When a and b are 1-dimensional sequences, numpy.cov(a,b)[0][1] is equivalent to your cov(a,b).

The 2x2 array returned by np.cov(a,b) has elements equal to

Click to copy

cov(a,a)  cov(a,b)  cov(a,b)  cov(b,b)

(where, again, cov is the function you defined above.)

120

answered Sep 27 '22 19:09

unutbu

Related questions
                            
                                re.sub replace with matched content
                            
                                pandas to_csv output quoting issue
                            
                                SyntaxError: non-default argument follows default argument
                            
                                Python conversion from binary string to hexadecimal
                            
                                Decorating class methods - how to pass the instance to the decorator?
                            
                                Conditional statement in a one line lambda function in python?
                            
                                ImportError: No module named google.protobuf
                            
                                Pandas: drop columns with all NaN's
                            
                                How to perform element-wise Boolean operations on NumPy arrays [duplicate]
                            
                                How can I ignore ValueError when I try to remove an element from a list?
                            
                                Appending an id to a list if not already present in a string
                            
                                How can I get a Python generator to return None rather than StopIteration?
                            
                                How to Format dict string outputs nicely
                            
                                re.findall which returns a dict of named capturing groups?
                            
                                What do numbers starting with 0 mean in python?
                            
                                Box around text in matplotlib
                            
                                Determining what version of Flask is installed
                            
                                How to convert current date to epoch timestamp?
                            
                                flask does not see change in .js file
                            
                                Remove char at specific index - python

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Calculating Covariance with Python and Numpy

Tags:

python

numpy

covariance

Dave

People also ask

1 Answers

unutbu

Recent Activity

Donate For Us