In the NumPy v1.15 Reference Guide, the documentation for numpy.dot uses the concept of "sum product". Namely, we read the following: <blockquote> <ul> <li>If a is an N-D array and b is a 1-D array, it is a sum product over the last axis of a and b.</li> <li>If a is an N-D array and b is an M-D array (where M>=2), it is a sum product over the last axis of a and the second-to-last axis of b: <code>dot(a, b)[i,j,k,m] = sum(a[i,j,:] * b[k,:,m])</code> </li> </ul> </blockquote> What is the definition for this "sum product" concept? (Couldn't find such a definition on Wikipedia, for example.)

https://en.wikipedia.org/wiki/Matrix_multiplication <pre class="prettyprint"><code>That is, the entry c[i,j] of the product is obtained by multiplying term-by-term the entries of the ith row of A and the jth column of B, and summing these m products. In other words, c[i,j] is the dot product of the ith row of A and the jth column of B. </code></pre> https://en.wikipedia.org/wiki/Dot_product <pre class="prettyprint"><code>Algebraically, the dot product is the sum of the products of the corresponding entries of the two sequences of numbers. </code></pre> In early math classes did you learn to take the matrix product, by running one finger across the rows of <code>A</code> and down the columns of <code>B</code>, mulitplying pairs of numbers and summing them? That motion is part of my intuition of how that product is taken. <hr> For the 1d second argument case, <code>np.dot</code> and <code>np.matmul</code> produce the same thing, but describe the action differently: <ul> <li>If <code>a</code> is an N-D array and <code>b</code> is a 1-D array, it is a sum product over the last axis of <code>a</code> and <code>b</code>.</li> <li> If the second argument is 1-D, it is promoted to a matrix by appending a 1 to its dimensions. After matrix multiplication the appended 1 is removed. In [103]: np.dot([[1,2],[3,4]], [1,2]) Out[103]: array([ 5, 11]) In [104]: np.matmul([[1,2],[3,4]], [1,2]) Out[104]: array([ 5, 11]) </li> </ul> Appending the dimension to <code>B</code>, does: <pre class="prettyprint"><code>In [105]: np.matmul([[1,2],[3,4]], [[1],[2]]) Out[105]: array([[ 5], [11]]) </code></pre> This last is a (2,2) with (2,1) => (2,1) Sometimes it is clearer to express the action in <code>einsum</code> terms: <pre class="prettyprint"><code>In [107]: np.einsum('ij,j->i', [[1,2],[3,4]], [1,2]) Out[107]: array([ 5, 11]) </code></pre> <code>j</code>, the last axis of both arrays is the one that gets 'summed'.

What is the meaning of "sum product" as mentioned in Numpy documentation?

Tags:

numpy

In the NumPy v1.15 Reference Guide, the documentation for numpy.dot uses the concept of "sum product".

Namely, we read the following:

If a is an N-D array and b is a 1-D array, it is a sum product over the last axis of a and b.

If a is an N-D array and b is an M-D array (where M>=2), it is a sum product over the last axis of a and the second-to-last axis of b:
dot(a, b)[i,j,k,m] = sum(a[i,j,:] * b[k,:,m])

What is the definition for this "sum product" concept?
(Couldn't find such a definition on Wikipedia, for example.)

675

asked Oct 11 '18 15:10

JérômeL

1 Answers

https://en.wikipedia.org/wiki/Matrix_multiplication

That is, the entry c[i,j] of the product is obtained by multiplying 
term-by-term the entries of the ith row of A and the jth column of B, 
and summing these m products. In other words, c[i,j] is the dot product 
of the ith row of A and the jth column of B.

https://en.wikipedia.org/wiki/Dot_product

Algebraically, the dot product is the sum of the products of the 
corresponding entries of the two sequences of numbers.

In early math classes did you learn to take the matrix product, by running one finger across the rows of A and down the columns of B, mulitplying pairs of numbers and summing them? That motion is part of my intuition of how that product is taken.

For the 1d second argument case, np.dot and np.matmul produce the same thing, but describe the action differently:

If a is an N-D array and b is a 1-D array, it is a sum product over the last axis of a and b.
If the second argument is 1-D, it is promoted to a matrix by appending a 1 to its dimensions. After matrix multiplication the appended 1 is removed.

In [103]: np.dot([[1,2],[3,4]], [1,2]) Out[103]: array([ 5, 11]) In [104]: np.matmul([[1,2],[3,4]], [1,2]) Out[104]: array([ 5, 11])

Appending the dimension to B, does:

In [105]: np.matmul([[1,2],[3,4]], [[1],[2]])
Out[105]: 
array([[ 5],
       [11]])

This last is a (2,2) with (2,1) => (2,1)

Sometimes it is clearer to express the action in einsum terms:

In [107]: np.einsum('ij,j->i', [[1,2],[3,4]], [1,2])
Out[107]: array([ 5, 11])

j, the last axis of both arrays is the one that gets 'summed'.

answered Nov 15 '22 08:11

hpaulj

Related questions
                            
                                Why use numpy over list based on speed?
                            
                                Elegant way to compare to torch.FloatTensor on GPU
                            
                                Why do I get an overflow error multiplying Numpy product outputs?
                            
                                Indexing Numpy array with list and tuple gives different results?
                            
                                How do I visualize or plot a multidimensional tensor?
                            
                                New to Python, don't know what is wrong with my code
                            
                                Subtract two dataframe with the same name different index
                            
                                Working with binary PNG images in PIL/pillow
                            
                                Vectorized pythonic way to get count of elements greater than current element
                            
                                Permission Error: Using Image.open
                            
                                Is numpy+mkl faster than numpy?
                            
                                Pytorch - Pick best probability after softmax layer
                            
                                example usage of xt::where for xtensor C++
                            
                                Trouble with linking boost::python::numpy
                            
                                Multiple mkl packages installed in anaconda
                            
                                How to sort a NumPy array by frequency?
                            
                                How to compare equality of dataclasses holding numpy.ndarray (bool(a==b) raises ValueError)?
                            
                                Numba - does nopython mode support list of tuples?
                            
                                Vectorize numpy code with operation depending on previous value
                            
                                Make a numpy array monotonic without a Python loop

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With