Difference between numpy dot() and Python 3.5+ matrix multiplication @

People also ask

Is np.dot the same as matrix multiplication?

np.dot() returns the dot product of two arrays. For 2-D arrays, it is equivalent to matrix multiplication.

What is the difference between dot product and matrix multiplication?

Dot product is defined between two vectors. Matrix product is defined between two matrices. They are different operations between different objects.

What is the difference between np.dot and np.multiply?

np.dot computes the dot product of two arrays (for 2-D arrays, the matrix product), whereas np.multiply performs element-wise multiplication of two arrays.
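
As a small illustration of the distinction above, the sketch below applies both functions to the same pair of 2x2 arrays. The arrays A and B are made up for this example and do not come from the original answers.

```python
import numpy as np

# Two small example matrices (illustrative values only)
A = np.array([[1, 2], [3, 4]])
B = np.array([[5, 6], [7, 8]])

matrix_product = np.dot(A, B)     # rows of A combined with columns of B
elementwise = np.multiply(A, B)   # same result as A * B

print(matrix_product)
print(elementwise)
```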

What is the difference between numpy array and numpy matrix?

Numpy matrices are strictly 2-dimensional, while numpy arrays (ndarrays) are N-dimensional. Matrix objects are a subclass of ndarray, so they inherit all the attributes and methods of ndarrays.

The @ operator calls the array's __matmul__ method, not dot. This method is also present in the API as the function np.matmul.

>>> import numpy as np
>>> a = np.random.rand(8,13,13)
>>> b = np.random.rand(8,13,13)
>>> np.matmul(a, b).shape
(8, 13, 13)
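
To make the contrast with dot concrete, here is a minimal sketch using arrays of the same shape as above. The seeded generator and the variable names are choices made for this example; the point is only that @ and np.matmul agree, while np.dot returns an array of a different shape.

```python
import numpy as np

rng = np.random.default_rng(0)   # seeded only to make the run repeatable
a = rng.random((8, 13, 13))
b = rng.random((8, 13, 13))

via_operator = a @ b             # dispatches to __matmul__
via_function = np.matmul(a, b)
via_dot = np.dot(a, b)

print(via_operator.shape)        # (8, 13, 13)
print(via_dot.shape)             # (8, 13, 8, 13)
print(np.allclose(via_operator, via_function))
```

With dot, every matrix in a is multiplied with every matrix in b, which is why two extra axes appear in the result.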

From the documentation:

matmul differs from dot in two important ways.

Multiplication by scalars is not allowed.

Stacks of matrices are broadcast together as if the matrices were elements.
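
Both points from the documentation can be checked directly. The following is a sketch with made-up shapes: it shows that np.dot accepts a scalar while np.matmul rejects it, and that matmul broadcasts the leading "stack" axes.

```python
import numpy as np

a = np.arange(6.0).reshape(2, 3)

# 1. Scalars: dot accepts them (and behaves like a * 2), matmul does not.
scaled = np.dot(a, 2)
try:
    np.matmul(a, 2)
    matmul_error = None
except ValueError as exc:
    matmul_error = exc
print(type(matmul_error).__name__)

# 2. Stacks: leading axes (3, 1) and (5,) broadcast to (3, 5);
#    the trailing (2, 4) and (4, 3) matrices are multiplied pairwise.
x = np.ones((3, 1, 2, 4))
y = np.ones((5, 4, 3))
stacked = np.matmul(x, y)
print(stacked.shape)             # (3, 5, 2, 3)
```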

The last point makes it clear that dot and matmul methods behave differently when passed 3D (or higher dimensional) arrays. Quoting from the documentation some more:

For matmul:

If either argument is N-D, N > 2, it is treated as a stack of matrices residing in the last two indexes and broadcast accordingly.

For np.dot:

For 2-D arrays it is equivalent to matrix multiplication, and for 1-D arrays to inner product of vectors (without complex conjugation). For N dimensions it is a sum product over the last axis of a and the second-to-last of b
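
The quoted rule for np.dot can be spot-checked numerically. The sketch below uses arbitrary shapes chosen for this example and compares one entry of the result with the explicit sum; it also shows what "without complex conjugation" means by contrasting np.dot with np.vdot on 1-D complex arrays.

```python
import numpy as np

rng = np.random.default_rng(1)
a = rng.random((2, 3, 4))
b = rng.random((5, 4, 6))

d = np.dot(a, b)                 # shape (2, 3, 5, 6)

# Documented rule: dot(a, b)[i, j, k, m] = sum(a[i, j, :] * b[k, :, m])
i, j, k, m = 1, 2, 3, 4
manual = np.sum(a[i, j, :] * b[k, :, m])
print(np.isclose(d[i, j, k, m], manual))

# 1-D case: dot does not conjugate its first argument, vdot does.
u = np.array([1j, 2])
v = np.array([1j, 3])
plain = np.dot(u, v)             # (1j * 1j) + 2 * 3 = 5
conj = np.vdot(u, v)             # (-1j * 1j) + 2 * 3 = 7
print(plain, conj)
```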

The answer by @ajcr explains how dot and matmul (invoked by the @ symbol) differ. A simple example shows clearly how the two behave differently when operating on 'stacks of matrices' or tensors.

To clarify the differences, take a 4x4 array and compute its dot product and matmul product with a 3x4x2 'stack of matrices' or tensor.

import numpy as np
fourbyfour = np.array([
                       [1,2,3,4],
                       [3,2,1,4],
                       [5,4,6,7],
                       [11,12,13,14]
                      ])


threebyfourbytwo = np.array([
                             [[2,3],[11,9],[32,21],[28,17]],
                             [[2,3],[1,9],[3,21],[28,7]],
                             [[2,3],[1,9],[3,21],[28,7]],
                            ])

print('4x4*3x4x2 dot:\n {}\n'.format(np.dot(fourbyfour,threebyfourbytwo)))
print('4x4*3x4x2 matmul:\n {}\n'.format(np.matmul(fourbyfour,threebyfourbytwo)))

The products of each operation appear below. Notice how the dot product is,

...a sum product over the last axis of a and the second-to-last of b

and how the matmul product is formed by broadcasting the 4x4 matrix across the stack of three 4x2 matrices.

4x4*3x4x2 dot:
 [[[232 152]
  [125 112]
  [125 112]]

 [[172 116]
  [123  76]
  [123  76]]

 [[442 296]
  [228 226]
  [228 226]]

 [[962 652]
  [465 512]
  [465 512]]]

4x4*3x4x2 matmul:
 [[[232 152]
  [172 116]
  [442 296]
  [962 652]]

 [[125 112]
  [123  76]
  [228 226]
  [465 512]]

 [[125 112]
  [123  76]
  [228 226]
  [465 512]]]
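
Comparing the two printouts, the same numbers appear in both results but along different axes. The sketch below rebuilds the two arrays from the example and checks that, for this particular pairing of a 2-D array with a 3-D stack, the matmul result is the dot result with its first two axes swapped. The names d and m are introduced here for illustration.

```python
import numpy as np

fourbyfour = np.array([
    [1, 2, 3, 4],
    [3, 2, 1, 4],
    [5, 4, 6, 7],
    [11, 12, 13, 14],
])

threebyfourbytwo = np.array([
    [[2, 3], [11, 9], [32, 21], [28, 17]],
    [[2, 3], [1, 9], [3, 21], [28, 7]],
    [[2, 3], [1, 9], [3, 21], [28, 7]],
])

d = np.dot(fourbyfour, threebyfourbytwo)      # shape (4, 3, 2)
m = np.matmul(fourbyfour, threebyfourbytwo)   # shape (3, 4, 2)

# d[i, n, c] and m[n, i, c] are the same sum over the shared axis
print(d.shape, m.shape)
print(np.array_equal(m, d.transpose(1, 0, 2)))
```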

Just FYI, @ and its numpy equivalents dot and matmul are all about equally fast in this matrix-vector benchmark. (Plot created with perfplot, a project of mine.)

[Plot: runtime of @, numpy.dot, and numpy.matmul against n; image not available]

Code to reproduce the plot:

import perfplot
import numpy


def setup(n):
    A = numpy.random.rand(n, n)
    x = numpy.random.rand(n)
    return A, x


def at(data):
    A, x = data
    return A @ x


def numpy_dot(data):
    A, x = data
    return numpy.dot(A, x)


def numpy_matmul(data):
    A, x = data
    return numpy.matmul(A, x)


perfplot.show(
    setup=setup,
    kernels=[at, numpy_dot, numpy_matmul],
    n_range=[2 ** k for k in range(15)],
)
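
If perfplot is not installed, a rough comparison can be made with the standard-library timeit module. This is only a sketch for a single, arbitrarily chosen size n = 256, not a reproduction of the plot; absolute numbers will vary by machine and BLAS build.

```python
import timeit
import numpy as np

rng = np.random.default_rng(0)
n = 256                          # arbitrary size chosen for this sketch
A = rng.random((n, n))
x = rng.random(n)

kernels = {
    "A @ x": lambda: A @ x,
    "np.dot": lambda: np.dot(A, x),
    "np.matmul": lambda: np.matmul(A, x),
}

calls = 200
# Best of 5 repeats, each timing `calls` invocations
timings = {
    name: min(timeit.repeat(fn, number=calls, repeat=5))
    for name, fn in kernels.items()
}

for name, seconds in timings.items():
    print(f"{name:10s} {seconds / calls * 1e6:8.2f} us per call")
```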

Mathematically, I think numpy's dot is the more natural definition:

$\mathrm{dot}(a,b)_{i,j,k,p,q,c} = \sum_m a_{i,j,k,m}\, b_{p,q,m,c}$

since it reduces to the dot product when a and b are vectors, and to matrix multiplication when a and b are matrices.

The matmul operation in numpy, by contrast, returns only part of the dot result. It can be defined as

$\mathrm{matmul}(a,b)_{i,j,k,c} = \sum_m a_{i,j,k,m}\, b_{i,j,m,c}$

So matmul(a,b) returns a smaller array, which uses less memory and makes more sense in applications. In particular, combined with broadcasting, you can get

$\mathrm{matmul}(a,b)_{i,j,k,l} = \sum_m a_{i,j,k,m}\, b_{j,m,l}$

for example.

From the two definitions above, you can read off the requirements for each operation. Assume a.shape=(s1,s2,s3,s4) and b.shape=(t1,t2,t3,t4).

To use dot(a,b) you need
1. t3=s4

To use matmul(a,b) you need
1. t3=s4
2. t2=s2, or one of t2 and s2 is 1
3. t1=s1, or one of t1 and s1 is 1
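
These shape requirements can be tried out directly. In the sketch below the concrete sizes are made up for illustration: b_ok satisfies all three matmul conditions through broadcasting, while b_bad only satisfies t3=s4, so dot accepts it and matmul raises an error.

```python
import numpy as np

a = np.ones((5, 6, 2, 4))        # (s1, s2, s3, s4)

b_ok = np.ones((1, 6, 4, 3))     # t1 = 1 broadcasts against s1 = 5
b_bad = np.ones((7, 6, 4, 3))    # t1 = 7 matches neither s1 = 5 nor 1

# dot only needs t3 == s4, so it works with b_bad
dot_shape = np.dot(a, b_bad).shape          # (5, 6, 2, 7, 6, 3)

# matmul additionally needs the leading axes to broadcast
matmul_shape = np.matmul(a, b_ok).shape     # (5, 6, 2, 3)

try:
    np.matmul(a, b_bad)
    matmul_failed = False
except ValueError:
    matmul_failed = True

print(dot_shape, matmul_shape, matmul_failed)
```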

Use the following piece of code to convince yourself.

Code sample

import numpy as np

for it in range(10000):
    a = np.random.rand(5, 6, 2, 4)
    b = np.random.rand(6, 4, 3)
    c = np.matmul(a, b)  # shape (5, 6, 2, 3)
    d = np.dot(a, b)     # shape (5, 6, 2, 6, 3)
    # print('c shape:', c.shape, 'd shape:', d.shape)

    for i in range(5):
        for j in range(6):
            for k in range(2):
                for l in range(3):
                    # compare with a tolerance, since the two routines may round differently
                    if not np.isclose(c[i, j, k, l], d[i, j, k, j, l]):
                        print(it, i, j, k, l, c[i, j, k, l], d[i, j, k, j, l])  # you will not see them
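
The same check can be written without the nested loops. This is a sketch of one possible vectorized form, not part of the original answer: it uses np.einsum with a repeated index to pick out the entries d[i, j, k, j, l] and compares them with c in one call.

```python
import numpy as np

rng = np.random.default_rng(0)
a = rng.random((5, 6, 2, 4))
b = rng.random((6, 4, 3))

c = np.matmul(a, b)              # shape (5, 6, 2, 3)
d = np.dot(a, b)                 # shape (5, 6, 2, 6, 3)

# Repeating j in the input subscripts selects the "diagonal" d[i, j, k, j, l]
d_diag = np.einsum('ijkjl->ijkl', d)

print(d_diag.shape)
print(np.allclose(c, d_diag))
```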

Here is a comparison with np.einsum to show how the indices are projected (each line assumes a and b have shapes compatible with its subscripts):

np.allclose(np.einsum('ijk,ijk->ijk', a,b), a*b)        # True 
np.allclose(np.einsum('ijk,ikl->ijl', a,b), a@b)        # True
np.allclose(np.einsum('ijk,lkm->ijlm',a,b), a.dot(b))   # True
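
Since the three lines need differently shaped operands, a runnable version has to define a and b separately for each case. The shapes below are arbitrary choices for this sketch; any shapes matching the subscripts would do.

```python
import numpy as np

rng = np.random.default_rng(0)

# Element-wise product: both operands share the shape (i, j, k)
a = rng.random((2, 3, 4))
b = rng.random((2, 3, 4))
elementwise_ok = np.allclose(np.einsum('ijk,ijk->ijk', a, b), a * b)

# matmul: the batch index i is shared, k is summed over
a = rng.random((2, 3, 4))
b = rng.random((2, 4, 5))
matmul_ok = np.allclose(np.einsum('ijk,ikl->ijl', a, b), a @ b)

# dot: no shared batch index; k is summed, l and m both come from b
a = rng.random((2, 3, 4))
b = rng.random((6, 4, 5))
dot_ok = np.allclose(np.einsum('ijk,lkm->ijlm', a, b), a.dot(b))

print(elementwise_ok, matmul_ok, dot_ok)
```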

