I am a bit confused about the naming of the NumPy function dot: https://docs.scipy.org/doc/numpy-1.13.0/reference/generated/numpy.dot.html
It seems like this is what is used in NumPy to perform matrix multiplication. However, the "dot product" is something different: it produces a single scalar from two vectors (https://en.wikipedia.org/wiki/Dot_product).
Can somebody reconcile these two uses of the term "dot"?
Multiplication of two matrices involves dot products between the rows of the first matrix and the columns of the second matrix. The first step is the dot product between the first row of A and the first column of B. The result of this dot product is the element of the resulting matrix at position [0, 0] (i.e. first row, first column).
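For example, a quick check with small arrays (the values are arbitrary, just for illustration):

    import numpy as np

    A = np.array([[1, 2],
                  [3, 4]])
    B = np.array([[5, 6],
                  [7, 8]])

    # Element [0, 0] of the product is the dot product of A's first row
    # with B's first column: 1*5 + 2*7 = 19
    print(np.dot(A[0, :], B[:, 0]))  # 19
    print(np.dot(A, B)[0, 0])        # 19, the same value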
np.dot() computes the dot product of its inputs. If both inputs are 2-dimensional arrays, np.dot() will perform matrix multiplication.
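A minimal sketch of both behaviours:

    import numpy as np

    v = np.array([1, 2, 3])
    w = np.array([4, 5, 6])
    print(np.dot(v, w))  # 32 -- 1-d inputs give a single scalar (1*4 + 2*5 + 3*6)

    A = np.arange(4).reshape(2, 2)
    I = np.eye(2)
    print(np.dot(A, I))  # 2-d inputs give full matrix multiplication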
numpy.dot(vector_a, vector_b, out=None) returns the dot product of vectors a and b. It can handle 2-D arrays, but it treats them as matrices and performs matrix multiplication. For N dimensions it is a sum-product over the last axis of a and the second-to-last axis of b: dot(a, b)[i,j,k,m] = sum(a[i,j,:] * b[k,:,m])
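You can spot-check that documented identity; the shapes below are arbitrary, chosen only so the contracted axes line up:

    import numpy as np

    rng = np.random.default_rng(0)
    a = rng.random((2, 3, 4))  # last axis has length 4
    b = rng.random((5, 4, 6))  # second-to-last axis has length 4

    out = np.dot(a, b)
    print(out.shape)  # (2, 3, 5, 6)

    # Verify the identity for one index combination:
    i, j, k, m = 1, 2, 3, 4
    print(np.isclose(out[i, j, k, m], np.sum(a[i, j, :] * b[k, :, m])))  # True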
np.dot is the dot product of two matrices, whereas np.multiply does an element-wise multiplication of two matrices.
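For instance:

    import numpy as np

    A = np.array([[1, 2],
                  [3, 4]])
    B = np.array([[10, 20],
                  [30, 40]])

    print(np.dot(A, B))       # matrix product: [[ 70 100] [150 220]]
    print(np.multiply(A, B))  # element-wise:   [[ 10  40] [ 90 160]]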
It is common to write multiplication using a centrally positioned dot:
A⋅B
The name almost certainly comes from this notation. There is actually a dedicated codepoint for it, named DOT OPERATOR, under the "Mathematical Operators" block of Unicode: chr(0x22c5). The comments mention this as "...for denotation of multiplication".
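You can confirm this from Python itself:

    import unicodedata

    ch = chr(0x22C5)
    print(ch)                    # ⋅
    print(unicodedata.name(ch))  # DOT OPERATOR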
Now, regarding this comment:
However the "dot product" is something different, it produces a single scalar from two vectors
They are not altogether unrelated! In a 2-d matrix multiplication A⋅B, the element at position (i, j) in the result comes from a dot product of row i of A with column j of B.
The "dot" or "inner" product has an expanded definition in tensor algebra compared to linear algebra.
For an n-dimensional tensor N and an m-dimensional tensor M, N⋅M will have dimension (n + m - 2), as the "inner dimensions" (the last dimension of N and the first dimension of M) are summed over after multiplication.
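For example, with a 3-d N and a 2-d M (shapes chosen arbitrarily so the inner dimensions match):

    import numpy as np

    N = np.ones((2, 3, 4))  # n = 3
    M = np.ones((4, 5))     # m = 2

    out = np.dot(N, M)
    print(out.ndim)   # 3, i.e. n + m - 2 = 3 + 2 - 2
    print(out.shape)  # (2, 3, 5)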
(as an aside, N.dot(M) actually sums over the last dimension of N and the second-to-last dimension of M, because... reasons. The actual tensor-algebra "dot product" functionality is relegated to np.tensordot.)
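A sketch of the difference, using throwaway arrays of ones:

    import numpy as np

    N = np.ones((2, 3, 4))

    # np.dot pairs N's last axis with M's *second-to-last* axis:
    M_dot = np.ones((7, 4, 6))
    print(np.dot(N, M_dot).shape)  # (2, 3, 7, 6)

    # The tensor-algebra contraction (N's last axis with M's *first* axis)
    # is np.tensordot with axes=1, i.e. contract one axis pair:
    M_tensor = np.ones((4, 7, 6))
    print(np.tensordot(N, M_tensor, axes=1).shape)  # (2, 3, 7, 6)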
For n = m = 1 (i.e. both are 1-d tensors, or "vectors"), the output is a 0-d tensor, or "scalar", and this is equivalent to the "scalar" or "dot" product from linear algebra.
For n = m = 2 (i.e. both are 2-d tensors, or "matrices"), the output is a 2-d tensor, or "matrix", and the operation is equivalent to "matrix multiplication".
And for n = 2, m = 1, the output is a 1-d tensor, or "vector", and this is called (at least by my professors) "mapping".
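All three cases in code:

    import numpy as np

    v = np.array([1, 2, 3])
    A = np.arange(9).reshape(3, 3)

    print(np.dot(v, v).ndim)  # 0: vector ⋅ vector -> scalar
    print(np.dot(A, A).ndim)  # 2: matrix ⋅ matrix -> matrix
    print(np.dot(A, v).ndim)  # 1: matrix ⋅ vector -> vector ("mapping")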
Note that since the order of dimensions is relevant to the construction of the inner product, N.dot(M) == M.dot(N) is not generally True unless n = m = 1 - i.e. only in the case of the scalar product.
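For example:

    import numpy as np

    A = np.array([[1, 2],
                  [3, 4]])
    B = np.array([[0, 1],
                  [1, 0]])
    print(np.array_equal(A.dot(B), B.dot(A)))  # False for these matrices

    v = np.array([1, 2])
    w = np.array([3, 4])
    print(v.dot(w) == w.dot(v))  # True: the scalar product commutes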