Is there any difference between matmul and usual multiplication of tensors?

I am confused about the difference between multiplying two tensors with * and with matmul. Below is my code

import torch
torch.manual_seed(7)
features = torch.randn((2, 5))
weights = torch.randn_like(features)

Here, I want to multiply weights and features. One way to do it is as follows:

print(torch.sum(features * weights))

Output:

tensor(-2.6123)

Another way is to use matmul:

print(torch.mm(features,weights.view((5,2))))

but here the output is

tensor([[ 2.8089,  4.6439],
        [-2.3988, -1.9238]])

What I don't understand is why matmul and the usual multiplication give different outputs, since I expected them to be the same. Am I doing anything wrong here?

Edit: When I use features of shape (1, 5), both * and matmul give the same output, but the results differ when the shape is (2, 5).

asked Nov 08 '18 by InAFlash


People also ask

What is Matmul operation?

The MatMul operation takes two tensors and performs the usual matrix-matrix, matrix-vector, or vector-matrix multiplication, depending on the argument shapes. Input tensors can have any rank >= 1.
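
For instance, a minimal PyTorch sketch (shapes and variable names are illustrative) of how torch.matmul dispatches on argument rank:

import torch

A = torch.randn(3, 4)   # rank-2 tensor (matrix)
v = torch.randn(4)      # rank-1 tensor (vector)

print(torch.matmul(A, A.T).shape)  # matrix-matrix: torch.Size([3, 3])
print(torch.matmul(A, v).shape)    # matrix-vector: torch.Size([3])
print(torch.matmul(v, A.T).shape)  # vector-matrix: torch.Size([3])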

Which operator is used to perform matrix multiplication on tensors?

Tensor Hadamard Product: As with matrices, the element-wise operation is referred to as the Hadamard product to differentiate it from tensor (matrix) multiplication. The "o" operator is often used to indicate the Hadamard product between tensors. In NumPy, we can multiply tensors element-wise simply by multiplying the arrays.
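
As an illustration (the array values are arbitrary), element-wise multiplication of NumPy arrays with *:

import numpy as np

a = np.array([[1, 2, 3], [4, 5, 6]])
b = np.array([[10, 20, 30], [40, 50, 60]])

# The * operator computes the Hadamard (element-wise) product.
print(a * b)  # [[ 10  40  90]
              #  [160 250 360]]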

What is Tensorflow Matmul?

This is simply matrix multiplication; it is achieved using the "linalg.matmul" function available in TensorFlow. It returns the matrix product, e.g. multiplying matrix "a" by matrix "b" produces the matrix product of a and b.
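
A minimal sketch (values are arbitrary) using tf.linalg.matmul, for which tf.matmul is an alias:

import tensorflow as tf

a = tf.constant([[1., 2.], [3., 4.]])
b = tf.constant([[5., 6.], [7., 8.]])

# Matrix product of a and b, shape (2, 2)
print(tf.linalg.matmul(a, b))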

What is NP Matmul?

The numpy.matmul() function returns the matrix product of two arrays. It returns an ordinary matrix product for 2-D arrays; if either argument has more than two dimensions, it is treated as a stack of matrices residing in the last two indices and is broadcast accordingly.
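
A short sketch of both behaviours (the shapes are illustrative):

import numpy as np

a = np.random.rand(2, 3)
b = np.random.rand(3, 4)
print(np.matmul(a, b).shape)  # ordinary 2-D matrix product: (2, 4)

# With more than two dimensions, the last two axes are matrix-multiplied
# and the leading axes are broadcast as a stack of matrices.
stack_a = np.random.rand(10, 2, 3)
stack_b = np.random.rand(10, 3, 4)
print(np.matmul(stack_a, stack_b).shape)  # (10, 2, 4)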


1 Answer

When you use *, the multiplication is element-wise; when you use torch.mm, it is matrix multiplication.

Example:

a = torch.rand(2,5)
b = torch.rand(2,5)
result = a*b 

result will have the same shape as a and b, i.e. (2, 5), whereas the operation

result = torch.mm(a,b)

will give a size mismatch error, because this is proper matrix multiplication (as studied in linear algebra) and a.shape[1] != b.shape[0]. When you apply the view operation before torch.mm, you are reshaping the tensor so that the dimensions match (see the sketch below).
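
A minimal sketch of those shape rules (the tensor values are random, so the numbers themselves are not meaningful):

import torch

a = torch.rand(2, 5)
b = torch.rand(2, 5)

# torch.mm requires a.shape[1] == b.shape[0], so torch.mm(a, b) would raise
# a RuntimeError here (5 != 2).
# Reshaping b to (5, 2) makes the shapes compatible and yields a (2, 2) result,
# but note that view() only rearranges the existing elements; it is not a transpose.
print(torch.mm(a, b.view(5, 2)).shape)   # torch.Size([2, 2])
print(torch.equal(b.view(5, 2), b.t()))  # False in general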

In the special case where one of the dimensions is 1, the matrix product reduces to a dot product, and hence sum(a * b) is the same as mm(a, b.view(5, 1)).
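
A quick sketch of that special case, using a (1, 5) shape as in the question's edit:

import torch
torch.manual_seed(7)
features = torch.randn((1, 5))
weights = torch.randn_like(features)

# Both expressions compute the same dot product; the second one
# just returns it wrapped in a (1, 1) tensor.
print(torch.sum(features * weights))
print(torch.mm(features, weights.view(5, 1)))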

answered Oct 17 '22 by Umang Gupta