How can I matrix-multiply two PyTorch quantized Tensors?

Tags:

I am new to tensor quantization, and tried doing something as simple as

import torch
x = torch.rand(10, 3)
y = torch.rand(10, 3)

[email protected]

with PyTorch quantized tensors running on CPU. I thus tried

scale, zero_point = 1e-4, 2
dtype = torch.qint32
qx = torch.quantize_per_tensor(x, scale, zero_point, dtype)
qy = torch.quantize_per_tensor(y, scale, zero_point, dtype)

[email protected] # I tried...

..and got as error

RuntimeError: Could not run 'aten::mm' with arguments from the 'QuantizedCPUTensorId' backend. 'aten::mm' is only available for these backends: [CUDATensorId, SparseCPUTensorId, VariableTensorId, CPUTensorId, SparseCUDATensorId].

Is matrix multiplication just not supported, or am I doing something wrong?

390

asked Feb 20 '20 17:02

Davide Fiocco

1 Answers

It is not straight forward to implement matrix multiplication for quantized matrices. Therefore, the "conventional" matrix multiplication (@) does not support it (as your error message suggests).

You should look at quantized operations, e.g., torch.nn.quantized.functional.linear:

torch.nn.quantized.functional.linear(qx[None,...], qy.T)

126

answered Oct 21 '22 16:10

Shai

Related questions
                            
                                Pytorch - inference all images and back-propagate batch by batch
                            
                                How Batch learning in Pytorch is performed?
                            
                                AttributeError: module 'torch' has no attribute 'hub'
                            
                                How to Multi-Head learning
                            
                                How can I build an LSTM AutoEncoder with PyTorch?
                            
                                Can you reverse a PyTorch neural network and activate the inputs from the outputs?
                            
                                How can I load a partial pretrained pytorch model?
                            
                                How to use the past with HuggingFace Transformers GPT-2?
                            
                                PyTorch how to compute second order Jacobian?
                            
                                What is hp_metric in TensorBoard and how to get rid of it?
                            
                                IDE autocomplete for pytorch
                            
                                multi-variable linear regression with pytorch
                            
                                Simple LSTM in PyTorch with Sequential module
                            
                                How can I install python modules in a docker image?
                            
                                what is the fastest way of loading images?
                            
                                pytorch variable index lost one dimension
                            
                                Running through a dataloader in Pytorch using Google Colab
                            
                                How to load a checkpoint file in a pytorch model?
                            
                                Unpickling saved pytorch model throws AttributeError: Can't get attribute 'Net' on <module '__main__' despite adding class definition inline
                            
                                pytorch debugging timeout with PyCharm

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How can I matrix-multiply two PyTorch quantized Tensors?

Tags:

pytorch

matrix-multiplication

quantization

Davide Fiocco

People also ask

1 Answers

Shai

Recent Activity

Donate For Us