We can allocate a tensor on the GPU using torch.Tensor([1., 2.], device='cuda'). Are there any differences between that approach and torch.cuda.Tensor([1., 2.]), except that the former lets us pass a specific CUDA device? In other words, in which scenario is torch.cuda.Tensor() necessary?
torch.tensor infers the dtype automatically from the data, while torch.Tensor always returns a torch.FloatTensor.
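A quick sketch of that difference (assuming a reasonably recent PyTorch version; expected output is noted in the comments):
import torch
# torch.tensor picks the dtype from the data it is given
print(torch.tensor([1, 2]).dtype)    # torch.int64
print(torch.tensor([1., 2.]).dtype)  # torch.float32
# torch.Tensor is the legacy constructor and always yields float32
print(torch.Tensor([1, 2]).dtype)    # torch.float32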
torch.cuda is used to set up and run CUDA operations. It keeps track of the currently selected GPU, and all CUDA tensors you allocate will by default be created on that device. The selected device can be changed with a torch.cuda.device context manager.
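For illustration, a minimal sketch of switching the selected device with the torch.cuda.device context manager (the second part assumes a machine with at least two GPUs):
import torch
if torch.cuda.is_available():
    # allocated on the currently selected device, cuda:0 by default
    a = torch.cuda.FloatTensor([1., 2.])
    print(a.device)  # cuda:0
    # temporarily select another GPU; tensors allocated inside go there
    with torch.cuda.device(1):
        b = torch.cuda.FloatTensor([1., 2.])
        print(b.device)  # cuda:1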
cuda() and to('cuda') do the same thing, but the latter is more flexible. As you can see in your example code, you can specify a device that falls back to 'cpu' if CUDA is unavailable.
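A minimal sketch of that flexibility (runs on both CPU-only and CUDA machines):
import torch
device = torch.device('cuda' if torch.cuda.is_available() else 'cpu')
x = torch.rand(3)
y = x.to(device)      # works everywhere, falls back to CPU
print(y.device)
if torch.cuda.is_available():
    z = x.cuda()      # only works when a CUDA device is present
    print(z.device)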
So generally both torch.Tensor and torch.cuda.Tensor are equivalent: you can do everything you like with either of them.
The key difference is just that torch.Tensor occupies CPU memory while torch.cuda.Tensor occupies GPU memory. Of course, operations on a CPU tensor are computed on the CPU, while operations on a GPU / CUDA tensor are computed on the GPU.
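A short sketch of where each type lives (the GPU part assumes a CUDA-capable machine):
import torch
cpu_t = torch.FloatTensor([1., 2.])           # allocated in CPU memory
print(cpu_t.device)                           # cpu
if torch.cuda.is_available():
    gpu_t = torch.cuda.FloatTensor([1., 2.])  # allocated in GPU memory
    print(gpu_t.device)                       # cuda:0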
The reason you need these two tensor types is that the underlying hardware interfaces are completely different. Apart from the fact that it makes no sense computationally, you will get an error as soon as you try to do computations between a torch.Tensor and a torch.cuda.Tensor:
import torch
# device will be 'cuda' if a GPU is available
device = torch.device('cuda' if torch.cuda.is_available() else 'cpu')
# creating a CPU tensor
cpu_tensor = torch.rand(10)
# moving the same tensor to the GPU (or keeping it on CPU if no GPU is available)
gpu_tensor = cpu_tensor.to(device)
print(cpu_tensor, cpu_tensor.dtype, type(cpu_tensor), cpu_tensor.type())
print(gpu_tensor, gpu_tensor.dtype, type(gpu_tensor), gpu_tensor.type())
print(cpu_tensor*gpu_tensor)
Output:
tensor([0.8571, 0.9171, 0.6626, 0.8086, 0.6440, 0.3682, 0.9920, 0.4298, 0.0172,
0.1619]) torch.float32 <class 'torch.Tensor'> torch.FloatTensor
tensor([0.8571, 0.9171, 0.6626, 0.8086, 0.6440, 0.3682, 0.9920, 0.4298, 0.0172,
0.1619], device='cuda:0') torch.float32 <class 'torch.Tensor'> torch.cuda.FloatTensor
---------------------------------------------------------------------------
RuntimeError Traceback (most recent call last)
<ipython-input-15-ac794171c178> in <module>()
12 print(gpu_tensor, gpu_tensor.dtype, type(gpu_tensor), gpu_tensor.type())
13
---> 14 print(cpu_tensor*gpu_tensor)
RuntimeError: Expected object of type torch.FloatTensor but found type torch.cuda.FloatTensor for argument #2 'other'
As the underlying hardware interfaces are completely different, CPU tensors are only compatible with CPU tensors and, vice versa, GPU tensors are only compatible with GPU tensors.
Edit:
As you can see here, a tensor that is moved to the GPU is actually a tensor of type torch.cuda.*Tensor, i.e. torch.cuda.FloatTensor.
So cpu_tensor.to(device) or torch.Tensor([1., 2.], device='cuda') will actually return a tensor of type torch.cuda.FloatTensor.
In which scenario is torch.cuda.Tensor() necessary?
When you want to use GPU acceleration (which is much faster in most cases) for your program, you need to use torch.cuda.Tensor, but you have to make sure that ALL the tensors you are using are CUDA tensors; mixing is not possible here.
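A small sketch of the usual fix for mixed tensors: move every operand to the same device before combining them.
import torch
device = torch.device('cuda' if torch.cuda.is_available() else 'cpu')
a = torch.rand(10)
b = torch.rand(10)
# move both operands to the same device, then the operation succeeds
a = a.to(device)
b = b.to(device)
print(a * b)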