
What is the difference between model.to(device) and model=model.to(device)?

Tags:

python

pytorch

Suppose the model is originally stored on CPU, and then I want to move it to GPU0, then I can do:

device = torch.device('cuda:0')
model = model.to(device)
# or
model.to(device)

What is the difference between those two lines?

asked Jan 02 '20 by Obsidian

People also ask

What is to device in PyTorch?

torch.device lets you specify the device on which a tensor should be allocated. It accepts a string naming the device type, optionally followed by an ordinal device index (e.g. 'cuda:0'); if you leave the index out, PyTorch uses the current default device of that type.
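As a small illustration of the accepted device specifications (this runs without a GPU, since it only constructs device objects):

```python
import torch

# Two equivalent ways to name the same CUDA device:
d1 = torch.device('cuda:0')      # device type plus ordinal in one string
d2 = torch.device('cuda', 0)     # device type and index as separate arguments
print(d1 == d2)                  # True

# With no index given, the device refers to the current default of that type.
cpu = torch.device('cpu')
print(cpu.type, cpu.index)       # cpu None
```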

What does model cuda do?

torch.cuda is used to set up and run CUDA operations. It keeps track of the currently selected GPU, and all CUDA tensors you allocate will by default be created on that device. The selected device can be changed with a torch.cuda.device context manager.

How do I send a model to cuda?

An alternative way to send the model to a specific device is model.to(torch.device('cuda:0')). This, of course, is subject to the device visibility specified in the environment variable CUDA_VISIBLE_DEVICES. You can check GPU usage with nvidia-smi.


2 Answers

There is no semantic difference: nn.Module.to moves the model's parameters and buffers to the device in place and returns the model itself, so both lines have the same effect.

But be cautious: to does not behave the same way on tensors as it does on models.

For tensors (documentation):

# tensor a is on the CPU
device = torch.device('cuda:0')
b = a.to(device)
# a is still on the CPU!
# b is on the GPU!
# a and b are different tensors

For models (documentation):

# model a is on the CPU
device = torch.device('cuda:0')
b = a.to(device)
# a and b are now both on the GPU
# a and b refer to the same model object
answered Sep 18 '22 by youkaichao


Citing the documentation on to:

When loading a model on a GPU that was trained and saved on GPU, simply convert the initialized model to a CUDA optimized model using model.to(torch.device('cuda')). Also, be sure to use the .to(torch.device('cuda')) function on all model inputs to prepare the data for the model. Note that calling my_tensor.to(device) returns a new copy of my_tensor on GPU. It does NOT overwrite my_tensor. Therefore, remember to manually overwrite tensors: my_tensor = my_tensor.to(torch.device('cuda')).

Mostly, when using to on a torch.nn.Module, it does not matter whether or not you save the return value (and, as a micro-optimization, it is actually slightly better not to). When used on a torch tensor, you must save the return value, since you actually receive a copy of the tensor.

Ref: Pytorch to()
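Putting both rules together gives the usual device-agnostic idiom. A minimal sketch (the CPU fallback guard is an addition here, not part of the answer above, so this also runs on machines without a GPU):

```python
import torch
import torch.nn as nn

# Fall back to the CPU when no GPU is visible, so the script runs anywhere.
device = torch.device('cuda:0' if torch.cuda.is_available() else 'cpu')

model = nn.Linear(4, 2)
model.to(device)          # in place for modules; saving the result is optional

x = torch.randn(1, 4)
x = x.to(device)          # tensors must be rebound: to() returns a copy

out = model(x)
print(out.shape)          # torch.Size([1, 2])
```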

answered Sep 18 '22 by Mano