 

Why does autograd not produce gradient for intermediate variables?

I'm trying to wrap my head around how gradients are represented and how autograd works:

import torch
from torch.autograd import Variable

x = Variable(torch.Tensor([2]), requires_grad=True)
y = x * x
z = y * y

z.backward()

print(x.grad)
#Variable containing:
#32
#[torch.FloatTensor of size 1]

print(y.grad)
#None

Why does it not produce a gradient for y? If y.grad = dz/dy, then shouldn't it at least produce a variable like y.grad = 2*y?
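For reference, my own arithmetic: z = y*y = x**4, so dz/dx = 4*x**3 = 32, which matches the printed value, and dz/dy = 2*y = 8, which is the number I would have expected to see in y.grad.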

asked Aug 31 '17 by foobar


People also ask

What does Autograd Variable do?

Autograd is the PyTorch package for automatic differentiation of all operations on Tensors. It performs backpropagation starting from a variable. In deep learning, this variable often holds the value of the cost function. backward() executes the backward pass and computes all the backpropagation gradients automatically.
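As a rough sketch of that behaviour (using the current tensor API, where requires_grad lives directly on the tensor rather than on a Variable wrapper):

import torch

x = torch.tensor([2.0], requires_grad=True)  # leaf tensor
loss = (x * x).sum()                         # stand-in for a cost function
loss.backward()                              # backward pass starting from the "cost"
print(x.grad)                                # tensor([4.]), since d(x^2)/dx = 2x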

How does Autograd in PyTorch work?

Autograd is a reverse-mode automatic differentiation system. Conceptually, autograd keeps a graph recording all of the operations that created the data as you execute them, giving you a directed acyclic graph whose leaves are the input tensors and roots are the output tensors.
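You can peek at that graph through the grad_fn attributes. A small sketch, reusing the x**4 example from the question:

import torch

x = torch.tensor([2.0], requires_grad=True)  # leaf of the graph
y = x * x                                    # intermediate node
z = y * y                                    # root of the graph
print(z.grad_fn)                 # <MulBackward0 object at ...>
print(z.grad_fn.next_functions)  # edges pointing back toward the leaves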

What does Autograd Grad return?

torch.autograd.grad computes and returns the sum of gradients of outputs with respect to the inputs. grad_outputs should be a sequence of length matching outputs, containing the "vector" in the vector-Jacobian product, usually the pre-computed gradients w.r.t. each of the outputs.
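A short sketch of that call, again on the example from the question; note that grad returns the gradients instead of storing them in .grad:

import torch
from torch.autograd import grad

x = torch.tensor([2.0], requires_grad=True)
y = x * x
z = y * y

dz_dx, dz_dy = grad(z, (x, y))  # gradients come back as a tuple
print(dz_dx)  # tensor([32.])
print(dz_dy)  # tensor([8.])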

What is CTX in Autograd?

ctx is a context object that can be used to stash information for the backward computation. You can cache arbitrary objects for use in the backward pass using the ctx.save_for_backward method.
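For illustration, a minimal custom Function (the Square class here is purely hypothetical) showing the usual role of ctx:

import torch

class Square(torch.autograd.Function):
    @staticmethod
    def forward(ctx, inp):
        ctx.save_for_backward(inp)    # stash the input for the backward pass
        return inp * inp

    @staticmethod
    def backward(ctx, grad_output):
        inp, = ctx.saved_tensors
        return grad_output * 2 * inp  # d(inp^2)/d(inp) = 2*inp

x = torch.tensor([2.0], requires_grad=True)
Square.apply(x).backward()
print(x.grad)  # tensor([4.])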


1 Answer

By default, gradients are only retained for leaf variables. Non-leaf variables' gradients are not retained to be inspected later. This was done by design, to save memory.

-soumith chintala

See: https://discuss.pytorch.org/t/why-cant-i-see-grad-of-an-intermediate-variable/94
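You can check which side of that rule a tensor falls on via is_leaf. A quick sketch with the current tensor API:

import torch

x = torch.tensor([2.0], requires_grad=True)
y = x * x
print(x.is_leaf)  # True  -> backward() populates x.grad
print(y.is_leaf)  # False -> y.grad stays None unless you opt in (see the options below)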

Option 1:

Call y.retain_grad()

x = Variable(torch.Tensor([2]), requires_grad=True)
y = x * x
z = y * y

y.retain_grad()

z.backward()

print(y.grad)
#Variable containing:
# 8
#[torch.FloatTensor of size 1]

Source: https://discuss.pytorch.org/t/why-cant-i-see-grad-of-an-intermediate-variable/94/16

Option 2:

Register a hook, which is essentially a function that gets called when that gradient is computed. You can then save it, assign it, print it, whatever...

from __future__ import print_function
import torch
from torch.autograd import Variable

x = Variable(torch.Tensor([2]), requires_grad=True)
y = x * x
z = y * y

y.register_hook(print) ## this can be anything you need it to be

z.backward()

output:

Variable containing:
 8
[torch.FloatTensor of size 1]

Source: https://discuss.pytorch.org/t/why-cant-i-see-grad-of-an-intermediate-variable/94/2

Also see: https://discuss.pytorch.org/t/why-cant-i-see-grad-of-an-intermediate-variable/94/7
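For what it's worth, the same fix works with the current tensor API, where Variable has been merged into Tensor. A sketch of Option 1 in that style:

import torch

x = torch.tensor([2.0], requires_grad=True)
y = x * x
z = y * y

y.retain_grad()   # opt in to keeping the intermediate gradient
z.backward()
print(y.grad)     # tensor([8.])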

answered Sep 21 '22 by T. Scharf