pytorch how to set .requires_grad False

Tags:

I want to set some of my model frozen. Following the official docs:

with torch.no_grad():     linear = nn.Linear(1, 1)     linear.eval()     print(linear.weight.requires_grad)

But it prints True instead of False. If I want to set the model in eval mode, what should I do?

606

asked Aug 08 '18 13:08

Qian Wang

1 Answers

requires_grad=False

If you want to freeze part of your model and train the rest, you can set requires_grad of the parameters you want to freeze to False.

For example, if you only want to keep the convolutional part of VGG16 fixed:

model = torchvision.models.vgg16(pretrained=True) for param in model.features.parameters():     param.requires_grad = False

By switching the requires_grad flags to False, no intermediate buffers will be saved, until the computation gets to some point where one of the inputs of the operation requires the gradient.

torch.no_grad()

Using the context manager torch.no_grad is a different way to achieve that goal: in the no_grad context, all the results of the computations will have requires_grad=False, even if the inputs have requires_grad=True. Notice that you won't be able to backpropagate the gradient to layers before the no_grad. For example:

x = torch.randn(2, 2) x.requires_grad = True  lin0 = nn.Linear(2, 2) lin1 = nn.Linear(2, 2) lin2 = nn.Linear(2, 2) x1 = lin0(x) with torch.no_grad():         x2 = lin1(x1) x3 = lin2(x2) x3.sum().backward() print(lin0.weight.grad, lin1.weight.grad, lin2.weight.grad)

outputs:

(None, None, tensor([[-1.4481, -1.1789],          [-1.4481, -1.1789]]))

Here lin1.weight.requires_grad was True, but the gradient wasn't computed because the oepration was done in the no_grad context.

model.eval()

If your goal is not to finetune, but to set your model in inference mode, the most convenient way is to use the torch.no_grad context manager. In this case you also have to set your model to evaluation mode, this is achieved by calling eval() on the nn.Module, for example:

model = torchvision.models.vgg16(pretrained=True) model.eval()

This operation sets the attribute self.training of the layers to False, in practice this will change the behavior of operations like Dropout or BatchNorm that must behave differently at training and test time.

174

answered Oct 23 '22 19:10

iacolippo

Related questions
                            
                                Perform a reverse cumulative sum on a numpy array
                            
                                Search python docs offline?
                            
                                Pycharm's code style inspection: ignore/switch off specific rules
                            
                                How to select a range of values in a pandas dataframe column?
                            
                                pandas combine two columns with null values
                            
                                python write string directly to tarfile
                            
                                A good way to escape quotes in a database query string?
                            
                                Python sort() method on list vs builtin sorted() function
                            
                                Deleting read-only directory in Python
                            
                                Get first list index containing sub-string?
                            
                                PyCharm does not recognize modules installed in development mode
                            
                                Python pandas integer YYYYMMDD to datetime
                            
                                Pandas Groupby: Count and mean combined
                            
                                Count how many arguments passed as positional
                            
                                Filter Django database for field containing any value in an array
                            
                                Plot a black-and-white binary map in matplotlib
                            
                                changing default x range in histogram matplotlib
                            
                                Difference between pygame.display.update and pygame.display.flip
                            
                                How to change the plot line color from blue to black?
                            
                                Python - How to show graph in Visual Studio Code itself?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

pytorch how to set .requires_grad False

Tags:

python

pytorch

gradient-descent