How to apply layer-wise learning rate in Pytorch?

Tags:

I know that it is possible to freeze single layers in a network for example to train only the last layers of a pre-trained model. What I’m looking for is a way to apply certain learning rates to different layers.

So for example a very low learning rate of 0.000001 for the first layer and then increasing the learning rate gradually for each of the following layers. So that the last layer then ends up with a learning rate of 0.01 or so.

Is this possible in pytorch? Any idea how I can archive this?

691

asked Aug 11 '18 16:08

MBT

1 Answers

Here is the solution:

from torch.optim import Adam  model = Net()  optim = Adam(     [         {"params": model.fc.parameters(), "lr": 1e-3},         {"params": model.agroupoflayer.parameters()},         {"params": model.lastlayer.parameters(), "lr": 4e-2},     ],     lr=5e-4, )

Other parameters that are didn't specify in optimizer will not optimize. So you should state all layers or groups(OR the layers you want to optimize). and if you didn't specify the learning rate it will take the global learning rate(5e-4). The trick is when you create the model you should give names to the layers or you can group it.

195

answered Sep 24 '22 23:09

Salih Karagoz

Related questions
                            
                                Crashlytics vs Fabric vs Firebase Crash Reporting — I'm lost
                            
                                horizontal grid only (in python using pandas plot + pyplot)
                            
                                Intellisense not working in code snippets - VS Code
                            
                                Pytorch RuntimeError: Expected tensor for argument #1 'indices' to have scalar type Long; but got CUDAType instead
                            
                                What is the difference between throttleTime vs debounceTime in RxJS and when to choose which?
                            
                                UI state restoration for a scene in iOS 13 while still supporting iOS 12. No storyboards
                            
                                Define default constructor for record
                            
                                Jest Folder Structure
                            
                                Pip 21.1 can't import InvalidSchemeCombination
                            
                                Alternatives to System.exit(1)
                            
                                What is LTS (Long Term Support)?
                            
                                jQuery.getJSON doesn't trigger callback

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How to apply layer-wise learning rate in Pytorch?

Tags:

MBT

People also ask

1 Answers

Salih Karagoz

Recent Activity

Donate For Us