I'm fine-tuning a ResNet-50 in PyTorch and want to set the learning rate of the last fully connected layer to 1e-3 while the learning rate of all other layers is set to 1e-6. I know I can follow the method in the documentation:
optim.SGD([{'params': model.base.parameters()},
           {'params': model.classifier.parameters(), 'lr': 1e-3}],
          lr=1e-2, momentum=0.9)
But is there any way to do this without setting the parameters layer by layer?
In that example, model.base's parameters will use the default learning rate of 1e-2, model.classifier's parameters will use a learning rate of 1e-3, and a momentum of 0.9 will be used for all parameters.
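To see these group settings concretely, here is a minimal runnable sketch; the `Net` class is a hypothetical stand-in with `base` and `classifier` submodules, since the documentation snippet assumes a model with those attributes:

```python
import torch
import torch.nn as nn

# Hypothetical stand-in model with "base" and "classifier" parts,
# mirroring the structure assumed by the documentation example.
class Net(nn.Module):
    def __init__(self):
        super().__init__()
        self.base = nn.Sequential(nn.Linear(8, 8), nn.ReLU())
        self.classifier = nn.Linear(8, 2)

model = Net()
optimizer = torch.optim.SGD(
    [{'params': model.base.parameters()},                  # default lr
     {'params': model.classifier.parameters(), 'lr': 1e-3}],  # override
    lr=1e-2, momentum=0.9)

# The first group inherits the default lr; the second keeps its override,
# and the momentum applies to both groups.
for group in optimizer.param_groups:
    print(group['lr'], group['momentum'])
```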
A discriminative learning rate is when you train a neural net with different learning rates for different layers.
You can group layers by type. If you want to group all linear layers, the simplest way is to iterate over model.modules() and test each module's type:
param_grp = []
for m in model.modules():
    if isinstance(m, nn.Linear):
        param_grp.append(m.weight)
        if m.bias is not None:
            param_grp.append(m.bias)  # include the bias as well
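Building on that loop, here is a full sketch that collects every nn.Linear parameter into one group, puts everything else into a base group, and passes both to SGD. The model here is a hypothetical stand-in; the same grouping code works unchanged for a torchvision ResNet-50 (where the final layer is `model.fc`):

```python
import torch
import torch.nn as nn

# Hypothetical stand-in model; the same grouping works for resnet50.
model = nn.Sequential(nn.Conv2d(3, 8, kernel_size=3), nn.ReLU(),
                      nn.Flatten(), nn.Linear(8 * 30 * 30, 10))

# Collect every nn.Linear parameter (weights and biases).
linear_params = []
for m in model.modules():
    if isinstance(m, nn.Linear):
        linear_params.extend(m.parameters())

# Everything else goes into the base group.
linear_ids = {id(p) for p in linear_params}
base_params = [p for p in model.parameters() if id(p) not in linear_ids]

optimizer = torch.optim.SGD(
    [{'params': base_params},                 # uses the default lr below
     {'params': linear_params, 'lr': 1e-3}],  # higher lr for linear layers
    lr=1e-6, momentum=0.9)
```

Since param groups are just lists of tensors, this scales to any predicate you like (by module type, by name prefix, etc.) without naming layers one by one.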