To access a model's parameters in PyTorch, I have seen two methods: using state_dict and using parameters(). What is the difference between them? Is one good practice and the other bad practice? Thanks

PyTorch: What's the difference between state_dict and parameters()?

2 Answers

parameters() returns only the module's learnable parameters, i.e. weights and biases.

Returns an iterator over module parameters.

You can check the list of the parameters as follows:

for name, param in model.named_parameters():
    if param.requires_grad:
        print(name)

On the other hand, state_dict() returns a dictionary containing the whole state of the module. Its source code shows that it collects not only the parameters but also the buffers.

Both parameters and persistent buffers (e.g. running averages) are included. Keys are the corresponding parameter and buffer names.

Check all keys that state_dict contains using:

model.state_dict().keys()

For example, in the state_dict of a model with batch normalization, you'll find entries like bn1.running_mean and bn1.running_var, which are not present in .parameters().
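To make the difference concrete, here is a minimal sketch that compares the two views on a small model. The model itself is a hypothetical toy network chosen for illustration (it is not from the question), so its keys are numeric indices such as 1.running_mean rather than bn1.running_mean.

```python
import torch.nn as nn

# Hypothetical toy model, used only for illustration.
model = nn.Sequential(
    nn.Conv2d(3, 8, kernel_size=3),
    nn.BatchNorm2d(8),
)

param_names = {name for name, _ in model.named_parameters()}
buffer_names = {name for name, _ in model.named_buffers()}
state_keys = set(model.state_dict().keys())

# Entries that appear in state_dict() but not in parameters():
# these are the persistent buffers of the BatchNorm layer.
only_in_state_dict = sorted(state_keys - param_names)
print(only_in_state_dict)
```

For this model, the extra keys should be exactly the names reported by named_buffers().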

If you only need to access the parameters, .parameters() is enough. For saving and loading a model, for example in transfer learning, save the state_dict, not just the parameters.
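A minimal sketch of that save and load round trip follows. The make_model helper is a hypothetical stand-in for a real architecture, and an in-memory buffer is used instead of a file path only to keep the example self-contained.

```python
import io

import torch
import torch.nn as nn


def make_model():
    # Hypothetical architecture; both models must share the same structure.
    return nn.Sequential(nn.Linear(4, 2), nn.BatchNorm1d(2))


source = make_model()

# Save the full state (parameters and buffers) to an in-memory buffer.
buffer = io.BytesIO()
torch.save(source.state_dict(), buffer)
buffer.seek(0)

# Load it into a freshly constructed model of the same architecture.
target = make_model()
target.load_state_dict(torch.load(buffer))

# Every entry, including the BatchNorm running statistics, now matches.
restored = all(
    torch.equal(value, target.state_dict()[key])
    for key, value in source.state_dict().items()
)
print(restored)
```

In practice a file path such as a .pt file would typically take the place of the buffer.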

answered Oct 25 '22 00:10

kHarshit

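Besides the differences in @kHarshit's answer, the requires_grad attribute of trainable tensors is True in net.parameters() but False in net.state_dict(). By default, state_dict() returns detached tensors; passing keep_vars=True returns the original parameters with requires_grad intact.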
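This can be checked with a short sketch. The single linear layer below is a hypothetical example, not the net from the question.

```python
import torch.nn as nn

# Hypothetical single layer, used only for illustration.
net = nn.Linear(3, 1)

# Tensors from parameters() are trainable.
params_require_grad = all(p.requires_grad for p in net.parameters())

# Tensors from state_dict() are detached by default.
state_requires_grad = any(t.requires_grad for t in net.state_dict().values())

# keep_vars=True skips the detach and returns the parameters themselves.
kept_require_grad = all(
    t.requires_grad for t in net.state_dict(keep_vars=True).values()
)

# Detached tensors still share memory with the parameters; they are not copies.
shares_memory = net.state_dict()["weight"].data_ptr() == net.weight.data_ptr()

print(params_require_grad, state_requires_grad, kept_require_grad, shares_memory)
```

Because the detached tensors share storage with the parameters, modifying them in place also changes the model's weights.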

answered Oct 25 '22 00:10

david

Tags: python, machine-learning, deep-learning, pytorch

Gulzar
