
How to access the network weights while using PyTorch 'nn.Sequential'?

I'm building a neural network and I don't know how to access the model weights for each layer.

I've tried

model.input_size.weight

Code:

input_size = 784
hidden_sizes = [128, 64]
output_size = 10

# Build a feed-forward network
model = nn.Sequential(nn.Linear(input_size, hidden_sizes[0]),
                      nn.ReLU(),
                      nn.Linear(hidden_sizes[0], hidden_sizes[1]),
                      nn.ReLU(),
                      nn.Linear(hidden_sizes[1], output_size),
                      nn.Softmax(dim=1))

I expected to get the weights but I got

'Sequential' object has no attribute 'input_size'

asked Jun 04 '19 by Muhammad Shamel

People also ask

What does nn sequential do in PyTorch?

nn.Sequential is a container used when you want to run certain layers sequentially. It keeps the forward pass readable and compact.
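As a minimal sketch, calling a Sequential runs each contained module in order:

import torch
import torch.nn as nn

block = nn.Sequential(nn.Linear(4, 8), nn.ReLU(), nn.Linear(8, 2))
x = torch.randn(1, 4)
y = block(x)      # runs Linear -> ReLU -> Linear in order, no forward() to write
print(y.shape)    # torch.Size([1, 2])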

Does PyTorch automatically initialize weights?

PyTorch has built-in weight initialization that works quite well, so you usually don't have to worry about it. You can check the default initialization of the Conv and Linear layers in their source.
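A quick sketch: an nn.Linear is already initialized when constructed, and torch.nn.init can be used if you do want to override the default scheme.

import torch.nn as nn

layer = nn.Linear(784, 128)          # weight and bias are already initialized here
print(layer.weight.std().item())     # inspect the default initialization

# optional: override with a scheme from torch.nn.init
nn.init.xavier_uniform_(layer.weight)
nn.init.zeros_(layer.bias)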

What is the use of nn sequential?

The objective of nn.Sequential is to implement sequential modules quickly: you are not required to write a forward definition, since it is implicitly known because the layers are called sequentially on each other's outputs. In a more complicated module, though, you might need multiple sequential submodules.
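A minimal sketch of that "multiple sequential submodules" pattern (the names features and classifier are illustrative):

import torch.nn as nn

class Net(nn.Module):
    def __init__(self):
        super().__init__()
        # two Sequential sub-blocks inside one module
        self.features = nn.Sequential(nn.Linear(784, 128), nn.ReLU())
        self.classifier = nn.Sequential(nn.Linear(128, 10), nn.Softmax(dim=1))

    def forward(self, x):
        return self.classifier(self.features(x))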

Is nn sequential faster?

nn.Sequential itself adds essentially no overhead; any speed difference versus writing the same layers out in a custom forward() is negligible. Its benefit is convenience, not speed.


4 Answers

If you print out the model using print(model), you get:

Sequential(
  (0): Linear(in_features=784, out_features=128, bias=True)
  (1): ReLU()
  (2): Linear(in_features=128, out_features=64, bias=True)
  (3): ReLU()
  (4): Linear(in_features=64, out_features=10, bias=True)
  (5): Softmax(dim=1)
)

Now you can access each layer by its index, so you can get the weights of (say) the third Linear layer with model[4].weight.
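For example, with the model from the question (a quick sketch):

print(model[0].weight.shape)   # first Linear: torch.Size([128, 784])
print(model[0].bias.shape)     # torch.Size([128])
print(model[4].weight.shape)   # third Linear: torch.Size([10, 64])

# or walk over every parameter with its index-based name:
for name, param in model.named_parameters():
    print(name, param.shape)   # '0.weight', '0.bias', '2.weight', ...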

answered Nov 02 '22 by Saeed


As per the official PyTorch discussion forum, when the nn.Sequential() is stored as an attribute of a parent module (e.g. self.layer = nn.Sequential(...)), you can access the weights of a specific module inside it with

model.layer[0].weight  # weights of the first module inside the wrapped nn.Sequential()

For a bare nn.Sequential like the one in the question, index the model directly: model[0].weight.
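To make the distinction concrete, a small sketch (the Wrapper name is illustrative):

import torch.nn as nn

class Wrapper(nn.Module):
    def __init__(self):
        super().__init__()
        self.layer = nn.Sequential(nn.Linear(784, 128), nn.ReLU())

model = Wrapper()
print(model.layer[0].weight.shape)  # first module inside the wrapped Sequential

seq = nn.Sequential(nn.Linear(784, 128), nn.ReLU())
print(seq[0].weight.shape)          # a bare Sequential is indexed directly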
answered Nov 02 '22 by Anubhav Singh


I've tried many ways, and the cleanest I found is to name each layer by passing an OrderedDict:

from collections import OrderedDict
model = nn.Sequential(OrderedDict([
                  ('fc1', nn.Linear(input_size, hidden_sizes[0])),
                  ('relu1', nn.ReLU()),
                  ('fc2', nn.Linear(hidden_sizes[0], hidden_sizes[1])),
                  ('relu2', nn.ReLU()),
                  ('output', nn.Linear(hidden_sizes[1], output_size)),
                  ('softmax', nn.Softmax(dim=1))]))

To access the weights of a layer, call it by its unique name. For example, the weights of the first layer are at model.fc1.weight:

Parameter containing:
tensor([[-7.3584e-03, -2.3753e-02, -2.2565e-02,  ...,  2.1965e-02,
      1.0699e-02, -2.8968e-02],
    [ 2.2930e-02, -2.4317e-02,  2.9939e-02,  ...,  1.1536e-02,
      1.9830e-02, -1.4294e-02],
    [ 3.0891e-02,  2.5781e-02, -2.5248e-02,  ..., -1.5813e-02,
      6.1708e-03, -1.8673e-02],
    ...,
    [-1.2596e-03, -1.2320e-05,  1.9106e-02,  ...,  2.1987e-02,
     -3.3817e-02, -9.4880e-03],
    [ 1.4234e-02,  2.1246e-02, -1.0369e-02,  ..., -1.2366e-02,
     -4.7024e-04, -2.5259e-02],
    [ 7.5356e-03,  3.4400e-02, -1.0673e-02,  ...,  2.8880e-02,
     -1.0365e-02, -1.2916e-02]], requires_grad=True)
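The chosen names also show up in state_dict() and named_parameters(), so weights can be fetched by name programmatically (a quick sketch with the model above):

print(model.state_dict().keys())
# odict_keys(['fc1.weight', 'fc1.bias', 'fc2.weight', 'fc2.bias',
#             'output.weight', 'output.bias'])

w = model.state_dict()['fc1.weight']   # same tensor data as model.fc1.weight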
answered Nov 02 '22 by Muhammad Shamel


Let's say you define the model as a class. Then you can call model.parameters().

# Build a feed-forward network
class FFN(nn.Module):
    def __init__(self):
        super().__init__()
        self.layer1 = nn.Linear(input_size, hidden_sizes[0])
        self.layer2 = nn.Linear(hidden_sizes[0], hidden_sizes[1])
        self.layer3 = nn.Linear(hidden_sizes[1], output_size)
        self.relu = nn.ReLU()
        self.softmax = nn.Softmax(dim=1)

    def forward(self, x):
        x = self.relu(self.layer1(x))
        x = self.relu(self.layer2(x))
        x = self.softmax(self.layer3(x))
        return x

model = FFN()
print(model.parameters())

This prints <generator object Module.parameters at 0x7f99886d0d58>, so you can pass it to an optimizer right away!

But if you want to access particular weights or look at them manually, you can just convert the generator to a list: print(list(model.parameters())). This spits out a giant list of parameter tensors.

If you only want the last parameter tensor (here, the bias of the final layer), you can do print(list(model.parameters())[-1]), which prints: tensor([-0.0347, -0.0289, -0.0652, -0.1233, 0.1093, 0.1187, -0.0407, 0.0885, -0.0045, -0.1238], requires_grad=True)
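Positional indexing into the parameter list is fragile; since the layers above have names, attribute access is more explicit (a small sketch using the FFN model defined above):

print(model.layer3.weight.shape)   # torch.Size([10, 64])
print(model.layer3.bias)           # same tensor as list(model.parameters())[-1]

# named_parameters() pairs each tensor with its dotted name:
for name, p in model.named_parameters():
    print(name, tuple(p.shape))    # 'layer1.weight', 'layer1.bias', ...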

answered Nov 02 '22 by Yume