Understanding PyTorch CNN Channels

Question

I'm a bit confused at how CNNs and channels work. Specifically, how come these two implementations are not equal? Isn't the # of output channels just applying however many # of filters?

    self.conv1 = nn.Conv2d(1, 10, kernel_size=(3, self.embeds_size))
    self.conv2 = nn.ModuleList([nn.Conv2d(1, 1, kernel_size=(3, self.embeds_size)) for f in range(10)])
    ...


    conv1s = self.conv1(x)
    conv2s = [conv(x) for conv in self.conv2]
    conv2s = torch.stack(conv2s, 1).squeeze(2)
    print(torch.equal(conv1s, conv2s))

Jens Petersen · Accepted Answer

Check the state dicts of the different modules. Unless you're doing something fancy that you didn't tell us about, PyTorch will initialize the weights randomly. Specifically, try this:

print(self.conv1.state_dict()["weight"][0])
print(self.conv2[0].state_dict()["weight"][0])

They will be different.

Understanding PyTorch CNN Channels

Tags:

python

machine-learning

neural-network

deep-learning

pytorch

Matt

1 Answers

Jens Petersen

Recent Activity

Donate For Us

Understanding PyTorch CNN Channels

Tags:

python

machine-learning

neural-network

deep-learning

pytorch

Matt

1 Answers

Jens Petersen

Related questions

Recent Activity

Donate For Us