How does a convolution kernel get trained in a CNN?

Tags: backpropagation, keras, conv-neural-network, convolution

In a CNN, the convolution operation 'convolves' a kernel matrix over an input matrix. I understand how a fully connected layer is trained with gradient descent and backpropagation, but how does the kernel matrix change over time?

There are multiple ways to initialize the kernel matrix, as described in the Keras documentation. However, I would like to know how it is trained. If it also uses backpropagation, is there a paper that describes the training process in detail?

This post also raises a similar question, but it is unanswered.

asked Aug 20 '18 09:08

Rangan Das

1 Answer

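Here is a well-explained post about backpropagation for convolutional layers. In short, the kernel is also trained with gradient descent, just like the weights of a fully connected (FC) layer. The only difference is that each kernel weight is shared across every position the kernel visits, so its gradient is the sum of the contributions from all of those positions. In fact, you can effectively turn a convolutional layer into a fully connected layer, as explained here.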
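To make this concrete, here is a minimal NumPy sketch. It is not taken from the linked posts; it assumes a single-channel 2D input, one kernel, stride 1, no padding, no bias, and a squared-error loss, and all function names are made up for illustration. It shows the forward pass, the kernel gradient obtained by backpropagation, a finite-difference check of that gradient, a few gradient descent steps, and the equivalent FC weight matrix.

```python
import numpy as np

def conv2d_valid(x, k):
    """Forward pass: slide k over x (stride 1, no padding), as CNN layers do."""
    kh, kw = k.shape
    oh, ow = x.shape[0] - kh + 1, x.shape[1] - kw + 1
    out = np.empty((oh, ow))
    for i in range(oh):
        for j in range(ow):
            out[i, j] = np.sum(x[i:i + kh, j:j + kw] * k)
    return out

def kernel_grad(x, grad_out, k_shape):
    """Backward pass for the kernel: dL/dk[a, b] sums over all output positions,
    because the same weight k[a, b] is reused at every position."""
    kh, kw = k_shape
    oh, ow = grad_out.shape
    g = np.zeros(k_shape)
    for a in range(kh):
        for b in range(kw):
            g[a, b] = np.sum(x[a:a + oh, b:b + ow] * grad_out)
    return g

def conv_as_dense_matrix(k, in_shape):
    """Build the (sparse, weight-sharing) FC matrix M with M @ x.ravel() == conv output."""
    H, W = in_shape
    kh, kw = k.shape
    oh, ow = H - kh + 1, W - kw + 1
    M = np.zeros((oh * ow, H * W))
    for i in range(oh):
        for j in range(ow):
            for a in range(kh):
                for b in range(kw):
                    M[i * ow + j, (i + a) * W + (j + b)] = k[a, b]
    return M

rng = np.random.default_rng(0)
x = rng.normal(size=(8, 8))                      # toy input
true_k = np.array([[1.0, 0.0, -1.0],
                   [2.0, 0.0, -2.0],
                   [1.0, 0.0, -1.0]])            # kernel we pretend not to know
target = conv2d_valid(x, true_k)                 # desired output

def loss_fn(k):
    return 0.5 * np.sum((conv2d_valid(x, k) - target) ** 2)

k = rng.normal(scale=0.1, size=(3, 3))           # random initialization

# Finite-difference check of the analytic gradient at the initial kernel.
analytic = kernel_grad(x, conv2d_valid(x, k) - target, k.shape)
numeric = np.zeros_like(k)
eps = 1e-5
for a in range(3):
    for b in range(3):
        d = np.zeros_like(k)
        d[a, b] = eps
        numeric[a, b] = (loss_fn(k + d) - loss_fn(k - d)) / (2 * eps)

# Plain gradient descent on the kernel weights (learning rate chosen by hand).
lr = 0.005
losses = []
for step in range(500):
    out = conv2d_valid(x, k)
    grad_out = out - target                      # dL/d(out) for squared error
    losses.append(0.5 * np.sum(grad_out ** 2))
    k -= lr * kernel_grad(x, grad_out, k.shape)

# The same layer written as a fully connected layer with tied weights.
M = conv_as_dense_matrix(k, x.shape)
```

In a real framework such as Keras, this gradient is computed automatically and summed over channels and the batch as well, but the update rule for the kernel is the same kind of gradient step used for FC weights.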

answered Oct 08 '22 21:10

ibarrond
