What's the difference between Conv layer and Pooling layer in CNN?

1 Answer

The difference comes down to (1) how each is computed and (2) what each is used for.

How they are computed:

Take as an example an input that is a 5x5 matrix (think of an image of 5 by 5 pixels). Both the pooling layer and the convolution layer are operations applied at each input "pixel". Let's take a pixel in the center of the image (to avoid discussing what happens at the corners for now; more on that later) and define a 3x3 "kernel" for both the pooling layer and the convolution layer.

Pooling layer: you superimpose the pooling kernel on the input pixel (in the figure, you put the center of the blue matrix on top of the black X_00) and take the maximum of the input values it covers.

Convolutional layer: you superimpose the convolutional kernel on the input pixel (in the figure, you put the center of the orange matrix on top of the black X_00), then multiply element-wise and sum the products, as indicated in the figure.
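As a minimal sketch of the two operations at the center pixel, here is a NumPy version. The input values and the kernel coefficients are made up for illustration (in a real network the kernel would be learned), and, as in most deep learning libraries, the kernel is not flipped before multiplying.

```python
import numpy as np

# 5x5 input "image"; the values 0..24 are chosen only for illustration
x = np.arange(25, dtype=float).reshape(5, 5)

# Hypothetical 3x3 convolution kernel (in a real network these are learned)
f = np.array([[1.0, 0.0, -1.0],
              [1.0, 0.0, -1.0],
              [1.0, 0.0, -1.0]])

# 3x3 window centered on the middle pixel (row 2, column 2)
window = x[1:4, 1:4]

# Max pooling: take the largest value under the kernel
pooled = window.max()

# Convolution: element-wise multiplication, then summation
convolved = (window * f).sum()

print(pooled, convolved)  # 18.0 -6.0
```

Both operations look at the same 3x3 neighborhood; they only differ in how the nine values are combined into one output.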

Where do the convolution coefficients, F_.., come from? They are learned when training the network. For max pooling there is nothing to learn; you just take the maximum. You can think of max pooling as similar to a convolution, but with fixed coefficients and taking the maximum instead of summing.

You perform this for each input element. What happens at the image borders and corners depends on your choice: discard the input elements at the sides/corners, pad the input, etc. Also, you do not have to move the kernel pixel by pixel; you can jump several pixels at a time (a stride greater than 1).
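To make the border and jumping choices concrete, here is a rough sketch of sliding a 3x3 window over the whole 5x5 input. The helper `slide` and its parameter names are hypothetical, written only for this example. It pads with zeros for simplicity, which is fine here because all inputs are non-negative; libraries typically treat padded positions as negative infinity for max pooling.

```python
import numpy as np

def slide(x, k, stride, pad, reduce_fn):
    """Apply reduce_fn to every k x k window of x after zero-padding by `pad`."""
    x = np.pad(x, pad)  # pad=0 leaves the values unchanged
    out_h = (x.shape[0] - k) // stride + 1
    out_w = (x.shape[1] - k) // stride + 1
    out = np.empty((out_h, out_w))
    for i in range(out_h):
        for j in range(out_w):
            r, c = i * stride, j * stride  # top-left corner of the window
            out[i, j] = reduce_fn(x[r:r + k, c:c + k])
    return out

x = np.arange(25, dtype=float).reshape(5, 5)
f = np.full((3, 3), 1.0 / 9.0)  # illustrative averaging kernel, not a learned one

valid = slide(x, 3, stride=1, pad=0, reduce_fn=np.max)    # borders discarded
same = slide(x, 3, stride=1, pad=1, reduce_fn=np.max)     # borders padded
strided = slide(x, 3, stride=2, pad=0, reduce_fn=np.max)  # jump 2 pixels at a time
conv = slide(x, 3, stride=1, pad=0, reduce_fn=lambda w: (w * f).sum())

print(valid.shape, same.shape, strided.shape)  # (3, 3) (5, 5) (2, 2)
```

Discarding the borders shrinks the 5x5 input to 3x3, padding by one pixel keeps it at 5x5, and jumping two pixels at a time shrinks it to 2x2.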

What they are used for: max pooling typically reduces the size of the input and acts as a kind of summarization of the data, while also providing some invariance to small translations (e.g. if the object moves left-right or up-down). Convolution, depending on the filter coefficients (e.g. one column negative and another positive), acts as a filter that extracts patterns such as vertical lines, horizontal lines, etc.
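Both points can be illustrated with a small sketch. The kernel below (one column negative, one positive) is a hand-picked example of a vertical-edge filter, not something taken from a trained network, and the helper `conv_valid` is hypothetical.

```python
import numpy as np

def conv_valid(x, f):
    """Slide f over x pixel by pixel without padding; multiply and sum."""
    k = f.shape[0]
    h, w = x.shape[0] - k + 1, x.shape[1] - k + 1
    return np.array([[(x[i:i + k, j:j + k] * f).sum() for j in range(w)]
                     for i in range(h)])

# Hand-picked kernel: left column negative, right column positive
vertical_kernel = np.array([[-1.0, 0.0, 1.0]] * 3)

# Image with a vertical edge (dark left, bright right) and its transpose
vertical_edge = np.zeros((5, 5))
vertical_edge[:, 3:] = 1.0
horizontal_edge = vertical_edge.T

resp_v = conv_valid(vertical_edge, vertical_kernel)    # non-zero near the edge
resp_h = conv_valid(horizontal_edge, vertical_kernel)  # zero everywhere

# Max pooling and small translations: a bright pixel moved one step to the
# right inside the same 3x3 pooling window gives the same pooled value
a = np.zeros((5, 5))
a[1, 1] = 1.0
b = np.roll(a, 1, axis=1)  # bright pixel is now at (1, 2)
pooled_a = a[0:3, 0:3].max()
pooled_b = b[0:3, 0:3].max()

print(resp_v)
print(resp_h)
print(pooled_a == pooled_b)  # True
```

The filter responds to the vertical edge and ignores the horizontal one, while the pooled value is unchanged by the one-pixel shift as long as the pixel stays inside the same window.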

[Figure: input image, max_pool_kernel, conv_kernel]

answered Sep 18 '22 09:09

Antoni

Tags:

neural-network

conv-neural-network

hrsma2i
