The theory in these links shows that the layer order in a convolutional network is: <code>Convolutional Layer - Non-linear Activation - Pooling Layer</code>. <ol> <li>Neural networks and deep learning (equation (125))</li> <li>Deep learning book (page 304, 1st paragraph)</li> <li>LeNet (the equation)</li> <li>The source in this headline</li> </ol> But in the final implementations on those sites, the order is: <code>Convolutional Layer - Pooling Layer - Non-linear Activation</code> <ol> <li>network3.py</li> <li>The source code, LeNetConvPoolLayer class</li> </ol> I have also looked at the Conv2D operation, but it has no activation function; it is only a convolution with a flipped kernel. Can someone explain why this happens?


Activation function after pooling layer or convolutional layer?

2 Answers

Max-pooling and monotonically increasing non-linearities commute. This means that MaxPool(Relu(x)) = Relu(MaxPool(x)) for any input, so the result is the same in either order. It is therefore technically better to subsample through max-pooling first and then apply the non-linearity, especially if it is costly, such as the sigmoid. In practice it is often done the other way round; it doesn't seem to change much in performance.
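The commutation claim can be checked numerically. The following is a minimal sketch in plain Python, assuming non-overlapping k x k pooling windows; the helper names (`max_pool`, `apply_elementwise`) and the sample input are made up for illustration and do not come from Theano or any other library.

```python
import math

def relu(v):
    return max(0.0, v)

def sigmoid(v):
    return 1.0 / (1.0 + math.exp(-v))

def max_pool(grid, k):
    """Non-overlapping k x k max-pooling over a 2D list (sides divisible by k)."""
    rows, cols = len(grid), len(grid[0])
    return [
        [max(grid[r + i][c + j] for i in range(k) for j in range(k))
         for c in range(0, cols, k)]
        for r in range(0, rows, k)
    ]

def apply_elementwise(f, grid):
    """Apply a scalar function to every entry of a 2D list."""
    return [[f(v) for v in row] for row in grid]

# Hypothetical 4x4 feature map with mixed signs.
x = [
    [-3.0,  1.0,  0.5, -2.0],
    [ 2.0, -1.0, -0.5, -4.0],
    [ 0.0, -6.0,  7.0,  3.0],
    [-2.0, -1.0,  1.5, -9.0],
]

for act in (relu, sigmoid):
    pool_then_act = apply_elementwise(act, max_pool(x, 2))
    act_then_pool = max_pool(apply_elementwise(act, x), 2)
    # Both orders give the same 2x2 output for a non-decreasing activation.
    print(act.__name__, pool_then_act == act_then_pool)
```

Both ReLU and the sigmoid are non-decreasing, which is the only property the check relies on.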

As for conv2D, the flipped kernel is not an extra step: flipping the kernel is part of the mathematical definition of convolution, and conv2D implements exactly that definition. Convolution is a linear operation, so you have to add the non-linearity yourself in the next step, e.g. with theano.tensor.nnet.relu.
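To make the two points concrete (the kernel flip belongs to the definition of convolution, and the non-linearity is a separate step), here is a small pure-Python sketch. It is not Theano's conv2d; the function names, the 'valid'-only output region, and the sample image and kernel are assumptions chosen for illustration.

```python
def correlate2d_valid(image, kernel):
    """Slide the kernel over the image without flipping it ('valid' region only)."""
    kh, kw = len(kernel), len(kernel[0])
    out_h = len(image) - kh + 1
    out_w = len(image[0]) - kw + 1
    return [
        [sum(image[r + i][c + j] * kernel[i][j]
             for i in range(kh) for j in range(kw))
         for c in range(out_w)]
        for r in range(out_h)
    ]

def convolve2d_valid(image, kernel):
    """True convolution: flip the kernel along both axes, then correlate."""
    flipped = [row[::-1] for row in kernel[::-1]]
    return correlate2d_valid(image, flipped)

def relu(grid):
    """Element-wise ReLU, applied as a separate step after the linear operation."""
    return [[max(0, v) for v in row] for row in grid]

# Hypothetical 3x3 input and 2x2 kernel.
image = [
    [1, 5, 2],
    [3, 0, 4],
    [6, 2, 1],
]
kernel = [
    [1, 0],
    [0, -1],
]

linear_out = convolve2d_valid(image, kernel)      # purely linear, may be negative
activated = relu(linear_out)                      # non-linearity added afterwards
unflipped = correlate2d_valid(image, kernel)      # cross-correlation, for comparison

print(linear_out)   # [[-1, -1], [-1, 1]]
print(activated)    # [[0, 0], [0, 1]]
print(unflipped)    # [[1, 1], [1, -1]]
```

With this asymmetric kernel the flipped and unflipped results differ, which shows that the flip is what distinguishes convolution from cross-correlation.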

178

answered Sep 25 '22 06:09

eickenberg

In many papers people use conv -> pooling -> non-linearity. That does not mean you can't use another order and get reasonable results. With a max-pooling layer and ReLU, the order does not matter (both compute the same thing):

MaxPool(ReLU(x)) = ReLU(MaxPool(x))

You can prove this by noting that ReLU is an element-wise operation and a non-decreasing function, so

max(ReLU(x_1), ..., ReLU(x_n)) = ReLU(max(x_1, ..., x_n))

The same holds for almost every activation function, since most of them are non-decreasing. It does not hold for pooling layers in general, for example average pooling.
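A single pooling window is enough to see why average pooling breaks the equality. This is a hedged sketch with made-up values, treating one 2x2 window as a flat list:

```python
def relu(v):
    return max(0.0, v)

def avg(values):
    return sum(values) / len(values)

# One flattened 2x2 pooling window (hypothetical values).
window = [-1.0, 1.0, -1.0, 1.0]

# Max-pooling: both orders agree.
max_then_relu = relu(max(window))                # relu(1.0) = 1.0
relu_then_max = max(relu(v) for v in window)     # max(0, 1, 0, 1) = 1.0

# Average pooling: the orders disagree.
avg_then_relu = relu(avg(window))                # relu(0.0) = 0.0
relu_then_avg = avg([relu(v) for v in window])   # (0 + 1 + 0 + 1) / 4 = 0.5

print(max_then_relu == relu_then_max)   # True
print(avg_then_relu, relu_then_avg)     # 0.0 0.5
```

Averaging lets the negative entries cancel the positive ones before ReLU sees them, whereas applying ReLU first removes the negative entries, so the two orders give different outputs.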

Although both orders produce the same result, Activation(MaxPool(x)) gets there with fewer operations. For a pooling layer of size k, it makes k^2 times fewer calls to the activation function.

Sadly, this optimization is negligible for a CNN, because most of the time is spent in the convolutional layers.
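The k^2 factor can be seen by counting activation calls directly. The sketch below assumes non-overlapping k x k windows (stride equal to k) and uses a hypothetical call-counting sigmoid; none of these helpers come from a real library.

```python
import math

def make_counted_sigmoid():
    """Return a sigmoid and a one-element list that counts how often it is called."""
    counter = [0]
    def sigmoid(v):
        counter[0] += 1
        return 1.0 / (1.0 + math.exp(-v))
    return sigmoid, counter

def max_pool(grid, k):
    """Non-overlapping k x k max-pooling over a 2D list (sides divisible by k)."""
    rows, cols = len(grid), len(grid[0])
    return [
        [max(grid[r + i][c + j] for i in range(k) for j in range(k))
         for c in range(0, cols, k)]
        for r in range(0, rows, k)
    ]

n, k = 8, 2
# Hypothetical 8x8 feature map with values cycling through -3.0 .. 3.0.
x = [[float((r * n + c) % 7 - 3) for c in range(n)] for r in range(n)]

# Order 1: activation on every entry, then pooling.
act, calls_act_first = make_counted_sigmoid()
out_act_first = max_pool([[act(v) for v in row] for row in x], k)

# Order 2: pooling first, then activation on the pooled entries only.
act, calls_pool_first = make_counted_sigmoid()
out_pool_first = [[act(v) for v in row] for row in max_pool(x, k)]

print(calls_act_first[0], calls_pool_first[0])       # 64 16
print(calls_act_first[0] // calls_pool_first[0])     # 4, which is k^2
print(out_act_first == out_pool_first)               # True
```

The saving applies only to the activation step; the convolution that precedes it costs the same in both orders.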

answered Sep 24 '22 06:09

Salvador Dali


Tags:

neural-network

convolution

theano

malioboro
