How do I calculate the output size in a convolution layer? For example, I have a 2D convolution layer that takes a 3x128x128 input and has 40 filters of size 5x5.


Calculate the Output size in Convolution layer [closed]

3 Answers

You can use the formula [(W − K + 2P)/S] + 1, where the square brackets mean the division is rounded down.

W is the input size along one dimension (128 in your case)
K is the kernel size (5 in your case)
P is the padding (0 in your case, assuming no padding is applied)
S is the stride, which you have not provided

Substituting these values into the formula, with a stride of 1:

Output_Size = (128 - 5 + 2*0)/1 + 1 = 124

Output_Shape = (124, 124, 40)

NOTE: The stride defaults to 1 if not provided, and the 40 in (124, 124, 40) is the number of filters. In PyTorch's channels-first layout the same output is written as (40, 124, 124).
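As a minimal sketch, the formula above can be wrapped in a small helper. The function name `conv_output_size` and its default arguments are assumptions made for illustration; they are not part of any library.

```python
def conv_output_size(w, k, p=0, s=1):
    """Output length along one spatial dimension: floor((w - k + 2p) / s) + 1.

    w: input size, k: kernel size, p: padding, s: stride.
    (Hypothetical helper, named here only for illustration.)
    """
    # Integer division rounds down, matching the brackets in the formula.
    return (w - k + 2 * p) // s + 1


# The example from the question: 128x128 input, 5x5 kernel, no padding, stride 1.
side = conv_output_size(128, 5)
print((side, side, 40))  # 40 is the number of filters
```

Passing `p=2` would keep the output at 128, and `s=2` would roughly halve it, so the same helper covers the cases the question did not specify.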

answered Oct 14 '22 04:10

The BrownBatman

You can find it in two ways. The simple method, which applies only when the stride is 1 and there is no padding, is input_size - (filter_size - 1):

W - (K - 1)
Here W = Input size
     K = Filter size

But the second method is the standard way to find the output size, because it also accounts for stride and padding:

((W - K + 2P)/S) + 1
Here W = Input size
     K = Filter size
     S = Stride
     P = Padding
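To see how the two methods relate, here is a short sketch that compares them. The function names `simple_size` and `general_size` are hypothetical, and the division is assumed to round down, as convolution layers usually do.

```python
def simple_size(w, k):
    # Simple method: only meaningful for stride 1 and no padding.
    return w - (k - 1)


def general_size(w, k, p, s):
    # Standard method: integer division rounds the quotient down.
    return (w - k + 2 * p) // s + 1


# With P = 0 and S = 1 both methods agree.
assert simple_size(128, 5) == general_size(128, 5, p=0, s=1)

# With padding or a larger stride only the standard method applies.
for p, s in [(0, 1), (2, 1), (0, 2)]:
    print(f"P={p}, S={s}: output size {general_size(128, 5, p, s)}")
```

The simple method is just the standard one with P = 0 and S = 1 substituted in, which is why it stops being valid as soon as either value changes.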

answered Oct 14 '22 04:10

Ramzan Shahid

Let me start simple. Since both your input and your filter are square, I will work out one dimension, and you can apply the same to the other dimension(s). Imagine you are building fences between trees: if there are N trees, you have to build N-1 fences. Now apply that analogy to convolution layers.

Your output size will be: input size - filter size + 1

That is because a filter of size K spans K input positions, so, like the fences between the trees, it fits in K - 1 fewer places than there are input positions.

Let's calculate your output with that idea: 128 - 5 + 1 = 124. The same holds for the other dimension, so now you have a 124 x 124 image.

That is for one filter.

If you apply 40 filters, you get an additional dimension: 124 x 124 x 40
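The counting argument can be checked by brute force. The sketch below simply enumerates every position where the filter fits inside the input; `count_positions` is a hypothetical helper written for this illustration, and it assumes no padding.

```python
def count_positions(w, k, s=1):
    """Count the start indices at which a window of size k fits in an input of size w."""
    # A start index i is valid when the window [i, i + k) stays inside the input.
    return len([i for i in range(0, w, s) if i + k <= w])


side = count_positions(128, 5)
assert side == 128 - 5 + 1  # matches input size - filter size + 1

# One 2D output map per filter, so 40 filters give a third dimension of 40.
print(f"{side} x {side} x 40")
```

Enumerating positions is slower than the closed-form expression, but it makes the "how many places does the filter fit" reasoning explicit.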

Here is a great guide if you want to know more about advanced convolution arithmetic: https://arxiv.org/pdf/1603.07285.pdf

answered Oct 14 '22 06:10

Sam Oz


Tags:

machine-learning

deep-learning

pytorch

conv-neural-network

Monk247uk
