How to calculate the number of parameters of convolutional neural networks?

Tags:

I can't give the correct number of parameters of AlexNet or VGG Net.

For example, to calculate the number of parameters of a conv3-256 layer of VGG Net, the answer is 0.59M = (3*3)*(256*256), that is (kernel size) * (product of both number of channels in the joint layers), however in that way, I can't get the 138M parameters.

So could you please show me where is wrong with my calculation, or show me the right calculation procedure?

371

asked Jan 30 '15 08:01

Eric

1 Answers

If you refer to VGG Net with 16-layer (table 1, column D) then 138M refers to the total number of parameters of this network, i.e including all convolutional layers, but also the fully connected ones.

Looking at the 3rd convolutional stage composed of 3 x conv3-256 layers:

the first one has N=128 input planes and F=256 output planes,
the two other ones have N=256 input planes and F=256 output planes.

The convolution kernel is 3x3 for each of these layers. In terms of parameters this gives:

128x3x3x256 (weights) + 256 (biases) = 295,168 parameters for the 1st one,
256x3x3x256 (weights) + 256 (biases) = 590,080 parameters for the two other ones.

As explained above you have to do that for all layers, but also the fully-connected ones, and sum these values to obtain the final 138M number.

UPDATE: the breakdown among layers give:

conv3-64  x 2       : 38,720 conv3-128 x 2       : 221,440 conv3-256 x 3       : 1,475,328 conv3-512 x 3       : 5,899,776 conv3-512 x 3       : 7,079,424 fc1                 : 102,764,544 fc2                 : 16,781,312 fc3                 : 4,097,000 TOTAL               : 138,357,544

In particular for the fully-connected layers (fc):

 fc1 (x): (512x7x7)x4,096 (weights) + 4,096 (biases)  fc2    : 4,096x4,096     (weights) + 4,096 (biases)  fc3    : 4,096x1,000     (weights) + 1,000 (biases)

(x) see section 3.2 of the article: the fully-connected layers are first converted to convolutional layers (the first FC layer to a 7 × 7 conv. layer, the last two FC layers to 1 × 1 conv. layers).

Details about fc1

As precised above the spatial resolution right before feeding the fully-connected layers is 7x7 pixels. This is because this VGG Net uses spatial padding before convolutions, as detailed within section 2.1 of the paper:

[...] the spatial padding of conv. layer input is such that the spatial resolution is preserved after convolution, i.e. the padding is 1 pixel for 3×3 conv. layers.

With such a padding, and working with a 224x224 pixels input image, the resolution decreases as follow along the layers: 112x112, 56x56, 28x28, 14x14 and 7x7 after the last convolution/pooling stage which has 512 feature maps.

This gives a feature vector passed to fc1 with dimension: 512x7x7.

answered Oct 07 '22 02:10

deltheil

Related questions
                            
                                How to tune parameters in Random Forest, using Scikit Learn?
                            
                                How to find probability distribution and parameters for real data? (Python 3)
                            
                                How do you use Keras LeakyReLU in Python?
                            
                                What is the difference between an Embedding Layer and a Dense Layer?
                            
                                Tensorflow Precision / Recall / F1 score and Confusion matrix
                            
                                Why feature scaling in SVM?
                            
                                How to calculate prediction uncertainty using Keras?
                            
                                How to predict time series in scikit-learn?
                            
                                Options for deploying R models in production
                            
                                scikit-learn: how to scale back the 'y' predicted result
                            
                                How can I classify data with the nearest-neighbor algorithm using Python?
                            
                                Evaluate multiple scores on sklearn cross_val_score
                            
                                How to tell which Keras model is better?
                            
                                What is the use of train_on_batch() in keras?
                            
                                What is the correct way to change image channel ordering between channels first and channels last?
                            
                                PCA For categorical features?
                            
                                Machine Learning and Natural Language Processing [closed]
                            
                                What is the difference between Keras model.evaluate() and model.predict()?
                            
                                Different decision tree algorithms with comparison of complexity or performance
                            
                                Received a label value of 1 which is outside the valid range of [0, 1) - Python, Keras

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How to calculate the number of parameters of convolutional neural networks?

Tags:

machine-learning

neural-network

computer-vision

vgg-net

Eric

People also ask

1 Answers

deltheil

Recent Activity

Donate For Us