Counting the number of multiply-add operations (MAC) in a Caffe CNN architecture

Lately I've been benchmarking some CNNs in terms of time, number of multiply-add operations (MAC), number of parameters, and model size. I have seen some similar SO questions (here and here), and the latter suggests using Netscope CNN Analyzer. This tool lets me calculate most of what I need just by inputting my Caffe network definition.

However, the number of multiply-add operations reported in papers and around the internet for some architectures doesn't match what Netscope outputs, whereas for other architectures it does match. I'm always comparing either FLOPs or MAC with the macc column in Netscope, but there is a ~10x factor that I'm missing somewhere (see the table below for more detail).

Architecture    MAC (paper/internet)    macc column in Netscope
VGG 16          ~15.5G                  ~157G
GoogLeNet       ~1.55G                  ~16G

References: GoogLeNet macc number and VGG16 macc number in Netscope.
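To put a number on the gap, here is a quick sketch that divides the Netscope figure by the published one for each row of the table. The dictionary names are just mine for illustration, and the values are the approximate ones from the table, so the ratios are only approximate too.

```python
# Approximate values copied from the table above, in giga-MACs.
paper_gmacs = {"VGG 16": 15.5, "GoogLeNet": 1.55}
netscope_gmacs = {"VGG 16": 157.0, "GoogLeNet": 16.0}

# Ratio of the Netscope macc column to the published MAC count.
ratios = {name: netscope_gmacs[name] / paper_gmacs[name] for name in paper_gmacs}

for name, ratio in ratios.items():
    print(f"{name}: Netscope reports about {ratio:.1f}x the published count")
```

Both ratios come out slightly above 10, so the discrepancy looks like a constant factor rather than a per-layer counting difference.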

Could anybody who has used this tool point out what mistake I'm making when reading the Netscope output?

asked Jun 12 '17 19:06

rafaspadilha

1 Answer

I've found what was causing the discrepancy between Netscope and the numbers I'd found in papers. Most preset architectures in Netscope use a batch size of 10 (this is the case for VGG and GoogLeNet, for example), hence the x10 factor multiplying the number of multiply-add operations.
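To illustrate how the batch dimension scales the count, here is a minimal sketch that counts multiply-adds for a single convolution layer. The helper names (`conv_output_size`, `conv_macs`) are my own and not part of Netscope or Caffe, and I'm assuming the usual counting convention (one MAC per kernel weight per output element, biases ignored). The example layer is VGG-16's first convolution: 64 filters of 3x3 with pad 1 on a 3x224x224 input.

```python
def conv_output_size(size, kernel, stride=1, pad=0):
    """Spatial output size of a convolution (floor division, as in Caffe)."""
    return (size + 2 * pad - kernel) // stride + 1


def conv_macs(batch, in_channels, height, width, out_channels,
              kernel, stride=1, pad=0, groups=1):
    """Multiply-adds for one conv layer over a whole batch (biases ignored)."""
    out_h = conv_output_size(height, kernel, stride, pad)
    out_w = conv_output_size(width, kernel, stride, pad)
    # Each output element needs (in_channels / groups) * kernel * kernel MACs.
    macs_per_image = (out_channels * out_h * out_w
                      * (in_channels // groups) * kernel * kernel)
    return batch * macs_per_image


# First VGG-16 convolution, counted for one image and for a batch of 10.
single = conv_macs(1, 3, 224, 224, 64, kernel=3, pad=1)
batched = conv_macs(10, 3, 224, 224, 64, kernel=3, pad=1)
print(single, batched, batched // single)
```

Since every layer's count is proportional to the batch size, dividing the Netscope total by the batch size in the network definition (10 for these presets) should give a figure comparable to the per-image numbers quoted in papers.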

answered Sep 18 '22 20:09

rafaspadilha

Tags: flops, deep-learning, caffe, conv-neural-network
