Difference between DepthwiseConv2D and SeparableConv2D

Tags:

keras

From the document, I know SeparableConv2D is a combination of depthwise and pointwise operation. However, when I call

SeparableConv2D(100, 5, input_shape=(416,416,10) 

# total parameters is 1350

model.add(DepthwiseConv2D(5, input_shape=(416,416,10)))
model.add(Conv2D(100, 1))

# total parameters is 1360

Does it mean SeparableConv2D does not use bias in depthwise phase by default?

Thanks.

981

asked Jun 26 '19 02:06

1 Answers

Correct, checking the source code (I did this for tf.keras but I suppose it is the same for standalone keras) shows that in SeparableConv2D, the separable convolution works using only filters, no biases, and a single bias vector is added at the end. The second version, on the other hand, has biases for both DepthwiseConv2D and Conv2D.

Given that convolution is a linear operation and you are using no non-linearity inbetween depthwise and 1x1 convolution, I would suppose that having two biases is unnecessary in this case, similar to how you don't use biases in a layer that is followed by batch normalization, for example. As such, the extra 10 parameters wouldn't actually improve the model (nor should they really hurt either).

answered Oct 01 '22 13:10

xdurch0

Related questions
                            
                                Changing activation function of a keras layer w/o replacing whole layer
                            
                                0% accuracy with evaluate_generator but 75% accuracy during training with same data - what is going on?
                            
                                Issue of batch sizes when using custom loss functions in Keras
                            
                                Convert a saved .h5 file to a JSON file
                            
                                What is the utility of `Tensor` (as opposed to `EagerTensor`) in Tensorflow 2.0?
                            
                                Saving and loading multiple models with the same graph in TensorFlow Functional API
                            
                                Does changing a token name in an image caption model affect performance?
                            
                                Keras ImageDataGenerator Slow
                            
                                Keras LSTM training data format
                            
                                Keras: Tokenizer with fit_generator() on text data
                            
                                Keras multi-class prediction output is limited to one class
                            
                                Accessing gradient values of keras model outputs with respect to inputs
                            
                                Keras Tensorboard callback not writing images
                            
                                keras load_model raise error when executed a second time
                            
                                Siamese Network with LSTM for sentence similarity in Keras gives periodically the same result
                            
                                Why does get_weights return an empty list?
                            
                                Keras LSTM: dropout vs recurrent_dropout
                            
                                Meaning of batch_size in model.evaluate()
                            
                                Parallelizing keras models in R using doParallel
                            
                                ValueError: Unknown layer:name when loading a keras model

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Difference between DepthwiseConv2D and SeparableConv2D

Tags:

keras

michaelowenliu

People also ask

1 Answers

xdurch0

Recent Activity

Donate For Us