Why does TensorFlow Object Detection disable regularization for Faster R-CNN?

In the TensorFlow Object Detection API sample configuration files, all of the Faster R-CNN configuration files disable the regularization term as follows:

regularizer {
  l2_regularizer {
    weight: 0.0
  }
}

This does not seem reasonable to me and looks very likely to cause overfitting. Is there any explanation for this setting? Thank you.

Brandon asked Nov 02 '17


People also ask

How do you speed up R-CNN?

I have summarized below the steps followed by the Faster R-CNN algorithm to detect objects in an image: take an input image and pass it to the ConvNet, which returns feature maps for the image; then apply a Region Proposal Network (RPN) to these feature maps to get object proposals.
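As a rough illustration of those two stages, here is a conceptual sketch in Python; every name in it is an illustrative placeholder, not a real TensorFlow API:

# Conceptual two-stage Faster R-CNN forward pass, following the steps above.
# All arguments are placeholder callables standing in for the real networks.
def faster_rcnn_forward(image, convnet, rpn, roi_pool, detection_head):
    feature_maps = convnet(image)             # 1. backbone ConvNet -> feature maps
    proposals = rpn(feature_maps)             # 2. RPN proposes candidate boxes
    rois = roi_pool(feature_maps, proposals)  # 3. crop each proposal to a fixed size
    return detection_head(rois)               # 4. classify and refine each box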

Is RetinaNet better than faster RCNN?

Compared to recent two-stage methods, RetinaNet achieves a 2.3 point gap above the top-performing Faster R-CNN model based on Inception-ResNet-v2-TDM.

Is Yolo more accurate than faster RCNN?

Results: The mean average precision (mAP) of Faster R-CNN reached 87.69%, but YOLO v3 had a significant advantage in detection speed, with a frame rate (FPS) more than eight times that of Faster R-CNN. This means that YOLO v3 can operate in real time with a high mAP of 80.17%.

What is the difference between faster RCNN and SSD?

The image detection accuracy with the SSD algorithm was 76.61%, while with the Faster R-CNN algorithm it was 99.52% on the evaluation dataset.


1 Answer

"Strong regularization such as maxout or dropout is applied to obtain the best results on this dataset. In this paper, we use no maxout/dropout and just simply impose regularization via deep and thin architectures by design, without distracting from the focus on the difficulties of optimization. But combining with stronger regularization may improve results, which we will study in the future." [He et. al, Deep Residual Learning for Image Recognition]

I think the regularization the authors refer to, which is applied directly within the ResNet architecture, comes from the batch norm layers sandwiched between every conv layer and its activation. While the authors don't say anything about the use of L2 regularization, their statement about maxout and dropout ought to apply: BN layers have the effect of regularizing the network without imposing an explicit penalty, so L2 regularization isn't necessary.
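As a rough sketch of that conv -> batch norm -> activation pattern (written against the Keras API; the filter count and kernel size are illustrative, not taken from any detection config):

import tensorflow as tf

# One ResNet-style building block: convolution, then batch norm, then the
# activation. The BN layer sandwiched between the conv and the ReLU is what
# provides the implicit regularization discussed above.
def conv_bn_relu(x, filters, training=False):
    x = tf.keras.layers.Conv2D(filters, 3, padding="same", use_bias=False)(x)
    x = tf.keras.layers.BatchNormalization()(x, training=training)
    return tf.keras.layers.ReLU()(x)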

That said, the option is there in case you want to try out stronger regularization.
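For example, setting a nonzero weight in the same block turns explicit weight decay back on (the 0.0001 below is just an illustrative value, not one taken from the sample configs):

regularizer {
  l2_regularizer {
    weight: 0.0001
  }
}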

DaveB answered Sep 20 '22