Is a bias necessary in a convolution layer?

I'm building a CNN + ensemble model to classify images with TensorFlow in Python. I crawled dog and cat images from Google Images, resized them to 126 * 126 pixels, converted them to grayscale, and labeled dogs 0 and cats 1. The CNN has 5 conv layers and 2 fc layers. The model uses He initialization, PReLU, max-pooling, dropout, and Adam. After parameter tuning I added early stopping; the model trained for 65~70 epochs and finished with 92.5~92.7% accuracy.

After training finished, I wanted to change my CNN model to a VGG network. When I checked my CNN parameters, I was shocked to find that I had not added a bias to the conv layers: the 2 fc layers had biases, but the 5 conv layers did not. So I added biases to the 5 conv layers, but then my model could not learn. The cost increased to infinity.

Is a bias not necessary in deep convolution layers?

882

asked Jul 17 '17 01:07

배준호

1 Answer

How did you add your bias to the convolutional layer? There are two ways to do this: tied biases, which share one bias per kernel, and untied biases, which use one bias per kernel and output position. Also read this.
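To make the difference concrete, here is a minimal plain-Python sketch of the two schemes for a single kernel. It is an illustration only, not how TensorFlow implements it; the helper names (conv2d_valid, add_tied_bias, add_untied_bias), the input values, and the bias values are all made up for the example.

```python
def conv2d_valid(image, kernel):
    """Plain 2-D cross-correlation, 'valid' padding, stride 1 (illustrative helper)."""
    kh, kw = len(kernel), len(kernel[0])
    out_h = len(image) - kh + 1
    out_w = len(image[0]) - kw + 1
    return [
        [
            sum(image[i + di][j + dj] * kernel[di][dj]
                for di in range(kh) for dj in range(kw))
            for j in range(out_w)
        ]
        for i in range(out_h)
    ]


def add_tied_bias(feature_map, bias):
    """Tied: one scalar bias shared by every output position of this kernel."""
    return [[v + bias for v in row] for row in feature_map]


def add_untied_bias(feature_map, bias_map):
    """Untied: a separate bias for every output position of this kernel."""
    return [[v + b for v, b in zip(row, bias_row)]
            for row, bias_row in zip(feature_map, bias_map)]


# Made-up 4x4 input and 2x2 kernel, chosen only to keep the numbers small.
image = [[1, 2, 3, 0],
         [0, 1, 2, 3],
         [3, 0, 1, 2],
         [2, 3, 0, 1]]
kernel = [[1, 0],
          [0, 1]]

fmap = conv2d_valid(image, kernel)             # 3x3 feature map
tied = add_tied_bias(fmap, 0.5)                # 1 bias parameter for this kernel

untied_bias = [[0.1 * (i * 3 + j) for j in range(3)] for i in range(3)]
untied = add_untied_bias(fmap, untied_bias)    # 3 * 3 = 9 bias parameters

tied_params = 1
untied_params = len(untied_bias) * len(untied_bias[0])
print(tied_params, untied_params)
```

With tied biases the bias vector has one entry per kernel (output channel); with untied biases it has the full shape of the output feature maps, so the parameter count grows with the spatial size of the output.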

Regarding your question of whether they are necessary, the answer is no. Biases in convolutional layers increase the capacity of your model, making it theoretically able to represent more complex data. If, however, your model already has enough capacity for the task, they are not necessary.

An example is this implementation of the 152-layer ResNet architecture, where the convolution layers have no bias. Instead, the bias is added in the subsequent batch normalization layers.
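The reason a conv bias becomes redundant in front of batch normalization can be checked numerically. The sketch below is plain Python with made-up activations for one channel; batch_norm is a simplified, hypothetical helper (training-mode statistics, scalar gamma and beta), not the TensorFlow implementation. Because the batch mean is subtracted, any constant added to the channel cancels out, and the learned shift beta plays the role of the bias.

```python
import math


def batch_norm(values, gamma=1.0, beta=0.0, eps=1e-5):
    """Normalize one channel over the batch, then scale by gamma and shift by beta."""
    mean = sum(values) / len(values)
    var = sum((v - mean) ** 2 for v in values) / len(values)
    return [gamma * (v - mean) / math.sqrt(var + eps) + beta for v in values]


# Made-up outputs of one conv channel across a batch of 5 examples.
conv_out = [0.2, -1.3, 0.7, 2.1, -0.4]
conv_bias = 3.0  # hypothetical bias the conv layer could have added

without_bias = batch_norm(conv_out, beta=0.5)
with_bias = batch_norm([v + conv_bias for v in conv_out], beta=0.5)

# The constant shift is removed by the mean subtraction, so both results match
# up to floating-point rounding.
max_diff = max(abs(a - b) for a, b in zip(without_bias, with_bias))
print(max_diff)
```

In tf.keras this pattern is usually written as a Conv2D layer with use_bias=False followed by a BatchNormalization layer.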

166

answered Dec 05 '22 08:12

Djib2011


Tags: tensorflow, conv-neural-network, bias-neuron
