Behavior of Dropout layers in test / training phase

Question

According to the Keras documentation dropout layers show different behaviors in training and test phase:

Note that if your model has a different behavior in training and testing phase (e.g. if it uses Dropout, BatchNormalization, etc.), you will need to pass the learning phase flag to your function:

Unfortunately, nobody talks about the actual differences. Why should dropout behave differently in test phase? I expect the layer to set a certain amount of neurons to 0. Why should this behavior depend on the training/test phase?

Wilmar van Ommeren · Accepted Answer

Dropout is used in the training phase to reduce the chance of overfitting. As you mention this layer deactivates certain neurons. The model will become more insensitive to weights of other nodes. Basically with the dropout layer the trained model will be the average of many thinned models. Check a more detailed explanation here

However, in when you apply your trained model you want to use the full power of the model. You want to use all neurons in the trained (average) network to get the highest accuracy.

Behavior of Dropout layers in test / training phase

Tags:

neural-network

tensorflow

keras

keras-layer

theano

null

1 Answers

Wilmar van Ommeren

Recent Activity

Donate For Us

Behavior of Dropout layers in test / training phase

Tags:

neural-network

tensorflow

keras

keras-layer

theano

null

1 Answers

Wilmar van Ommeren

Related questions

Recent Activity

Donate For Us