
Dropout behavior in Keras with rate=1 (dropping all input units) not as expected

import keras

input0 = keras.layers.Input((32, 32, 3), name='Input0')
flatten = keras.layers.Flatten(name='Flatten')(input0)
relu1 = keras.layers.Dense(256, activation='relu', name='ReLU1')(flatten)
dropout = keras.layers.Dropout(1., name='Dropout')(relu1)
softmax2 = keras.layers.Dense(10, activation='softmax', name='Softmax2')(dropout)
model = keras.models.Model(inputs=input0, outputs=softmax2, name='cifar')

Just to test whether dropout is working, I set the dropout rate to 1.0.

With all hidden units dropped, the model state should be frozen in each epoch, with no tuning of the parameters at all.

However, the training accuracy keeps growing, even though I drop all the hidden nodes (see the attached training plots).

What's wrong?
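For reference, one way to check directly whether the Dropout layer really zeroes its output in training mode is to probe it with a backend function. A minimal sketch, assuming the TF-backed Keras 2 API of the time (the probe function and the random input are illustrative only):

import numpy as np
from keras import backend as K

# Run the graph up to the Dropout output with the learning
# phase forced to 1 (i.e. training mode, where dropout is active).
probe = K.function([input0, K.learning_phase()], [dropout])

out = probe([np.random.rand(1, 32, 32, 3).astype('float32'), 1])[0]
print(out.sum())  # expected: 0.0 if all units are really dropped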

asked Jan 20 '18 by Daniel H. Leung


1 Answer

Nice catch!

It would seem that the issue linked by Dennis Soemers in a comment above, Keras Dropout layer changes results with dropout=0.0, has not been fully resolved, and Keras somehow blunders when faced with a dropout rate of 1.0 [see UPDATE at the end of the post]. Modifying the model shown in the Keras MNIST MLP example:

from keras.models import Sequential
from keras.layers import Dense, Dropout
from keras.optimizers import RMSprop

# x_train, y_train, x_test, y_test, and num_classes prepared
# exactly as in the Keras MNIST MLP example
model = Sequential()
model.add(Dense(512, activation='relu', use_bias=False, input_shape=(784,)))
model.add(Dropout(1.0))
model.add(Dense(512, activation='relu'))
model.add(Dropout(1.0))
model.add(Dense(num_classes, activation='softmax'))

model.compile(loss='categorical_crossentropy',
              optimizer=RMSprop(),
              metrics=['accuracy'])

model.fit(x_train, y_train,
          batch_size=128,
          epochs=3,
          verbose=1,
          validation_data=(x_test, y_test))

indeed gives a model that is being trained, despite all neurons supposedly being dropped, just as you report:

Train on 60000 samples, validate on 10000 samples
Epoch 1/3
60000/60000 [==============================] - 15s 251us/step - loss: 0.2180 - acc: 0.9324 - val_loss: 0.1072 - val_acc: 0.9654
Epoch 2/3
60000/60000 [==============================] - 15s 246us/step - loss: 0.0831 - acc: 0.9743 - val_loss: 0.0719 - val_acc: 0.9788
Epoch 3/3
60000/60000 [==============================] - 15s 245us/step - loss: 0.0526 - acc: 0.9837 - val_loss: 0.0997 - val_acc: 0.9723

Nevertheless, if you try a dropout rate of 0.99, i.e. replacing the two dropout layers in the above model with

model.add(Dropout(0.99))

then effectively no training takes place, as should be the case:

Train on 60000 samples, validate on 10000 samples
Epoch 1/3
60000/60000 [==============================] - 16s 265us/step - loss: 3.4344 - acc: 0.1064 - val_loss: 2.3008 - val_acc: 0.1136
Epoch 2/3
60000/60000 [==============================] - 16s 261us/step - loss: 2.3342 - acc: 0.1112 - val_loss: 2.3010 - val_acc: 0.1135
Epoch 3/3
60000/60000 [==============================] - 16s 266us/step - loss: 2.3167 - acc: 0.1122 - val_loss: 2.3010 - val_acc: 0.1135

UPDATE (after a comment by Yu-Yang under the OP): It seems to be a design choice (dead link now, see update below) not to do anything when the dropout rate is equal to either 0 or 1; the Dropout class becomes effective only when

if 0. < self.rate < 1.

Nevertheless, as already commented, a warning message in such cases (and a relevant note in the documentation) would arguably be a good idea.
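As a stopgap, one could wrap the layer oneself. A minimal sketch (not part of Keras; the CheckedDropout name is mine), written against the Keras 2 API of the time:

import warnings

from keras.layers import Dropout

class CheckedDropout(Dropout):
    """Dropout that warns when the rate makes the layer a silent no-op."""

    def __init__(self, rate, **kwargs):
        if not 0. < rate < 1.:
            warnings.warn('Dropout rate %s is outside (0, 1); '
                          'the layer will do nothing.' % rate)
        super(CheckedDropout, self).__init__(rate, **kwargs)

Used as a drop-in replacement for Dropout, this would have flagged both the rate=1.0 and the rate=0.0 cases at model-construction time.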

UPDATE (July 2021): There have been some changes since Jan 2018, when this answer was written; under the hood, Keras now calls tf.nn.dropout, which does not seem to allow for dropout=1 (source).
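A quick way to see this for yourself; a sketch only, since the exact error type and message may vary across TF versions:

import tensorflow as tf

x = tf.ones((4, 4))
try:
    tf.nn.dropout(x, rate=1.0)  # recent TF requires rate in [0, 1)
except ValueError as e:
    print(e)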

answered Oct 12 '22 by desertnaut