For tf.layers.dropout(), the documentation for the training argument is not clear to me.
The documentation states:
training: Either a Python boolean, or a TensorFlow boolean scalar tensor
(e.g. a placeholder). Whether to return the output in training mode
(apply dropout) or in inference mode (return the input untouched).
My interpretation is that the dropout will be applied depending on whether training = True or training = False. However, it's not clear to me whether True or False applies the dropout (i.e. which one is training mode). Given that this is an optional argument, I expected that tf.layers.dropout() would apply dropout by default, but the default is False, and intuitively training=False would suggest that the default is not training mode.
It appears that in order for tf.layers.dropout() to actually apply dropout, one would need something like:
tf.layers.dropout(input, 0.5, training=(mode == Modes.TRAIN))
This is not very obvious from the documentation, since training is an optional argument. Does this appear to be the correct way to use tf.layers.dropout? And why is the training flag not simply tied to Modes.TRAIN by default, to be adjusted for other cases as needed? The default of training=False seems very misleading.
The Dropout layer randomly sets input units to 0 with a frequency of rate at each step during training time, which helps prevent overfitting. Inputs not set to 0 are scaled up by 1/(1 - rate) such that the sum over all inputs is unchanged.
(trainable does not affect the layer's behavior, as Dropout does not have any variables/weights that can be frozen during training.)
rate: Float between 0 and 1. Fraction of the input units to drop.
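To make the scaling concrete, here is a minimal sketch, assuming TensorFlow 2.x and tf.keras.layers.Dropout (which documents the behavior quoted above); the input values are arbitrary:

import tensorflow as tf

layer = tf.keras.layers.Dropout(rate=0.5)
x = tf.ones((1, 10))

# training=True: roughly half the units are zeroed; the survivors become
# 1 / (1 - 0.5) = 2.0, so the expected sum over all inputs is unchanged.
print(layer(x, training=True))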
Yes, they have the same functionality: dropout as a parameter is applied before the linear transformation of that layer (the multiplication by the weights and the addition of the bias), whereas Dropout as a layer can also be used elsewhere, for instance before an activation layer (see the sketch after the quoted documentation below).
Dropout consists of randomly setting a fraction rate of input units to 0 at each update during training time, which helps prevent overfitting. The units that are kept are scaled by 1 / (1 - rate), so that their sum is unchanged at training time and inference time.
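As an illustration of that distinction, here is a hedged tf.keras sketch (the layer sizes and rates are arbitrary): dropout supplied as a constructor parameter of a layer such as LSTM, versus Dropout as a standalone layer placed before an activation.

import tensorflow as tf

# Dropout as a parameter: applied by the LSTM to the linear
# transformation of its inputs.
rnn = tf.keras.layers.LSTM(32, dropout=0.5)

# Dropout as a standalone layer: here inserted before the activation.
mlp = tf.keras.Sequential([
    tf.keras.layers.Dense(64),
    tf.keras.layers.Dropout(0.5),
    tf.keras.layers.Activation("relu"),
])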
Your interpretation of dropout() and its training argument is correct. However, an automatic Modes.TRAIN check as you suggest is impossible: a mode is usually tied to an Estimator's model_fn() as an optional parameter, and Estimators constitute a higher-level abstraction that is not required in a TensorFlow model.
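For completeness, here is a minimal sketch of the usual wiring, assuming the TF 1.x Estimator API (the feature key "x", the layer sizes, and the loss are illustrative only): the mode arrives as a model_fn parameter, so the dropout call is the natural place to compare it against ModeKeys.TRAIN.

import tensorflow as tf

def model_fn(features, labels, mode):
    net = tf.layers.dense(features["x"], 64, activation=tf.nn.relu)
    # Dropout fires only while the Estimator is in TRAIN mode; in EVAL
    # and PREDICT modes the input passes through untouched.
    net = tf.layers.dropout(net, rate=0.5,
                            training=(mode == tf.estimator.ModeKeys.TRAIN))
    logits = tf.layers.dense(net, 10)

    if mode == tf.estimator.ModeKeys.PREDICT:
        return tf.estimator.EstimatorSpec(mode, predictions={"logits": logits})

    loss = tf.losses.sparse_softmax_cross_entropy(labels=labels, logits=logits)
    train_op = tf.train.AdamOptimizer().minimize(
        loss, global_step=tf.train.get_global_step())
    return tf.estimator.EstimatorSpec(mode, loss=loss, train_op=train_op)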
As to why TensorFlow designed the API with a False default value, we can only speculate. One explanation is that the layers abstraction as a whole was intended to default to inference mode, which would explain the default value of dropout()'s training argument.
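You can verify that default quickly; the following sketch assumes TF 1.x graph mode. With no training argument, tf.layers.dropout() behaves as the identity.

import tensorflow as tf

x = tf.ones((1, 4))
y_default = tf.layers.dropout(x, rate=0.5)                # training=False
y_train = tf.layers.dropout(x, rate=0.5, training=True)   # dropout applied

with tf.Session() as sess:
    print(sess.run(y_default))  # [[1. 1. 1. 1.]] -- input untouched
    print(sess.run(y_train))    # mix of 0.0 and 2.0 (scaled by 1/(1-rate))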