The ReduceLROnPlateau callback in Keras seems to be an interesting tool to use in training models. But I could not really figure out exactly what the cooldown parameter of ReduceLROnPlateau means.
Here is what the documentation says:
First, the interface of the function:
keras.callbacks.ReduceLROnPlateau(monitor='val_loss',
factor=0.1,
patience=10,
verbose=0,
mode='auto',
min_delta=0.0001,
cooldown=0,
min_lr=0)
ReduceLROnPlateau: Models often benefit from reducing the learning rate by a factor of 2-10 once learning stagnates. This callback monitors a quantity and if no improvement is seen for a 'patience' number of epochs, the learning rate is reduced.
cooldown: number of epochs to wait before resuming normal operation after lr has been reduced.
The explanation does not really make it clear to me. Is it meant here that:
- Say that lr=A, and the learning rate is reduced if the relevant monitored metric does not improve for patience number of epochs. (And say that lr=B after reducing it.)
- Then the learning rate is set back to its first value (lr=A again) after cooldown number of epochs.
Is my understanding correct? If not, what is the real function of the cooldown parameter here?
PS. When I google it, I see some examples where people set the cooldown parameter to zero, which makes me think that my perception of this parameter is wrong.
In the new Keras API you can use a more general version of the schedule function, which takes two arguments, epoch and lr. From the docs: schedule: a function that takes an epoch index as input (integer, indexed from 0) and the current learning rate, and returns a new learning rate as output (float).
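A small illustration of such a two-argument schedule function (presumably for keras.callbacks.LearningRateScheduler; the loop below just simulates what the callback would do each epoch, and the particular schedule is made up for the example):

```python
def schedule(epoch, lr):
    """Two-argument schedule: epoch index (from 0) and current learning
    rate in, new learning rate out."""
    if epoch < 10:
        return lr          # keep the initial rate for the first 10 epochs
    return lr * 0.9        # then decay by 10% per epoch

# Simulate the callback applying the schedule once per epoch:
lr = 0.01
history = []
for epoch in range(12):
    lr = schedule(epoch, lr)
    history.append(lr)

print(history)  # constant until epoch 10, then decaying
```

In real training code you would pass the function to the callback instead, e.g. keras.callbacks.LearningRateScheduler(schedule), and hand that to model.fit via callbacks=[...].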
True, the description does not state it clearly. What it means is that if you set a cooldown, you have to wait before resuming normal operation (i.e. before the callback begins to monitor again whether there is any improvement in the monitored metric over patience epochs). Note that the learning rate is never set back to its previous value; cooldown only pauses the monitoring.
For example, say cooldown=5. After the learning rate is reduced, the algorithm waits 5 epochs before starting to monitor the metric again. So if there is no improvement in the metric and patience=10, the learning rate will be reduced again after 15 epochs.
You can confirm this by looking at the corresponding code.
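The bookkeeping can be sketched in plain Python (a simplified sketch of the callback's logic, not the actual Keras source; reduce_lr_epochs is a made-up helper name):

```python
def reduce_lr_epochs(losses, patience=10, cooldown=0):
    """Return the epoch indices at which the learning rate would be
    reduced, given a sequence of per-epoch monitored values."""
    best = float("inf")
    wait = 0              # consecutive epochs without improvement
    cooldown_counter = 0  # epochs of cooldown still remaining
    reductions = []
    for epoch, loss in enumerate(losses):
        if cooldown_counter > 0:
            # During cooldown, patience counting is paused.
            cooldown_counter -= 1
            wait = 0
        if loss < best:
            best = loss
            wait = 0
        elif cooldown_counter == 0:
            wait += 1
            if wait >= patience:
                reductions.append(epoch)
                cooldown_counter = cooldown
                wait = 0
    return reductions

# A metric that never improves after the first epoch:
flat = [1.0] * 30
print(reduce_lr_epochs(flat, patience=10, cooldown=0))
print(reduce_lr_epochs(flat, patience=10, cooldown=5))
```

With cooldown=0 the reductions come every patience epochs; with cooldown=5 the monitoring pause pushes the second reduction several epochs later.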