I am developing a model using the Nadam optimizer. I was wondering if there is a way to switch to SGD during training if the validation loss does not decrease for two epochs.
Would something like this work?

model.compile(optimizer='Adam', ...)
model.fit(X, y, epochs=100, callbacks=[EarlyStoppingCallback])

# now switch to SGD and finish training
model.compile(optimizer='SGD', ...)
model.fit(X, y, epochs=10)

Or would the second call to compile overwrite all the variables (i.e. do something like tf.initialize_all_variables())?
(It's actually a follow-up question, but I'm writing this as an answer because Stack Overflow does not allow code in comments.)
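For the record: compile() does not touch the layer weights, so recompiling with a new optimizer keeps everything the model has learned; only the optimizer's own state (e.g. momentum slots) starts from scratch. (tf.initialize_all_variables() is a deprecated TF1 API and is not involved here.) A minimal sketch to check this yourself, assuming a throwaway toy model and random data:

import numpy as np
import tensorflow as tf

# hypothetical toy model, just to demonstrate the behavior
model = tf.keras.Sequential([tf.keras.layers.Dense(4, input_shape=(8,)),
                             tf.keras.layers.Dense(1)])
model.compile(optimizer='nadam', loss='mse')
model.fit(np.random.rand(32, 8), np.random.rand(32, 1), epochs=1, verbose=0)

before = [w.copy() for w in model.get_weights()]
model.compile(optimizer='sgd', loss='mse')  # switch optimizers
after = model.get_weights()

# the layer weights survive the second compile unchanged
assert all(np.array_equal(b, a) for b, a in zip(before, after))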
You can create an EarlyStopping callback that will stop the training, and in this callback, call a function that changes your optimizer and fits again.

The following callback will monitor the validation loss (val_loss) and stop training after two epochs (patience) without an improvement greater than min_delta.
from tensorflow.keras.callbacks import EarlyStopping

min_delta = 0.000000000001
stopper = EarlyStopping(monitor='val_loss', min_delta=min_delta, patience=2)
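On its own, you would simply pass this callback to fit (a hypothetical call; X, y and the validation split are placeholders):

model.fit(X, y, validation_split=0.2, epochs=100, callbacks=[stopper])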
But to add an extra action once training has stopped, we can extend this callback and override the on_train_end method:
class OptimizerChanger(EarlyStopping):
    def __init__(self, on_train_end, **kwargs):
        self.do_on_train_end = on_train_end
        super(OptimizerChanger, self).__init__(**kwargs)

    def on_train_end(self, logs=None):
        # the parent method takes only `logs`, not an extra `self`
        super(OptimizerChanger, self).on_train_end(logs)
        self.do_on_train_end()
And here is the custom function to call when the model finishes training:
def do_after_training():
    # warning: this creates a new optimizer and, at the beginning,
    # it might give you worse training performance than before
    model.compile(optimizer='SGD', loss=..., metrics=...)
    model.fit(...)
Now let's use the callbacks:
changer = OptimizerChanger(on_train_end=do_after_training,
                           monitor='val_loss',
                           min_delta=min_delta,
                           patience=2)
model.fit(..., ..., callbacks=[changer])
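Putting it all together, here is a self-contained sketch (the model, data, and loss are placeholders I made up, not from the question):

import numpy as np
import tensorflow as tf
from tensorflow.keras.callbacks import EarlyStopping

X = np.random.rand(200, 8)
y = np.random.rand(200, 1)

model = tf.keras.Sequential([
    tf.keras.layers.Dense(16, activation='relu', input_shape=(8,)),
    tf.keras.layers.Dense(1),
])

class OptimizerChanger(EarlyStopping):
    def __init__(self, on_train_end, **kwargs):
        self.do_on_train_end = on_train_end
        super(OptimizerChanger, self).__init__(**kwargs)

    def on_train_end(self, logs=None):
        super(OptimizerChanger, self).on_train_end(logs)
        self.do_on_train_end()

def do_after_training():
    # recompiling swaps the optimizer but keeps the learned weights
    model.compile(optimizer='SGD', loss='mse')
    model.fit(X, y, validation_split=0.2, epochs=10, verbose=0)

changer = OptimizerChanger(on_train_end=do_after_training,
                           monitor='val_loss',
                           min_delta=1e-12,
                           patience=2)

model.compile(optimizer='nadam', loss='mse')
model.fit(X, y, validation_split=0.2, epochs=100, verbose=0,
          callbacks=[changer])

Note that on_train_end fires whenever the first fit ends, so the switch to SGD also happens if all 100 epochs complete without early stopping.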