When using something like:
from tensorflow.keras.callbacks import EarlyStopping, ModelCheckpoint

callbacks = [
    EarlyStopping(patience=15, monitor='val_loss', min_delta=0, mode='min'),
    ModelCheckpoint('best-weights.h5', monitor='val_loss', save_best_only=True, save_weights_only=True),
]
model.fit(..., callbacks=callbacks)
y_pred = model.predict(x_test)
am I doing the prediction with the best weights found during training, or is the model using the last weights (which may not be the best ones)?
So, is the above a safe approach, or should I load best-weights.h5 into the model even if the predictions are done right after training?
We can use the Keras callback keras.callbacks.ModelCheckpoint() to save the model at its best-performing epoch.
You can use early stopping to stop the training and ModelCheckpoint to save models while training. In most of my cases, the best model shows up around the epoch at which early stopping triggers.
The EarlyStopping class stops training when a monitored metric has stopped improving. Assuming the goal of training is to minimize the loss, the metric to be monitored would be 'loss' and the mode would be 'min'.
If save_best_only=True, it only saves when the model is considered the "best", and the latest best model according to the quantity monitored will not be overwritten. If filepath doesn't contain formatting options like {epoch}, then filepath will be overwritten by each new better model.
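For illustration, here is a minimal sketch (assuming a tf.keras 2.x-style setup as in the question, with a toy model and random placeholder data) contrasting the two filepath styles: a fixed filepath that gets overwritten by each new best model, and a formatted filepath that keeps one file per improving epoch:

import numpy as np
from tensorflow.keras.callbacks import ModelCheckpoint
from tensorflow.keras.layers import Dense, Input
from tensorflow.keras.models import Sequential

# Toy model and random placeholder data, just to make the sketch runnable.
x_train, y_train = np.random.rand(200, 8), np.random.rand(200, 1)
model = Sequential([Input(shape=(8,)), Dense(16, activation='relu'), Dense(1)])
model.compile(optimizer='adam', loss='mse')

callbacks = [
    # Fixed filepath: overwritten each time val_loss improves,
    # so only the single best set of weights remains on disk.
    ModelCheckpoint('best-weights.h5', monitor='val_loss',
                    save_best_only=True, save_weights_only=True),
    # Formatted filepath: one file is kept per improving epoch.
    ModelCheckpoint('weights-{epoch:02d}-{val_loss:.4f}.h5', monitor='val_loss',
                    save_best_only=True, save_weights_only=True),
]

model.fit(x_train, y_train, validation_split=0.2, epochs=10, callbacks=callbacks)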
After the training is stopped by the EarlyStopping callback, the current model may not be the one with the best (highest/lowest) monitored quantity. For this reason, a new argument, restore_best_weights, was introduced in the Keras 2.2.3 release for the EarlyStopping callback, in case you would like to restore the best weights:
restore_best_weights: whether to restore model weights from the epoch with the best value of the monitored quantity. If False, the model weights obtained at the last step of training are used.
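Applied to the question's setup, a minimal sketch (again with a toy model and random placeholder data) looks like this; with restore_best_weights=True, model.predict() right after fit() already uses the weights from the best epoch:

import numpy as np
from tensorflow.keras.callbacks import EarlyStopping
from tensorflow.keras.layers import Dense, Input
from tensorflow.keras.models import Sequential

# Toy model and random placeholder data for the sake of the example.
x_train, y_train = np.random.rand(200, 8), np.random.rand(200, 1)
x_test = np.random.rand(20, 8)
model = Sequential([Input(shape=(8,)), Dense(16, activation='relu'), Dense(1)])
model.compile(optimizer='adam', loss='mse')

early_stop = EarlyStopping(monitor='val_loss', patience=15, min_delta=0,
                           mode='min', restore_best_weights=True)
model.fit(x_train, y_train, validation_split=0.2, epochs=200,
          callbacks=[early_stop])

# Because restore_best_weights=True, the in-memory model now holds the
# weights of the best epoch, so predicting right away is safe.
y_pred = model.predict(x_test)

Note that in some Keras versions the best weights are only restored when early stopping actually triggers; if training runs through all epochs without stopping early, the last weights are kept.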
The EarlyStopping callback doesn't save anything on its own (you can double-check this by looking at its source code: https://github.com/keras-team/keras/blob/master/keras/callbacks.py#L458). Thus your code saves the last model that achieved the best result on the dev set before training was stopped by the early stopping callback. I would say that, if you are saving only the best model according to the dev set, it is not very useful to also have an early stopping callback, unless you want to save time and are sure enough that you are not going to find any better model by continuing the training.
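In other words, with the question's original setup (ModelCheckpoint only, no restore_best_weights) the model in memory keeps the last weights, and the safe approach is to reload the checkpoint before predicting. A minimal sketch, reusing the model, x_test, and filepath from the question:

# After model.fit(...) returns, the model in memory holds the *last* weights.
# Reload the file written by ModelCheckpoint to get the *best* weights back.
model.load_weights('best-weights.h5')
y_pred = model.predict(x_test)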