Which parameters should be used for early stopping?

Tags: python, deep-learning, keras, conv-neural-network

I'm training a neural network for my project using Keras. Keras provides a callback for early stopping. Which parameters should I set so that early stopping keeps my neural network from overfitting?


asked May 11 '17 03:05

AizuddinAzman

2 Answers


Early stopping is basically stopping the training once your validation loss starts to increase (or, in other words, validation accuracy starts to decrease). According to the documentation, it is used as follows:

keras.callbacks.EarlyStopping(monitor='val_loss',
                              min_delta=0,
                              patience=0,
                              verbose=0, mode='auto')

The values depend on your setup (problem, batch size, etc.), but generally, to prevent overfitting, I would use the following:

Monitor the validation loss (you need to use cross-validation or at least train/test sets) by setting the monitor argument to 'val_loss'.
min_delta is the threshold for counting a change in loss at some epoch as an improvement. If the change in loss is below min_delta, it counts as no improvement. Better to leave it at 0, since we're interested in when the loss becomes worse.
The patience argument is the number of epochs to wait before stopping once your loss stops improving (starts to increase). This depends on your setup: if you use very small batches or a large learning rate, your loss will zig-zag (accuracy will be noisier), so set a larger patience. If you use large batches and a small learning rate, your loss will be smoother, so you can use a smaller patience. Either way, I'll set it to 2 to give the model more of a chance.
verbose decides what to print; leave it at the default (0).
The mode argument depends on the direction of your monitored quantity (is it supposed to be decreasing or increasing?). Since we monitor the loss, we can use min. But let's let Keras handle that for us and set it to auto.

So I would use something like this and experiment by plotting the loss with and without early stopping.

keras.callbacks.EarlyStopping(monitor='val_loss',
                              min_delta=0,
                              patience=2,
                              verbose=0, mode='auto')
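To make the settings above concrete, here is a minimal sketch of passing the callback to fit(). The synthetic data, the tiny model, the 20% validation split and the epochs=200 ceiling are all assumptions made for illustration, and the import assumes a TensorFlow 2.x install that exposes Keras as tensorflow.keras.

```python
import numpy as np
from tensorflow import keras  # assumption: Keras is available via TensorFlow 2.x

# Synthetic regression data, used only to make the sketch runnable.
rng = np.random.default_rng(0)
x = rng.normal(size=(500, 10)).astype("float32")
noise = rng.normal(scale=0.1, size=(500, 1))
y = (x.sum(axis=1, keepdims=True) + noise).astype("float32")

# A deliberately small model; the architecture is not the point here.
model = keras.Sequential([
    keras.Input(shape=(10,)),
    keras.layers.Dense(32, activation="relu"),
    keras.layers.Dense(1),
])
model.compile(optimizer="adam", loss="mse")

# Same parameter values as suggested in the answer.
es = keras.callbacks.EarlyStopping(monitor="val_loss",
                                   min_delta=0,
                                   patience=2,
                                   verbose=0, mode="auto")

# validation_split holds out part of the data so that 'val_loss' exists.
# epochs is only an upper bound; the callback may end training earlier.
history = model.fit(x, y,
                    validation_split=0.2,
                    epochs=200,
                    batch_size=32,
                    callbacks=[es],
                    verbose=0)

epochs_run = len(history.history["val_loss"])
print("epochs actually run:", epochs_run)
```

history.history["loss"] and history.history["val_loss"] hold one value per epoch, so they can be plotted to compare runs with and without the callback.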

In case it is unclear how callbacks work, I'll try to explain more. Once you call fit(..., callbacks=[es]) on your model, Keras calls predetermined methods on the given callback objects. These methods are named on_train_begin, on_train_end, on_epoch_begin, on_epoch_end, on_batch_begin and on_batch_end. The early stopping callback runs at the end of every epoch, compares the best monitored value with the current one, and stops training if the conditions are met (whether more epochs than the patience argument have passed since the best monitored value was observed, whether the change from the best value is bigger than min_delta, etc.).

As pointed out by @BrentFaust in the comments, training will continue until either the early stopping conditions are met or the epochs parameter of fit() is reached (the default was 10 when this was written; current Keras releases default to 1). Setting an early stopping callback will not make the model train beyond its epochs parameter. So calling fit() with a larger epochs value lets you benefit more from the early stopping callback.
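The callback mechanics and the epochs cap described above can be sketched in plain Python. This is a simplified stand-in, not the Keras source: the class EarlyStoppingSketch, the function run_training and the list of validation losses are hypothetical, and only the "lower is better" case (mode='min') is handled.

```python
class EarlyStoppingSketch:
    """Simplified illustration of the early-stopping logic (not Keras code)."""

    def __init__(self, monitor="val_loss", min_delta=0.0, patience=0):
        self.monitor = monitor
        self.min_delta = min_delta
        self.patience = patience
        self.best = float("inf")    # best monitored value seen so far
        self.wait = 0               # epochs since the last improvement
        self.stop_training = False

    def on_epoch_end(self, epoch, logs):
        current = logs[self.monitor]
        # An improvement must beat the best value by more than min_delta.
        if current < self.best - self.min_delta:
            self.best = current
            self.wait = 0
        else:
            self.wait += 1
            if self.wait >= self.patience:
                self.stop_training = True


def run_training(val_losses, callback, epochs):
    """Hypothetical training loop: one recorded validation loss per epoch."""
    epochs_run = 0
    for epoch in range(min(epochs, len(val_losses))):
        callback.on_epoch_end(epoch, {"val_loss": val_losses[epoch]})
        epochs_run += 1
        if callback.stop_training:
            break
    return epochs_run


# A made-up, slightly noisy validation-loss curve.
val_losses = [1.0, 0.8, 0.85, 0.7, 0.72, 0.75, 0.9, 0.6]

# patience=0 stops at the first epoch without improvement.
print(run_training(val_losses, EarlyStoppingSketch(patience=0), epochs=8))  # 3

# patience=2 tolerates the bump at epoch 3 and stops later.
print(run_training(val_losses, EarlyStoppingSketch(patience=2), epochs=8))  # 6

# epochs is a hard cap: training ends after 4 epochs even though
# the stopping condition was never met.
print(run_training(val_losses, EarlyStoppingSketch(patience=2), epochs=4))  # 4
```

The last call shows why a generous epochs value matters: with a small cap, the callback never gets the chance to decide anything.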

answered Oct 06 '22 22:10

umutto

Here's an example of EarlyStopping from another project, AutoKeras (https://autokeras.com/), an automated machine learning (AutoML) library. The library sets two EarlyStopping parameters: patience=10 and min_delta=1e-4.

https://github.com/keras-team/autokeras/blob/5e233956f32fddcf7a6f72a164048767a0021b9a/autokeras/engine/tuner.py#L170

The default quantity to monitor for both AutoKeras and Keras is val_loss:

https://github.com/keras-team/keras/blob/cb306b4cc446675271e5b15b4a7197efd3b60c34/keras/callbacks.py#L1748
https://autokeras.com/image_classifier/

answered Oct 06 '22 22:10

cannin

