What does initial_epoch in Keras mean?

I'm a little confused about the initial_epoch argument of the fit and fit_generator methods. Here is the doc:

initial_epoch: Integer. Epoch at which to start training (useful for resuming a previous training run).

I understand that it is not useful if you start training from scratch. It is useful if you have already trained on your dataset and want to improve accuracy or other metrics (correct me if I'm wrong). But I'm not sure what it really does.

So, after all this, I have 2 questions:

1. What does initial_epoch do, and what is it for?
2. When can I use initial_epoch?
   - When I change my dataset?
   - When I change the learning rate, optimizer, or loss function?
   - Both of them?

asked Sep 24 '18 09:09

ibrahimozgon

1 Answer

Some optimizers set internal values (e.g. the learning rate) based on the current epoch value, and you may also have (custom) callbacks that depend on the current epoch. The initial_epoch argument lets you specify the epoch value at which training starts.
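To make the epoch dependence concrete, here is a minimal sketch of an epoch-based step-decay learning-rate schedule, the kind of function an epoch-dependent callback might call. The name `step_decay` and all of its constants are hypothetical and chosen only for illustration; this is plain Python, not Keras code.

```python
def step_decay(epoch, base_lr=0.1, drop=0.5, epochs_per_drop=5):
    """Hypothetical schedule: multiply the learning rate by `drop`
    once every `epochs_per_drop` epochs (epoch is 0-based)."""
    return base_lr * (drop ** (epoch // epochs_per_drop))


# If training resumes after 10 epochs but the epoch counter restarts at 0,
# the schedule is evaluated at the wrong epoch and the learning rate
# returns to its starting value instead of continuing to decay.
lr_if_counter_restarts = step_decay(0)
lr_if_counter_continues = step_decay(10)

print(lr_if_counter_restarts, lr_if_counter_continues)
```

With these assumed constants, the two values differ by a factor of four, which is the kind of disruption that passing the correct starting epoch avoids.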

As stated in the documentation, this is mostly useful when you have trained your model for some epochs, say 10, saved it, and now want to load it and resume training for another 10 epochs without disrupting the state of epoch-dependent objects (e.g. the optimizer). In that case you would set initial_epoch=10 (i.e. the model has already been trained for 10 epochs) and epochs=20 (not 10, since epochs is the total number of epochs to reach, not the number of additional epochs). Everything then resumes as if you had trained the model for 20 epochs in a single training session.
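The epoch bookkeeping described above can be sketched with a toy training loop. `simulated_fit` is a hypothetical stand-in written for this illustration, not the Keras implementation; it only records which 0-based epoch indices an epoch-dependent callback would see.

```python
def simulated_fit(epochs, initial_epoch=0):
    """Toy stand-in for a training loop (NOT Keras itself).

    Runs from `initial_epoch` up to, but not including, `epochs`
    and returns the epoch indices that callbacks would observe.
    """
    seen = []
    for epoch in range(initial_epoch, epochs):
        # A real loop would train for one epoch here and pass `epoch`
        # to any epoch-dependent callbacks.
        seen.append(epoch)
    return seen


single_session = simulated_fit(epochs=20)

# Two sessions: train 10 epochs, save, then resume up to epoch 20.
first_run = simulated_fit(epochs=10)
resumed_run = simulated_fit(epochs=20, initial_epoch=10)

# Without initial_epoch, a second 10-epoch run would start counting at 0 again.
restarted_run = simulated_fit(epochs=10)

print(first_run + resumed_run == single_session)
print(restarted_run == first_run)
```

In Keras, the resumed session corresponds to loading the saved model and calling fit with epochs=20 and initial_epoch=10, so that 10 more epochs are run and they are numbered as a continuation of the first 10.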

However, note that when using the built-in optimizers of Keras you don't need initial_epoch for the optimizer's sake, since they store and update their state internally (without considering the current epoch value), and the optimizer state is stored along with the model when you save it.

answered Nov 23 '22 04:11

today
