Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

Does changing a token name in an image caption model affect performance?

If I train an image caption model then stop to rename a few tokens:

  1. Should I train the model from scratch?
  2. Or can I reload the model and continue training from the last epoch with the updated vocabulary?

Will either approach effect model accuracy/performance differently?

like image 984
Paul Gwamanda Avatar asked Jun 02 '20 13:06

Paul Gwamanda


1 Answers

I would go for option 2.

When training the model from scratch, you are initializing the model's weights randomly and then you fit them based on your problem. However, if, instead of using random weights, you use weights that have already been trained for a similar problem, you may decrease the convergence time. This option is kind similar to the idea of transfer learning.

like image 90
Pedrolarben Avatar answered Oct 10 '22 16:10

Pedrolarben