Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

How long will it take to train the VGG-16 model on IMAGENET using GOOGLE COLAB TPU?

Just curious, on how long will it take to train the VGG16 model on IMAGENET using GOOGLE COLAB TPU? If someone can explain me the calcuations they did to get to the answer, that would be great!

like image 494
Umair Javaid Avatar asked Oct 27 '19 19:10

Umair Javaid


People also ask

What is the accuracy of VGG16?

VGG16 is object detection and classification algorithm which is able to classify 1000 images of 1000 different categories with 92.7% accuracy. It is one of the popular algorithms for image classification and is easy to use with transfer learning.

How many classes are there in VGG16?

This model achieves 92.7% top-5 test accuracy on the ImageNet dataset which contains 14 million images belonging to 1000 classes.


2 Answers

It's very hard to accurately estimate the how long it'll take to train a model e2e. But assuming you're just looking for a very rough estimate, we can start off by noting that this ResNet50 implementation we have (code) runs to convergence (76%+ top1 accuracy trained on 90 epochs) in roughly 7.3 hours on a v2-8 TPU device. Given that VGG16 are close enough in step time (https://github.com/jcjohnson/cnn-benchmarks#cnn-benchmarks) I'd expect convergence for it to also be proportional to that. However, disclaimer that this is a very rough estimation and actual performance would also depend on how optimized the implementation is.

like image 185
jysohn Avatar answered Oct 16 '22 18:10

jysohn


Here is the official TPU example. Training VGG-16 on optimized tfrecord dataset with 2990 train images, IMAGE_SIZE = [331, 331], batch_size=128, 12 epochs takes 2m15sec. I think training with 1,281,167 ImageNet images will takes approximately 15 hours.

like image 3
balezz Avatar answered Oct 16 '22 19:10

balezz