How to choose which pre-trained weights to use for my model?

I am a beginner, and I am confused about how to choose a pre-trained model that will actually improve my own model.

I am trying to build a cat breed classifier using the pre-trained weights of another model. Say I use VGG16 trained on a digits dataset: will that improve the performance of my model? Or would it be better to train my model on my own dataset alone, without any other weights? Or will both end up the same, since the pre-trained weights are only a starting point?

Also, if I use the weights of a VGG16 trained on cat-vs-dog data as the starting point for my cat breed classifier, will that help improve the model?

474

asked Aug 06 '19 05:08

hR 312

2 Answers

Sane weight initialization

Which pre-trained weights to choose depends on the classes you wish to classify. Since you wish to classify cat breeds, use pre-trained weights from a classifier trained on a similar task. As mentioned in the other answers, the initial layers learn things like edges, horizontal or vertical lines, blobs, etc. As you go deeper, the model starts learning problem-specific features. So for generic tasks you can start from, say, ImageNet weights and then fine-tune them for the problem at hand.
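As a rough illustration of starting from ImageNet weights and then fine-tuning, here is a minimal Keras sketch. It assumes TensorFlow is installed; the function name `build_breed_classifier`, the 224x224 input size and the small pooling-plus-dense head are my own choices for the example, not something prescribed by the question.

```python
import tensorflow as tf


def build_breed_classifier(num_classes, weights="imagenet", input_shape=(224, 224, 3)):
    """Hypothetical helper: VGG16 convolutional base plus a new classifier head."""
    # Load the convolutional layers only; include_top=False drops the original
    # 1000-class ImageNet head so a new head can be attached.
    base = tf.keras.applications.VGG16(
        weights=weights, include_top=False, input_shape=input_shape
    )
    # Freeze the base so that, at first, only the new head is trained.
    base.trainable = False

    # Images are assumed to be preprocessed beforehand with
    # tf.keras.applications.vgg16.preprocess_input.
    inputs = tf.keras.Input(shape=input_shape)
    x = base(inputs, training=False)
    x = tf.keras.layers.GlobalAveragePooling2D()(x)
    outputs = tf.keras.layers.Dense(num_classes, activation="softmax")(x)

    model = tf.keras.Model(inputs, outputs)
    model.compile(
        optimizer=tf.keras.optimizers.Adam(learning_rate=1e-3),
        loss="sparse_categorical_crossentropy",
        metrics=["accuracy"],
    )
    return model
```

Calling `build_breed_classifier(num_classes=12)` would download the ImageNet weights on first use (12 is just a placeholder for the number of breeds). Once the new head has converged, a common next step is to set `trainable = True` on some or all of the base layers, recompile with a much smaller learning rate, and continue training.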

However, having a pre-trained model whose training data closely resembles yours helps immensely. A while ago, I participated in a Scene Classification Challenge where we initialized our model with ResNet50 weights trained on the Places365 dataset. Since the classes in that challenge were all present in Places365, we used the weights available here and fine-tuned our model. This gave us a great boost in accuracy, and we ended up in the top positions on the leaderboard. You can find some more details about it in this blog.

Also, understand that one of the advantages of transfer learning is saving computation. Using a model with randomly initialized weights means training a neural net from scratch. If you use VGG16 weights trained on a digits dataset, the network may already have learned something useful (for example, low-level edge features), so it can save some training time. If you train a model from scratch, it will eventually learn the patterns that the pre-trained digits classifier weights would have given it.

On the other hand, using weights from a dog-vs-cat classifier should give you better performance, as it has already learned features to detect, say, paws, ears, noses or whiskers.
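The trade-offs above are often summarized as a rule of thumb based on two questions: how much data do you have, and how similar is it to the data the weights were trained on? The sketch below encodes that commonly cited heuristic. The function name and the returned wording are hypothetical, and it is a simplification rather than a definitive recipe.

```python
def suggest_strategy(dataset_is_small: bool, similar_to_source: bool) -> str:
    """Return a rule-of-thumb transfer-learning strategy (illustrative only)."""
    if similar_to_source and dataset_is_small:
        # Little data, similar domain: fine-tuning everything risks overfitting.
        return "freeze the base and train only a new classifier head"
    if similar_to_source:
        # Plenty of data, similar domain: the whole network can be adapted.
        return "fine-tune the whole pre-trained network"
    if dataset_is_small:
        # Little data, different domain: deep features are too source-specific.
        return "train a classifier on features from earlier layers"
    # Plenty of data, different domain: pre-trained weights still help as a start.
    return "initialize from pre-trained weights, then fine-tune everything"


# Cat breeds with a dog-vs-cat model: similar domain, probably a small dataset.
print(suggest_strategy(dataset_is_small=True, similar_to_source=True))
```

Under this heuristic, digits-trained weights fall into the "different domain" rows, while dog-vs-cat or ImageNet weights fall into the "similar domain" rows for a cat breed task.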

138

answered Sep 19 '22 18:09

Aditya Mishra

Could you provide more information: what exactly do you want to classify? I see you wish to classify images, but which type of images (containing what?) and into which classes?

As a general remark: if you use a trained model, it must of course fit your need. Keep in mind that a model trained on a given dataset has learned only the information contained in that dataset, and can classify or identify only information analogous to what was in the training dataset.

1. If you want to classify an image containing an animal with a yes/no (binary) classifier (cat or not cat), you should use a model trained on different animals, cats among them.
2. If you want to classify an image of a cat into classes corresponding to cat breeds, you should use a model trained only on cat images.

I would say you should use a pipeline consisting of step 1 followed by step 2.
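A minimal sketch of such a two-stage pipeline might look like the following. The two model arguments are hypothetical callables standing in for whatever trained classifiers you end up with; the stub functions exist only to make the example runnable.

```python
from typing import Any, Callable, Optional


def classify_breed(
    image: Any,
    is_cat: Callable[[Any], bool],
    breed_of: Callable[[Any], str],
) -> Optional[str]:
    """Run the cat / not-cat check first, then the breed classifier."""
    # Stage 1: binary classifier trained on many animals, cats among them.
    if not is_cat(image):
        return None  # not a cat, so a breed prediction would be meaningless
    # Stage 2: breed classifier trained only on cat images.
    return breed_of(image)


# Stub models for illustration; real ones would wrap trained networks.
def fake_is_cat(image: Any) -> bool:
    return image == "cat_photo"


def fake_breed_of(image: Any) -> str:
    return "Siamese"


print(classify_breed("cat_photo", fake_is_cat, fake_breed_of))
print(classify_breed("dog_photo", fake_is_cat, fake_breed_of))
```

Keeping the two stages as separate callables means each model can be retrained or replaced without touching the other.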

answered Sep 22 '22 18:09

Catalina Chircu

Tags:

classification

deep-learning

pre-trained-model

transfer-learning