I had followed this blog --> https://medium.com/@teyou21/training-your-object-detection-model-on-tensorflow-part-2-e9e12714bdf , and built a SSD Mobilenet model which is pre-trained on the COCO Dataset called "ssd_mobilenet_v2_quantized_coco". What happens here is that it perfectly detects my new classes, but I want to include the pre-trained classes as well. I tried changing the number of classes to 96 ( 90 pre-trained + 6 new ) and edited the "labelmap.pbtxt" with the name and corresponding id of all labels from the COCO Dataset with the new classes being added at the last from ids 91 - 96. It still detects only the new classes only. What should I do to detect both the pre-trained and new classes?

It depends on how you use the pre trained weights: <ol> <li>Use for transfer learning (I think in the link you send this is what they do)</li> <li>Use has a starting point for fitting the entire model.</li> </ol> The first option only trains the detection head and not the backbone of the network - This means that the backbone weights are sherd between your model and the original model. In the second option you train all the network, backbone + detection head- This means that you have two different models If in your case you use the second option then the only way to do what you want is to load both networks and run inference on the image once with the original network and second with your new network. Then you combine your results. If you use the first option then you could do the following: <ol> <li>Train the network on your data and save the new detection head weights.</li> <li>Create a new network that has the same backbone but two detection heads: one with the original weights and the second head with the new weights.</li> </ol> The idea is that because the backbone is the same for both we can use the backbone to extract the features for the image and then feed each detection head with the features. This is a tutorial on how to extract weights from one graph and combine them in a new one (This is for TF1) TensorFlow: saving/restoring and mixing multiple models Here you can read on how to save and restore part of a model - save-and-restore-a-subset-of-variables

How to add additional classes to a pre-trained object detection model and train it to detect all of the classes (pre-trained + new)?

Tags:

python

tensorflow

deep-learning

object-detection

I had followed this blog --> https://medium.com/@teyou21/training-your-object-detection-model-on-tensorflow-part-2-e9e12714bdf , and built a SSD Mobilenet model which is pre-trained on the COCO Dataset called "ssd_mobilenet_v2_quantized_coco".

What happens here is that it perfectly detects my new classes, but I want to include the pre-trained classes as well.

I tried changing the number of classes to 96 ( 90 pre-trained + 6 new ) and edited the "labelmap.pbtxt" with the name and corresponding id of all labels from the COCO Dataset with the new classes being added at the last from ids 91 - 96.

It still detects only the new classes only.

What should I do to detect both the pre-trained and new classes?

590

asked Sep 14 '19 06:09

Aadit Narendar

1 Answers

It depends on how you use the pre trained weights:

Use for transfer learning (I think in the link you send this is what they do)
Use has a starting point for fitting the entire model.

The first option only trains the detection head and not the backbone of the network - This means that the backbone weights are sherd between your model and the original model.

In the second option you train all the network, backbone + detection head- This means that you have two different models

If in your case you use the second option then the only way to do what you want is to load both networks and run inference on the image once with the original network and second with your new network. Then you combine your results.

If you use the first option then you could do the following:

Train the network on your data and save the new detection head weights.
Create a new network that has the same backbone but two detection heads: one with the original weights and the second head with the new weights.

The idea is that because the backbone is the same for both we can use the backbone to extract the features for the image and then feed each detection head with the features.

This is a tutorial on how to extract weights from one graph and combine them in a new one (This is for TF1) TensorFlow: saving/restoring and mixing multiple models

Here you can read on how to save and restore part of a model - save-and-restore-a-subset-of-variables

195

answered Sep 17 '22 15:09

Amitay Nachmani

Related questions
                            
                                How to configure uWSGI in order to debug with pdb (--honour-stdin configuration issue)
                            
                                Truncated Backpropagation in keras with one sequence per batch
                            
                                pdb bypass error/Jump failed: can only jump from a 'line' trace event
                            
                                Anaconda Prompt Stuck/Closing after Keras installation
                            
                                How to specify multiple sys_platforms with Pipenv
                            
                                How to change geolocation of chrome selenium driver in Python?
                            
                                How to send commands from Python turtle graphics to an EV3 Lego brick?
                            
                                Plotly graph component cannot accept viewport units to set text annotation font size
                            
                                Why does increasing precision make this program faster?
                            
                                aiohttp concurrent GET requests lead to ClientConnectorError(8, 'nodename nor servname provided, or not known')
                            
                                Import a library which is in a sibling of the current folder
                            
                                Pandas Interpolate 'time' vs 'linear'
                            
                                Text classification beyond the keyword dependency and inferring the actual meaning
                            
                                Keras: Custom layer without inputs
                            
                                GAE/P: Transaction safety with API calls
                            
                                How to use the new Int64 pandas object when saving to a parquet file
                            
                                Adding clippath information to an image
                            
                                Can't upgrade JupyterLab to latest version
                            
                                How to document a flask-restplus response with list of strings
                            
                                How to access Type Hints inside of a method after it's called in Python

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With