I would like to train a MobileNet SSD model on a custom dataset.
I have looked into the workflow for retraining a model and noticed the image_resizer {} block in the config file:
https://github.com/tensorflow/models/blob/d6d0868209833e014074d6cb4f32558e7acf2a6d/research/object_detection/samples/configs/ssd_mobilenet_v1_pets.config#L43
Does the aspect ratio here have to be 1:1, like 300x300, or can I specify a custom ratio?
All my dataset images are 960x256, so could I just enter this size for the height and width? Or do I need to resize all the images to an aspect ratio of 1:1?
Choose the height and width in the model file (as per your link) to be the input shape at which you want your model to train and operate. The model will resize input images to the specified size if it has to.
So this could be the size of your input images (if your hardware can train and operate a model at that size):
image_resizer {
  fixed_shape_resizer {
    height: 256
    width: 960
  }
}
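To see what that resizer implies in practice, here is a minimal sketch, assuming TensorFlow 2.x and using plain tf.image.resize to mimic the behaviour (this is not the Object Detection API's internal code): a fixed_shape_resizer maps every input to exactly height x width, regardless of the original aspect ratio.

import tensorflow as tf

# A dummy 960x256 training image; TensorFlow tensors are (height, width, channels).
image = tf.random.uniform((256, 960, 3))

# A fixed-shape resize maps any input to exactly (height, width);
# for an image already at that size it is effectively a no-op.
resized = tf.image.resize(image, size=(256, 960))
print(resized.shape)  # (256, 960, 3)

# An input with a different aspect ratio is simply stretched to fit:
frame = tf.random.uniform((1080, 1920, 3))
print(tf.image.resize(frame, size=(256, 960)).shape)  # (256, 960, 3), distorted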
The choice will depend on the size of the training images and the resources required to train (and use) that size of model.
I typically use 512x288, since a model of this size runs happily on a Raspberry Pi. I prepare training images, at a variety of scales, at exactly this size, so the image resizer does no work during training.
For inference, I input images at 1920x1080, so the image resizer scales them down to 512x288 before they pass into the MobileNet; both sizes are 16:9, so the aspect ratio happens to be maintained.
In any case, the aspect ratio is not important in my domain, since such distortions occur naturally.
So yes, just use your training image dimensions.
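As an aside, if distortion did matter in your domain, the API's image_resizer also offers a keep_aspect_ratio_resizer. A sketch of what that might look like for your dimensions is below; note that SSD feature extractors generally expect a fixed input shape, so padding to the max dimension is usually needed, and you should check the object_detection protos for the exact fields before relying on this:

image_resizer {
  keep_aspect_ratio_resizer {
    min_dimension: 256
    max_dimension: 960
    pad_to_max_dimension: true
  }
}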