 

What is the expected input range for working with Keras VGG models?

I'm trying to use a pretrained VGG16 from Keras, but I'm unsure what the input range should be.

Quick answer, which of these color orders?

  • RGB
  • BGR

And which range?

  • 0 to 255?
  • balanced from about -125 to about +130?
  • 0 to 1?
  • -1 to 1?

I notice the file where the model is defined imports an input preprocessor:

from .imagenet_utils import preprocess_input

But this preprocessor is never used in the rest of the file.

Also, when I check the code for this preprocessor, it has two modes: caffe and tf (tensorflow).

Each mode works differently.
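To make the difference concrete, here is a minimal numpy sketch of the two behaviors as I understand them from reading the code (the function names here are my own; the real function is imagenet_utils.preprocess_input with a mode argument):

```python
import numpy as np

def caffe_mode(x):
    # 'caffe': convert RGB -> BGR, then zero-center each channel with
    # the ImageNet BGR means. No scaling to [0, 1] or [-1, 1].
    x = x[..., ::-1].astype("float64")
    return x - np.array([103.939, 116.779, 123.68])

def tf_mode(x):
    # 'tf': scale pixels from [0, 255] to [-1, 1].
    # No channel swap, no mean subtraction.
    return x / 127.5 - 1.0
```

So the two modes produce entirely different ranges, which is exactly why I'm asking which one the VGG16 weights expect.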

Finally, I can't find consistent documentation on the internet.

So, what is the best range for working? To what range are the model weights trained?

Daniel Möller, asked Oct 07 '17



1 Answer

The model weights were ported from Caffe, so the network expects BGR input.

Caffe uses a BGR color channel scheme for reading image files. This is due to the underlying OpenCV implementation of imread. The assumption of RGB is a common mistake.

You can find the original Caffe model weight files on the VGG website; the link also appears in the Keras documentation.

I think the second range is the closest one. There's no scaling during training, but the authors subtracted the mean pixel value of the ILSVRC-2014 training set. As stated in the original VGG paper, section 2.1:

The only preprocessing we do is subtracting the mean RGB value, computed on the training set, from each pixel.

This sentence is exactly what imagenet_utils.preprocess_input(mode='caffe') does:

  1. Convert from RGB to BGR: because keras.preprocessing.image.load_img() loads images in RGB format, this conversion is required for VGG16 (and all models ported from caffe).
  2. Subtract the mean BGR values: (103.939, 116.779, 123.68) is subtracted from the image array.
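The two steps above can be sketched in numpy (the helper name here is hypothetical; the real function is keras.applications.vgg16.preprocess_input):

```python
import numpy as np

# Hypothetical mirror of keras.applications.vgg16.preprocess_input
# (mode='caffe'); input is an RGB array with values in [0, 255].
def vgg16_preprocess(rgb):
    bgr = rgb[..., ::-1].astype("float64")             # step 1: RGB -> BGR
    return bgr - np.array([103.939, 116.779, 123.68])  # step 2: subtract means

# Check the resulting range on extreme pixel values: it spans roughly
# -123.7 to +151.1, i.e. zero-centered but not scaled.
extremes = vgg16_preprocess(np.array([[[0.0, 0.0, 0.0],
                                       [255.0, 255.0, 255.0]]]))
print(extremes.min(), extremes.max())
```

This confirms the "balanced around zero, roughly -125 to +130 per channel" option from the question: the values are mean-centered but never rescaled. In practice, call the Keras function directly rather than hand-rolling this.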

The preprocessor is not used in vgg16.py itself. It's imported there so that users can call keras.applications.vgg16.preprocess_input(rgb_img_array) without caring where the model weights came from. The argument to preprocess_input() is always an image array in RGB format; if the model was trained with Caffe, preprocess_input() will convert the array to BGR.

Note that preprocess_input() is not intended to be called from the imagenet_utils module directly. If you are using VGG16, call keras.applications.vgg16.preprocess_input() and the images will be converted to the format and range that VGG16 was trained on. Similarly, if you are using Inception V3, call keras.applications.inception_v3.preprocess_input() and the images will be converted to the range that Inception V3 was trained on.

Yu-Yang, answered Sep 28 '22