What is the right way to preprocess the data in Keras while fine-tuning the pre-trained models in keras.applications for our own data? Keras provides the following <code>preprocess_input</code> functions keras.applications.imagenet_utils.preprocess_input keras.applications.inception_v3.preprocess_input keras.applications.xception.preprocess_input keras.applications.inception_resnet_v2.preprocess_input Looking inside it seems like for inception_v3, xception, and inception_resnet_v2, it calls keras.applications.imagenet_utils.preprocess_input with <code>mode='tf'</code>. While for other models it sets <code>mode='caffe'</code> each of which perform a different transformation. In the blog post about transfer learning from Francois chollet -- https://blog.keras.io/building-powerful-image-classification-models-using-very-little-data.html -- it is normalized to <code>[0, 1]</code> through a division with 255. Shouldn't the preprocess_input functions in Keras be used instead? Also it is not clear whether the input images should be in RGB or BGR? Is there any consistency regarding this or is it specific to the pre-trained model being used?

Always use the <code>preprocess_input</code> function in the corresponding model-level module. That is, use <code>keras.applications.inception_v3.preprocess_input</code> for <code>InceptionV3</code> and <code>keras.applications.resnet50.preprocess_input</code> for <code>ResNet50</code>. The <code>mode</code> argument specifies the preprocessing method used when training the original model. <code>mode='tf'</code> means that the pre-trained weights are converted from TF, where the authors trained model with <code>[-1, 1]</code> input range. So are <code>mode='caffe'</code> and <code>mode='torch'</code>. The input to <code>applications.*.preprocess_input</code> is always RGB. If a model expects BGR input, the channels will be permuted inside <code>preprocess_input</code>. The blog post you've mentioned was posted before the <code>keras.applications</code> module was introduced. I wouldn't recommend using it as a reference for transfer learning with <code>keras.applications</code>. Maybe it'll be better to try the examples in the docs instead.

What is the right way to preprocess images in Keras while fine-tuning pre-trained models

Tags:

python

machine-learning

deep-learning

keras

What is the right way to preprocess the data in Keras while fine-tuning the pre-trained models in keras.applications for our own data?

Keras provides the following preprocess_input functions

keras.applications.imagenet_utils.preprocess_input

keras.applications.inception_v3.preprocess_input

keras.applications.xception.preprocess_input

keras.applications.inception_resnet_v2.preprocess_input

Looking inside it seems like for inception_v3, xception, and inception_resnet_v2, it calls keras.applications.imagenet_utils.preprocess_input with mode='tf'. While for other models it sets mode='caffe' each of which perform a different transformation.

In the blog post about transfer learning from Francois chollet -- https://blog.keras.io/building-powerful-image-classification-models-using-very-little-data.html -- it is normalized to [0, 1] through a division with 255. Shouldn't the preprocess_input functions in Keras be used instead?

Also it is not clear whether the input images should be in RGB or BGR? Is there any consistency regarding this or is it specific to the pre-trained model being used?

852

asked Feb 08 '18 03:02

cdeepakroy

1 Answers

Always use the preprocess_input function in the corresponding model-level module. That is, use keras.applications.inception_v3.preprocess_input for InceptionV3 and keras.applications.resnet50.preprocess_input for ResNet50.

The mode argument specifies the preprocessing method used when training the original model. mode='tf' means that the pre-trained weights are converted from TF, where the authors trained model with [-1, 1] input range. So are mode='caffe' and mode='torch'.

The input to applications.*.preprocess_input is always RGB. If a model expects BGR input, the channels will be permuted inside preprocess_input.

The blog post you've mentioned was posted before the keras.applications module was introduced. I wouldn't recommend using it as a reference for transfer learning with keras.applications. Maybe it'll be better to try the examples in the docs instead.

answered Oct 09 '22 05:10

Yu-Yang

Related questions
                            
                                Sort dict by value and return dict, not list of tuples [duplicate]
                            
                                Python C-API functions that borrow and steal references
                            
                                Python class that extends int doesn't entirely behave like an int
                            
                                What NLP tools to use to match phrases having similar meaning or semantics
                            
                                Django ManagementForm data is missing or has been tampered with
                            
                                Does this prime function actually work?
                            
                                Python: Howto launch a full process not a child process and retrieve the PID
                            
                                Dealing with piecewise equations returned by sympy integrate
                            
                                SqlAlchemy: Convert inherited type from one to another
                            
                                When to use Threadpool in Gevent
                            
                                Concurrent db table indexing through alembic script
                            
                                Send a multidimensional numpy array over a socket
                            
                                Eigenvectors computed with numpy's eigh and svd do not match
                            
                                Extract document-topic matrix from Pyspark LDA Model
                            
                                Pandas : TypeError: float() argument must be a string or a number
                            
                                Efficient pandas rolling aggregation over date range by group - Python 2.7 Windows - Pandas 0.19.2
                            
                                Package a python app like spyder does
                            
                                Why isn't it possible to use "await" in f-strings?
                            
                                Yield Request call produce weird result in recursive method with scrapy
                            
                                Are there some pre-trained LSTM, RNN or ANN models for time-series prediction?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With