Quantization aware training in TensorFlow version 2 and BatchNorm folding

Tags:

I'm wondering what the current available options are for simulating BatchNorm folding during quantization aware training in Tensorflow 2. Tensorflow 1 has the tf.contrib.quantize.create_training_graph function which inserts FakeQuantization layers into the graph and takes care of simulating batch normalization folding (according to this white paper).

Tensorflow 2 has a tutorial on how to use quantization in their recently adopted tf.keras API, but they don't mention anything about batch normalization. I tried the following simple example with a BatchNorm layer:

import tensorflow_model_optimization as tfmo

model = tf.keras.Sequential([
      l.Conv2D(32, 5, padding='same', activation='relu', input_shape=input_shape),
      l.MaxPooling2D((2, 2), (2, 2), padding='same'),
      l.Conv2D(64, 5, padding='same', activation='relu'),
      l.BatchNormalization(),    # BN!
      l.MaxPooling2D((2, 2), (2, 2), padding='same'),
      l.Flatten(),
      l.Dense(1024, activation='relu'),
      l.Dropout(0.4),
      l.Dense(num_classes),
      l.Softmax(),
])
model = tfmo.quantization.keras.quantize_model(model)

It however gives the following exception:

RuntimeError: Layer batch_normalization:<class 'tensorflow.python.keras.layers.normalization.BatchNormalization'> is not supported. You can quantize this layer by passing a `tfmot.quantization.keras.QuantizeConfig` instance to the `quantize_annotate_layer` API.

which indicates that TF does not know what to do with it.

I also saw this related topic where they apply tf.contrib.quantize.create_training_graph on a keras constructed model. They however don't use BatchNorm layers, so I'm not sure this will work.

So what are the options for using this BatchNorm folding feature in TF2? Can this be done from the keras API, or should I switch back to the TensorFlow 1 API and define a graph the old way?

924

asked Mar 27 '20 10:03

MaartenVds

1 Answers

If you add BatchNormalization before activation, you would not have issues with Quantization. Note: Quantization is supported in BatchNormalization only if it the layer is exactly after Conv2D layer. https://www.tensorflow.org/model_optimization/guide/quantization/training

# Change
l.Conv2D(64, 5, padding='same', activation='relu'),
l.BatchNormalization(),    # BN!
# with this
l.Conv2D(64, 5, padding='same'),
l.BatchNormalization(),
l.Activation('relu'),

#Other way of declaring the same
o = (Conv2D(512, (3, 3), padding='valid' , data_format=IMAGE_ORDERING))(o)
o = (BatchNormalization())(o)
o = Activation('relu')(o)

answered Sep 21 '22 16:09

Mohit Arvind khakharia

Related questions
                            
                                Does the django_address module provide a way to seed the initial country data?
                            
                                How to generate asgi.py for existent project?
                            
                                How do I correctly use mock call_args with Python's unittest.mock?
                            
                                Flask endpoint vs Sagemaker endpoint
                            
                                which python vs PYTHONPATH
                            
                                Do I need to split the data for isolation forest?
                            
                                Is it true that in multiprocessing, each process gets it's own GIL in CPython? How different is that from creating new runtimes?
                            
                                Django & mypy: ValuesQuerySet type hint
                            
                                How to process huge datasets in kedro
                            
                                Pandas - Generate Unique ID based on row values
                            
                                sklearn utils compute_class_weight function for large dataset
                            
                                Automatically determine header row when reading csv in pandas
                            
                                Pandas: Select all data from Pandas DataFrame between two dates
                            
                                Diminishing the impact of one variable over output in a regression model
                            
                                Why does Python3 run faster if it is negating vs XOR?
                            
                                Detect circles in openCV
                            
                                Tensorboard for custom training loop in Tensorflow 2
                            
                                Difficulty in GAN training
                            
                                How to count the presence of a set of numbers in a set of intervals efficiently
                            
                                Django+Postgres FATAL: sorry, too many clients already

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Quantization aware training in TensorFlow version 2 and BatchNorm folding

Tags:

python

tensorflow

tensorflow2.0

batch-normalization

quantization-aware-training

MaartenVds

People also ask

1 Answers

Mohit Arvind khakharia

Recent Activity

Donate For Us