
TensorFlow: how to merge batch norm into convolution for faster inference

Tags:

tensorflow

Is there a way (code/scripts) to merge TensorFlow's batch norm and dropout layers into the convolution layer at inference time, for faster computation?

I have searched for a while, but have not found relevant answers.

asked Mar 28 '18 by K.Wanter

People also ask

Is Batchnorm used in inference?

Yes. This is where the two moving-average parameters come in: the ones calculated during training and saved with the model. Those saved mean and variance values are used for batch norm during inference.

What is batch normalization folding?

Batch normalization is a technique that normalizes the input of each layer to make training faster and more stable. In practice, it is an extra layer generally added after a computation layer (such as a convolution) and before the non-linearity. Folding means merging that extra layer into the weights of the preceding computation layer, so it disappears at inference time.

Is Batchnorm layer trainable?

Yes. Batch normalization adds two trainable parameters to each layer: the normalized output is multiplied by a "standard deviation" parameter (gamma) and shifted by a "mean" parameter (beta).

How does batch normalization work during inference?

It means that during inference, batch normalization acts as a simple linear transformation of what comes out of the previous layer, often a convolution. Since a convolution is also a linear transformation, the two operations can be merged into a single linear transformation!
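A minimal NumPy sketch of this point (illustrative values only; mean, var, gamma, beta, and eps stand for a layer's saved statistics and parameters):

import numpy as np

mean, var = 0.5, 2.0          # saved moving statistics
gamma, beta = 1.5, -0.25      # learned scale and shift
eps = 1e-3                    # batch-norm epsilon

x = np.random.randn(4)

# batch norm at inference time...
bn = gamma * (x - mean) / np.sqrt(var + eps) + beta

# ...is just a fixed affine transform a*x + c
a = gamma / np.sqrt(var + eps)
c = beta - gamma * mean / np.sqrt(var + eps)
assert np.allclose(bn, a * x + c)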


2 Answers

To the best of my knowledge, there is no built-in feature in TensorFlow for folding batch normalization. That being said, it's not that hard to do manually. One note: there is no such thing as folding dropout, since dropout is simply deactivated at inference time.

To fold batch normalization, there are basically three steps:

  1. Given a TensorFlow graph, filter the variables that need folding,
  2. Fold the variables,
  3. Create a new graph with the folded variables.

First, we need to filter the variables that require folding. Batch normalization creates variables whose names contain moving_mean and moving_variance, which makes it fairly easy to extract the variables from layers that used batch norm.
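As a rough sketch (assuming the default TF 1.x variable naming, as in the comments of the full code below), the foldable layers can be collected like this:

import tensorflow as tf

# layers that used batch norm, identified by their moving-variance variable,
# e.g. 'model/conv1/moving_variance:0' --> 'model/conv1'
bn_layers = [v.name.rsplit('/', 1)[0]
             for v in tf.global_variables()
             if v.name.endswith('moving_variance:0')]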

Now that you know which layers used batch norm, for every such layer you can extract its weights W, bias b, batch-norm moving variance var, moving mean mean, and the gamma and beta parameters. You then need to create new variables to store the folded weights and biases as follows, where eps is the epsilon of the batch-norm layer:

W_new = gamma * W / sqrt(var + eps)
b_new = gamma * (b - mean) / sqrt(var + eps) + beta
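As a quick sanity check, here is a self-contained NumPy sketch (made-up shapes, with a 1x1 convolution written as a matrix product) showing that the folded weights reproduce conv + batch norm exactly:

import numpy as np

cin, cout = 8, 16
x = np.random.randn(5, cin)                  # a 1x1 conv is just a matmul
W = np.random.randn(cin, cout)
b = np.random.randn(cout)
gamma, beta = np.random.randn(cout), np.random.randn(cout)
mean, var = np.random.randn(cout), np.random.rand(cout)
eps = 1e-3

# convolution followed by batch norm
bn = gamma * ((x @ W + b) - mean) / np.sqrt(var + eps) + beta

# folded convolution
std = np.sqrt(var + eps)
W_new = gamma * W / std
b_new = gamma * (b - mean) / std + beta
assert np.allclose(bn, x @ W_new + b_new)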

The last step consists of creating a new graph in which batch norm is deactivated and bias variables are added where necessary. That should be the case for every foldable layer, since using a bias together with batch norm is pointless (the mean subtraction cancels it).

The whole code should look something like the following. Depending on the parameters used for the batch norm, your graph may not have gamma or beta.

import numpy as np
import tensorflow as tf

# `session` is the tf.Session holding the trained model;
# `get_layer_name` is a small helper you define for your naming scheme.
eps = 1e-3  # use the epsilon your batch-norm layers were created with

# ****** (1) Get variables ******
variables = {v.name: session.run(v) for v in tf.global_variables()}

# ****** (2) Fold variables ******
folded_variables = {}
for name in variables.keys():
    if not name.endswith('moving_variance:0'):
        continue

    n = get_layer_name(name)  # 'model/conv1/moving_variance:0' --> 'model/conv1'

    W = variables[n + '/weights:0']  # or "/kernel:0", etc.
    b = variables[n + '/bias:0']     # if a bias existed before
    gamma = variables[n + '/gamma:0']
    beta = variables[n + '/beta:0']
    mean = variables[n + '/moving_mean:0']
    var = variables[n + '/moving_variance:0']

    # folding batch norm: divide by the standard deviation, not the variance
    std = np.sqrt(var + eps)
    W_new = gamma * W / std
    b_new = gamma * (b - mean) / std + beta  # drop `b` if there was no bias
    folded_variables[n + '/weights:0'] = W_new
    folded_variables[n + '/bias:0'] = b_new

# ****** (3) Create new graph ******
new_graph = tf.Graph()
new_session = tf.Session(graph=new_graph)
with new_graph.as_default():
    network = ...  # instantiate the batch-norm-free graph with biases added.
                   # Careful, the names should match the original model.

    for v in tf.global_variables():
        try:
            new_session.run(v.assign(folded_variables[v.name]))
        except KeyError:  # variable untouched by folding
            new_session.run(v.assign(variables[v.name]))
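To check the folding, you can feed the same input through both sessions and compare the outputs; 'input:0' and 'output:0' below are placeholders for your model's actual tensor names:

import numpy as np

x = np.random.randn(1, 224, 224, 3).astype(np.float32)  # example input shape

y_old = session.run('output:0', feed_dict={'input:0': x})
y_new = new_session.run('output:0', feed_dict={'input:0': x})

# tiny numerical differences come from the epsilon and float arithmetic
print(np.abs(y_old - y_new).max())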
answered Oct 19 '22 by BiBi

There is a tool provided by TensorFlow that optimizes your trained frozen graph for inference: https://github.com/tensorflow/tensorflow/blob/master/tensorflow/tools/graph_transforms/README.md#fold_batch_norms

  1. Download the TensorFlow source.
  2. Build the Graph Transform tool:

    bazel build tensorflow/tools/graph_transforms:transform_graph
    
  3. Freeze your graph, e.g. following https://blog.metaflow.fr/tensorflow-how-to-freeze-a-model-and-serve-it-with-a-python-api-d4f3596b3adc

  4. Run the transform tool:

    bazel-bin/tensorflow/tools/graph_transforms/transform_graph \
    --in_graph=tensorflow_inception_graph.pb \
    --out_graph=optimized_inception_graph.pb \
    --inputs='Mul' \
    --outputs='softmax' \
    --transforms='
      strip_unused_nodes(type=float, shape="1,299,299,3")
      remove_nodes(op=Identity, op=CheckNumerics)
      fold_constants(ignore_errors=true)
      fold_batch_norms
      fold_old_batch_norms'
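
The optimized graph can then be loaded like any other frozen graph. A minimal sketch, reusing the 'Mul' input and 'softmax' output names from the Inception example above:

import numpy as np
import tensorflow as tf

with tf.gfile.GFile('optimized_inception_graph.pb', 'rb') as f:
    graph_def = tf.GraphDef()
    graph_def.ParseFromString(f.read())

graph = tf.Graph()
with graph.as_default():
    tf.import_graph_def(graph_def, name='')

with tf.Session(graph=graph) as sess:
    batch = np.zeros((1, 299, 299, 3), dtype=np.float32)  # dummy input
    probs = sess.run('softmax:0', feed_dict={'Mul:0': batch})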
    
answered Oct 19 '22 by Zejia Zheng