In tensorflow 1.4, I found two functions that do batch normalization and they look same: <ol> <li> <code>tf.layers.batch_normalization</code> (link)</li> <li> <code>tf.contrib.layers.batch_norm</code> (link)</li> </ol> Which function should I use? Which one is more stable?

Just to add to the list, there're several more ways to do batch-norm in tensorflow: <ul> <li> <code>tf.nn.batch_normalization</code> is a low-level op. The caller is responsible to handle <code>mean</code> and <code>variance</code> tensors themselves.</li> <li> <code>tf.nn.fused_batch_norm</code> is another low-level op, similar to the previous one. The difference is that it's optimized for 4D input tensors, which is the usual case in convolutional neural networks. <code>tf.nn.batch_normalization</code> accepts tensors of any rank greater than 1.</li> <li> <code>tf.layers.batch_normalization</code> is a high-level wrapper over the previous ops. The biggest difference is that it takes care of creating and managing the running mean and variance tensors, and calls a fast fused op when possible. Usually, this should be the default choice for you.</li> <li> <code>tf.contrib.layers.batch_norm</code> is the early implementation of batch norm, before it's graduated to the core API (i.e., <code>tf.layers</code>). The use of it is not recommended because it may be dropped in the future releases.</li> <li> <code>tf.nn.batch_norm_with_global_normalization</code> is another deprecated op. Currently, delegates the call to <code>tf.nn.batch_normalization</code>, but likely to be dropped in the future.</li> <li>Finally, there's also Keras layer <code>keras.layers.BatchNormalization</code>, which in case of tensorflow backend invokes <code>tf.nn.batch_normalization</code>.</li> </ul>

What is right batch normalization function in Tensorflow?

2 Answers

Just to add to the list, there're several more ways to do batch-norm in tensorflow:

tf.nn.batch_normalization is a low-level op. The caller is responsible to handle mean and variance tensors themselves.
tf.nn.fused_batch_norm is another low-level op, similar to the previous one. The difference is that it's optimized for 4D input tensors, which is the usual case in convolutional neural networks. tf.nn.batch_normalization accepts tensors of any rank greater than 1.
tf.layers.batch_normalization is a high-level wrapper over the previous ops. The biggest difference is that it takes care of creating and managing the running mean and variance tensors, and calls a fast fused op when possible. Usually, this should be the default choice for you.
tf.contrib.layers.batch_norm is the early implementation of batch norm, before it's graduated to the core API (i.e., tf.layers). The use of it is not recommended because it may be dropped in the future releases.
tf.nn.batch_norm_with_global_normalization is another deprecated op. Currently, delegates the call to tf.nn.batch_normalization, but likely to be dropped in the future.
Finally, there's also Keras layer keras.layers.BatchNormalization, which in case of tensorflow backend invokes tf.nn.batch_normalization.

113

answered Sep 28 '22 17:09

Maxim

As show in doc, tf.contrib is a contribution module containing volatile or experimental code. When function is complete, it will be removed from this module. Now there are two, in order to be compatible with the historical version.

So, the former tf.layers.batch_normalization is recommended.

answered Sep 28 '22 18:09

dxf

Related questions
                            
                                Arduino IDE can't find ESP8266WiFi.h file
                            
                                Angular Material button remove autofocus
                            
                                Total distance calculation from LatLng List
                            
                                Unknown compiler options include & exclude
                            
                                How can I write into the browser´s console via Blazor WebAssembly?
                            
                                Cancelling a long running process in VB6.0 without DoEvents?
                            
                                What is the Significance of Pseudo Header used in UDP/TCP
                            
                                Setting ajax url for jQuery in JS file using ASP.NET MVC
                            
                                How I can get the calling methods in C# [duplicate]
                            
                                Set border to table tr, works in everything except IE 6 & 7
                            
                                Deciding between an artificial primary key and a natural key for a Products table
                            
                                Is it wise to use PHP for a daemon?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

What is right batch normalization function in Tensorflow?

Tags:

KimHee

People also ask

2 Answers

Maxim

dxf

Recent Activity

Donate For Us