Is the batch norm momentum convention (default = 0.1) correct? In other libraries, e.g. TensorFlow, it usually seems to be 0.9 or 0.99 by default. Or maybe we are just using a different convention?
In PyTorch, batch normalization normalizes each mini-batch using the batch's own statistics during training, while the layer also maintains running estimates of the mean and variance for use at evaluation time.
Momentum controls the "lag" in updating these running estimates, so that noise from individual mini-batches is smoothed out. (A figure here compared actual values, shown light, against lagged values, shown bold, for momentum 0.99 and 0.75.) In the decay-style convention used by TensorFlow, this value is set high by default, around 0.99, meaning high lag and slow updates; this smoothing matters most when batch sizes are small and per-batch statistics are noisy.
The argument to BatchNorm2d is the number of channels output by the previous layer and fed into the batch norm layer.
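To make the running-statistics update concrete, here is a minimal sketch in plain Python (not the actual torch implementation) of how a batch-norm layer could track its running mean and variance under PyTorch's convention, where `momentum` defaults to 0.1; the function name and starting values are illustrative assumptions:

```python
def update_running_stats(running_mean, running_var, batch_mean, batch_var,
                         momentum=0.1):
    # PyTorch convention: new = (1 - momentum) * old + momentum * batch
    new_mean = (1 - momentum) * running_mean + momentum * batch_mean
    new_var = (1 - momentum) * running_var + momentum * batch_var
    return new_mean, new_var

# Running stats start at mean=0, var=1 (as in torch) and drift toward
# the batch statistics over many training steps.
mean, var = 0.0, 1.0
for _ in range(100):
    mean, var = update_running_stats(mean, var, batch_mean=2.0, batch_var=4.0)
print(mean, var)  # converges toward 2.0 and 4.0
```

With momentum=0.1, each step moves the running estimate 10% of the way toward the current batch's statistics, so the estimates converge quickly; a smaller momentum would give more lag and smoother estimates.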
It seems that the parametrization convention is different in PyTorch than in TensorFlow, so that momentum=0.1 in PyTorch is equivalent to decay=0.9 in TensorFlow.
To be more precise:
In TensorFlow:
running_mean = decay * running_mean + (1 - decay) * new_value
In PyTorch:
running_mean = (1 - momentum) * running_mean + momentum * new_value
This means that a value of momentum in PyTorch is equivalent to a value of (1 - momentum) used as the decay in TensorFlow.
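A quick numeric check of this equivalence, as a hedged sketch in plain Python (the helper names are illustrative, not library APIs):

```python
def tf_style_update(running_mean, new_value, decay=0.9):
    # TensorFlow convention: weight the old value by decay
    return decay * running_mean + (1 - decay) * new_value

def pt_style_update(running_mean, new_value, momentum=0.1):
    # PyTorch convention: weight the new value by momentum
    return (1 - momentum) * running_mean + momentum * new_value

# With decay = 1 - momentum, the two update rules are identical.
rm_tf, rm_pt = 0.0, 0.0
for x in [1.0, 2.0, 3.0]:
    rm_tf = tf_style_update(rm_tf, x, decay=0.9)
    rm_pt = pt_style_update(rm_pt, x, momentum=0.1)
print(rm_tf, rm_pt)  # identical running means
```

So porting a model between the two frameworks means setting PyTorch's momentum to one minus TensorFlow's decay, not copying the value directly.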