How to add regularizations in TensorFlow?

Tags:

python

neural-network

tensorflow

deep-learning

In many neural network implementations written in TensorFlow, I have found that regularization terms are added manually as an extra term in the loss value.

My questions are:

1. Is there a more elegant or recommended way of doing regularization than adding it manually?
2. I also found that get_variable has an argument regularizer. How should it be used? From what I observed, if we pass a regularizer to it (such as tf.contrib.layers.l2_regularizer), a tensor representing the regularization term is computed and added to a graph collection named tf.GraphKeys.REGULARIZATION_LOSSES. Will that collection be used automatically by TensorFlow (e.g. by optimizers during training)? Or am I expected to use that collection myself?


asked May 09 '16 03:05

Lifu Huang

2 Answers

As you say in the second point, using the regularizer argument is the recommended way. You can use it in get_variable, or set it once in your variable_scope and have all your variables regularized.

The losses are collected in the graph, and you need to add them to your cost function manually, like this:

    reg_losses = tf.get_collection(tf.GraphKeys.REGULARIZATION_LOSSES)
    reg_constant = 0.01  # Choose an appropriate one.
    loss = my_normal_loss + reg_constant * sum(reg_losses)
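To make the arithmetic concrete, here is a minimal framework-free sketch of what that sum amounts to. It assumes the usual convention for tf.contrib.layers.l2_regularizer, namely scale * sum(w^2) / 2 per variable. The helper l2_penalty, the weight values, and the data loss of 0.5 are all hypothetical and only serve the illustration.

```python
# Plain-Python sketch of the loss arithmetic; no TensorFlow involved.

def l2_penalty(weights, scale):
    """Return scale * sum(w^2) / 2 (assumed l2_regularizer convention)."""
    return scale * sum(w * w for w in weights) / 2.0

# Stand-in for the REGULARIZATION_LOSSES collection: one scalar per variable.
layer1 = [1.0, -2.0]   # hypothetical weights
layer2 = [3.0]         # hypothetical weights
scale = 0.1
reg_losses = [l2_penalty(layer1, scale), l2_penalty(layer2, scale)]

my_normal_loss = 0.5   # hypothetical data loss
reg_constant = 0.01
loss = my_normal_loss + reg_constant * sum(reg_losses)
print(loss)
```

Under this convention the effective coefficient on the squared weights is reg_constant * scale / 2, so the constant set in the regularizer and the one applied when summing compound.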

Hope that helps!

answered Oct 01 '22 11:10

Lukasz Kaiser

A few aspects of the existing answer were not immediately clear to me, so here is a step-by-step guide:

1. Define a regularizer. This is where the regularization constant can be set, e.g.:
```
regularizer = tf.contrib.layers.l2_regularizer(scale=0.1)
```
2. Create variables via:
```
weights = tf.get_variable(
    name="weights",
    regularizer=regularizer,
    ...
)
```
Equivalently, variables can be created via the regular weights = tf.Variable(...) constructor, followed by tf.add_to_collection(tf.GraphKeys.REGULARIZATION_LOSSES, regularizer(weights)).
3. Define some loss term and add the regularization term:
```
reg_losses = tf.get_collection(tf.GraphKeys.REGULARIZATION_LOSSES)
reg_term = tf.add_n(reg_losses)
loss += reg_term
```
Note: The collection already holds one penalty tensor per regularized variable, so the terms only need to be summed. tf.contrib.layers.apply_regularization expects the variables themselves; it applies the regularizer to each one and combines the results with an AddN, so passing it the collected losses would regularize them a second time.
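
The collection mechanism behind these steps can be modelled without TensorFlow. The sketch below is only an illustration of the pattern, not TensorFlow's implementation: the module-level dictionary, the key string, and the simplified get_variable, add_to_collection, get_collection and l2_regularizer functions are all hypothetical stand-ins. It also computes penalties eagerly, whereas in a real graph each penalty is a symbolic tensor that follows the variable as training updates it.

```python
# Framework-free model of "register a penalty at creation, sum it later".
from collections import defaultdict

REGULARIZATION_LOSSES = "regularization_losses"  # hypothetical key name
_collections = defaultdict(list)


def add_to_collection(key, value):
    _collections[key].append(value)


def get_collection(key):
    return list(_collections[key])


def l2_regularizer(scale):
    # Returns a function mapping a list of weights to a scalar penalty.
    def regularizer(weights):
        return scale * sum(w * w for w in weights) / 2.0
    return regularizer


def get_variable(initial, regularizer=None):
    # Creating the variable also registers its penalty, if one is requested.
    weights = list(initial)
    if regularizer is not None:
        add_to_collection(REGULARIZATION_LOSSES, regularizer(weights))
    return weights


regularizer = l2_regularizer(scale=0.1)
w1 = get_variable([1.0, 2.0], regularizer=regularizer)
w2 = get_variable([2.0, 0.0], regularizer=regularizer)

# Nothing uses the collection automatically; the caller sums it into the loss.
reg_term = sum(get_collection(REGULARIZATION_LOSSES))
loss = 1.0 + reg_term  # 1.0 is a placeholder data loss
print(reg_term, loss)
```

The point of the design is that variable creation and loss construction stay decoupled: any code that creates a regularized variable contributes a term, and the training code collects all of them in one place.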

answered Oct 01 '22 11:10

bluenote10
