I have a deep neural network where the weights between layers are stored in a list.
layers[j].weights
I want to include the ridge penalty in my cost function. I then need to use something like
tf.nn.l2_loss(layers[j].weights**2 for j in range(self.n_layers))
i.e. the sum of the squares of all the weights.
In particular the weights are defined as:
>>> avs.layers
[<neural_network.Layer object at 0x10a4b2a90>, <neural_network.Layer object at 0x10ac85080>, <neural_network.Layer object at 0x10b0f3278>, <neural_network.Layer object at 0x10b0eacf8>, <neural_network.Layer object at 0x10b145588>, <neural_network.Layer object at 0x10b165048>, <neural_network.Layer object at 0x10b155ba8>]
>>>
>>> avs.layers[0].weights
<tensorflow.python.ops.variables.Variable object at 0x10b026748>
>>>
How can I do that in TensorFlow?
The standard way to sum a list of tensors is to use the tf.add_n() operation, which takes a list of tensors (each having the same size and shape) and produces a single tensor containing their sum.
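For example, a minimal sketch of tf.add_n() on same-shaped tensors (illustrative values only):

import tensorflow as tf

a = tf.constant([1.0, 2.0])
b = tf.constant([3.0, 4.0])
c = tf.constant([5.0, 6.0])

# All inputs share the same shape, so add_n sums them elementwise.
total = tf.add_n([a, b, c])  # [9.0, 12.0]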
For the particular problem that you have, I am assuming that each layers[j].weights could have a different size. Therefore you will need to reduce each element down to a scalar before summing, e.g. using the tf.nn.l2_loss() function itself:
# Collect each layer's weight variable, reduce each to a scalar with l2_loss, then sum the scalars.
weights = [layers[j].weights for j in range(self.n_layers)]
losses = [tf.nn.l2_loss(w) for w in weights]
total_loss = tf.add_n(losses)
(Note however that when the values to be added are large, you may find it more efficient to compute a sequence of tf.add() operations, since TensorFlow keeps the values of each of the add_n arguments in memory until all of them have been computed. A chain of add ops allows some of the computation to happen earlier.)
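If you want to try that, one illustrative sketch (reusing the weights list from the snippet above) is to fold the scalars together with a chain of tf.add() via functools.reduce:

import tensorflow as tf
from functools import reduce

# Each tf.add can run as soon as its two inputs are ready, so the intermediate
# l2_loss scalars do not all have to be kept in memory until the final sum.
losses = [tf.nn.l2_loss(w) for w in weights]
total_loss = reduce(tf.add, losses)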
The tf.nn.l2_loss() function returns a tensor with 0 dimensions.
But it's nice to not need to manually apply that to each weight tensor, so storing the weight tensors in a list is one way to solve the problem (as @mrry noted).
But rather than needing to write that out every time, you could use the following function:
def l2_loss_sum(list_o_tensors):
    # Reduce each tensor to a scalar with l2_loss (sum of squares / 2), then sum the scalars.
    return tf.add_n([tf.nn.l2_loss(t) for t in list_o_tensors])
In your case this would look like:
total_loss = l2_loss_sum([layers[j].weights for j in range(self.n_layers)])
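And to actually include the ridge penalty in your cost function (as asked), you would scale this term by a regularization coefficient and add it to your data loss; data_loss and beta below are placeholder names for whatever your network already defines, not part of any API:

beta = 0.01  # hypothetical regularization strength
l2_penalty = l2_loss_sum([layers[j].weights for j in range(self.n_layers)])
cost = data_loss + beta * l2_penalty  # data_loss stands in for e.g. your cross-entropy term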
Also, tf.nn.l2_loss() implicitly squares the values as well as multiplying all the squared values by 1/2, so were you to use something like tf.nn.l2_loss(layers[j].weights**2 for j in range(self.n_layers)) you would actually be raising the weights to the 4th power. As a result the derivative of this loss term would be strange: it wouldn't cancel the 1/2 to 1 (but would implicitly double your β), and the weights would be cubed.
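A quick sanity check of that point (illustrative values; in graph mode you would evaluate these tensors in a session to see the numbers):

import tensorflow as tf

w = tf.constant([1.0, 2.0, 3.0])
tf.nn.l2_loss(w)       # sum(w**2) / 2      = (1 + 4 + 9) / 2   = 7.0
tf.nn.l2_loss(w ** 2)  # sum((w**2)**2) / 2 = (1 + 16 + 81) / 2 = 49.0, i.e. weights to the 4th power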