I'm trying to understand the difference between tf.assign and the assignment operator(=). I have three sets of code First, using simple tf.assign <pre class="prettyprint"><code>import tensorflow as tf with tf.Graph().as_default(): a = tf.Variable(1, name="a") assign_op = tf.assign(a, tf.add(a,1)) with tf.Session() as sess: sess.run(tf.global_variables_initializer()) print sess.run(assign_op) print a.eval() print a.eval() </code></pre> The output is expected as <pre class="prettyprint"><code>2 2 2 </code></pre> Second, using assignment operator <pre class="prettyprint"><code>import tensorflow as tf with tf.Graph().as_default(): a = tf.Variable(1, name="a") a = a + 1 with tf.Session() as sess: sess.run(tf.global_variables_initializer()) print sess.run(a) print a.eval() print a.eval() </code></pre> The results are still 2, 2, 2. Third, I use both tf.assign and assignment operator <pre class="prettyprint"><code>import tensorflow as tf with tf.Graph().as_default(): a = tf.Variable(1, name="a") a = tf.assign(a, tf.add(a,1)) with tf.Session() as sess: sess.run(tf.global_variables_initializer()) print sess.run(a) print a.eval() print a.eval() </code></pre> Now, the output becomes 2, 3, 4. My questions are <ol> <li>In the 2nd snippet using (=), when I have sess.run(a), it seems I'm running an assign op. So does "a = a+1" internally create an assignment op like assign_op = tf.assign(a, a+1)? Is the op run by the session really just the assign_op? But when I run a.eval(), it doesn't continue to increment a, hence it appears eval is evaluating a "static" variable.</li> <li>I'm not sure how to explain the 3rd snippet. Why the two evals increment a, but the two evals in the 2nd snippet doesn't?</li> </ol> Thanks.

First, the anwser is not really precise. IMO, there's no distinguish between python object and tf object. They are all memory objects managed by python GC. If you change second <code>a</code> to <code>b</code>, and print vars out, <pre class="prettyprint lang-py prettyprint-override"><code>In [2]: g = tf.Graph() In [3]: with g.as_default(): ...: a = tf.Variable(1, name='a') ...: b = a + 1 ...: In [4]: print(a) <tf.Variable 'a:0' shape=() dtype=int32_ref> In [5]: print(b) Tensor("add:0", shape=(), dtype=int32) In [6]: id(a) Out[6]: 140253111576208 In [7]: id(b) Out[7]: 140252306449616 </code></pre> <code>a</code> and <code>b</code> are not referring the same object in memory. Draw the computation graph, or memory graph first-line, <pre class="prettyprint"><code># a = tf.Varaible(... a -> var(a) </code></pre> second line, <pre class="prettyprint"><code># b = a + 1 b -> add - var(a) | \-- 1 </code></pre> now if you replace it back to your <code>b = a + 1</code> to <code>a = a + 1</code>, the <code>a</code> after assign operation is pointing to an <code>tf.add</code> object instead of the variable <code>a</code> incremented by 1. When you run <code>sess.run</code>, you are fetching the result by that <code>add</code> operator with no side effect to the original <code>a</code> variable. <code>tf.assign</code>, on the other hand, will have the side effect of updating the state of the graph under the session.

The main confusion here is that doing <code>a = a + 1</code> will reassign the Python variable <code>a</code> to the resulting tensor of the addition operation <code>a + 1</code>. <code>tf.assign</code>, on the other hand, is an operation for setting the value of a TensorFlow variable. <pre class="prettyprint"><code>a = tf.Variable(1, name="a") a = a + 1 </code></pre> This is equivalent to: <pre class="prettyprint"><code>a = tf.add(tf.Variable(1, name="a"), 1) </code></pre> With that in mind: <blockquote> In the 2nd snippet using (=), when I have sess.run(a), it seems I'm running an assign op. So does "a = a+1" internally create an assignment op like assign_op = tf.assign(a, a+1)? [...] </blockquote> It might look so, but not true. As explained above, this will only reassign the Python variable. And without <code>tf.assign</code> or any other operation that changes the variable, it stays with the value 1. Each time <code>a</code> is evaluated, the program will always calculate <code>a + 1 => 1 + 1</code>. <blockquote> I'm not sure how to explain the 3rd snippet. Why the two evals increment a, but the two evals in the 2nd snippet doesn't? </blockquote> That's because calling <code>eval()</code> on the assignment tensor in the third snippet also triggers the variable assignment (note that this isn't much different from doing <code>session.run(a)</code> with the current session).

Difference between tf.assign and assignment operator (=)

Q: Why does R use <- for assignment?

As you all know, R comes from S. But you might not know a lot about S (I don't). This language used <- as an assignment operator. It's partly because it was inspired by a language called APL, which also had this sign for assignment.

Tags:

tensorflow

I'm trying to understand the difference between tf.assign and the assignment operator(=). I have three sets of code

First, using simple tf.assign

import tensorflow as tf

with tf.Graph().as_default():
  a = tf.Variable(1, name="a")
  assign_op = tf.assign(a, tf.add(a,1))
  with tf.Session() as sess:
    sess.run(tf.global_variables_initializer())
    print sess.run(assign_op)
    print a.eval()
    print a.eval()

The output is expected as

2
2
2

Second, using assignment operator

import tensorflow as tf

with tf.Graph().as_default():
  a = tf.Variable(1, name="a")
  a = a + 1
  with tf.Session() as sess:
   sess.run(tf.global_variables_initializer())
   print sess.run(a)
   print a.eval()
   print a.eval()

The results are still 2, 2, 2.

Third, I use both tf.assign and assignment operator

import tensorflow as tf

with tf.Graph().as_default():
  a = tf.Variable(1, name="a")
  a = tf.assign(a, tf.add(a,1))
  with tf.Session() as sess:
    sess.run(tf.global_variables_initializer())
    print sess.run(a)
    print a.eval()
    print a.eval()

Now, the output becomes 2, 3, 4.

My questions are

In the 2nd snippet using (=), when I have sess.run(a), it seems I'm running an assign op. So does "a = a+1" internally create an assignment op like assign_op = tf.assign(a, a+1)? Is the op run by the session really just the assign_op? But when I run a.eval(), it doesn't continue to increment a, hence it appears eval is evaluating a "static" variable.
I'm not sure how to explain the 3rd snippet. Why the two evals increment a, but the two evals in the 2nd snippet doesn't?

Thanks.

866

asked Aug 20 '17 06:08

user8490020

2 Answers

First, the anwser is not really precise. IMO, there's no distinguish between python object and tf object. They are all memory objects managed by python GC.

If you change second a to b, and print vars out,

In [2]: g = tf.Graph()

In [3]: with g.as_default():
   ...:     a = tf.Variable(1, name='a')
   ...:     b = a + 1
   ...:

In [4]: print(a)
<tf.Variable 'a:0' shape=() dtype=int32_ref>

In [5]: print(b)
Tensor("add:0", shape=(), dtype=int32)

In [6]: id(a)
Out[6]: 140253111576208

In [7]: id(b)
Out[7]: 140252306449616

a and b are not referring the same object in memory.

Draw the computation graph, or memory graph

first-line,

# a = tf.Varaible(...
a -> var(a)

second line,

# b = a + 1
b -> add - var(a)
      |
       \-- 1

now if you replace it back to your b = a + 1 to a = a + 1, the a after assign operation is pointing to an tf.add object instead of the variable a incremented by 1.

When you run sess.run, you are fetching the result by that add operator with no side effect to the original a variable.

tf.assign, on the other hand, will have the side effect of updating the state of the graph under the session.

159

answered Oct 21 '22 04:10

Izana

The main confusion here is that doing a = a + 1 will reassign the Python variable a to the resulting tensor of the addition operation a + 1. tf.assign, on the other hand, is an operation for setting the value of a TensorFlow variable.

a = tf.Variable(1, name="a")
a = a + 1

This is equivalent to:

a = tf.add(tf.Variable(1, name="a"), 1)

With that in mind:

In the 2nd snippet using (=), when I have sess.run(a), it seems I'm running an assign op. So does "a = a+1" internally create an assignment op like assign_op = tf.assign(a, a+1)? [...]

It might look so, but not true. As explained above, this will only reassign the Python variable. And without tf.assign or any other operation that changes the variable, it stays with the value 1. Each time a is evaluated, the program will always calculate a + 1 => 1 + 1.

I'm not sure how to explain the 3rd snippet. Why the two evals increment a, but the two evals in the 2nd snippet doesn't?

That's because calling eval() on the assignment tensor in the third snippet also triggers the variable assignment (note that this isn't much different from doing session.run(a) with the current session).

answered Oct 21 '22 06:10

E_net4 stands with Ukraine

Related questions
                            
                                How to get code completion for Tensorflow in PyCharm?
                            
                                How to clear GPU memory occupied by zombie process if it's parent is init?
                            
                                How to Fine tune existing Tensorflow Object Detection model to recognize additional classes? [closed]
                            
                                When to use tf.resource and tf.variant?
                            
                                Tensorflow Estimator: Cache bottlenecks
                            
                                Exact model converging on keras-tf but not on keras
                            
                                looping through dataset once at test time in tensorflow
                            
                                Wide & Deep learning for large data error: GraphDef cannot be larger than 2GB
                            
                                Training of keras model get's slower after each repetition
                            
                                Is there a way to use tensorflow map_fn on GPU?
                            
                                Keras custom loss implementation : ValueError: An operation has `None` for gradient
                            
                                Tensorflow 2.0 Keras is training 4x slower than 2.0 Estimator
                            
                                Why does my keras LSTM model get stuck in an infinite loop?
                            
                                Can ReLU handle a negative input?
                            
                                PyTorch equivalence for softmax_cross_entropy_with_logits
                            
                                Graph optimizations on a tensorflow serveable created using tf.Estimator
                            
                                how is total loss calculated over multiple classes in Keras?
                            
                                Design patterns for tensorflow models

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With