I need conditional control flow in my graph. If `pred` is `True`, the graph should call an op that updates a variable and then returns it; otherwise it should return the variable unchanged. A simplified version is:
```python
pred = tf.constant(True)
x = tf.Variable([1])
assign_x_2 = tf.assign(x, [2])

def update_x_2():
    with tf.control_dependencies([assign_x_2]):
        return tf.identity(x)

y = tf.cond(pred, update_x_2, lambda: tf.identity(x))

with tf.Session() as session:
    session.run(tf.initialize_all_variables())
    print(y.eval())
```
However, I find that both `pred=True` and `pred=False` lead to the same result, `y=[2]`, which means the assign op is also called when `update_x_2` is not selected by `tf.cond`. How can this be explained, and how can it be solved?
TL;DR: If you want `tf.cond()` to perform a side effect (like an assignment) in one of the branches, you must create the op that performs the side effect inside the function that you pass to `tf.cond()`.
The behavior of `tf.cond()` is a little unintuitive. Because execution in a TensorFlow graph flows forward through the graph, all operations that you refer to in either branch must execute before the conditional is evaluated. This means that both the true and the false branches receive a control dependency on the `tf.assign()` op, and so evaluating `y` always yields `[2]`, even if `pred` is `False`.
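A minimal reproduction of the pitfall, using the same TF 1.x API as the question: even with `pred=False`, evaluating `y` fires the externally created assign op and mutates `x`:

```python
import tensorflow as tf  # TF 1.x API, as in the question

pred = tf.constant(False)
x = tf.Variable([1])
assign_x_2 = tf.assign(x, [2])  # created OUTSIDE the branch function

def update_x_2():
    with tf.control_dependencies([assign_x_2]):
        return tf.identity(x)

y = tf.cond(pred, update_x_2, lambda: tf.identity(x))

with tf.Session() as session:
    session.run(tf.initialize_all_variables())
    print(y.eval())  # ==> [2], even though pred is False
    print(x.eval())  # ==> [2]: the assign op ran anyway
```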
The solution is to create the `tf.assign()` op inside the function that defines the true branch. For example, you could structure your code as follows:
```python
pred = tf.placeholder(tf.bool, shape=[])
x = tf.Variable([1])

def update_x_2():
    # The assign op is created INSIDE the true-branch function.
    with tf.control_dependencies([tf.assign(x, [2])]):
        return tf.identity(x)

y = tf.cond(pred, update_x_2, lambda: tf.identity(x))

with tf.Session() as session:
    session.run(tf.initialize_all_variables())
    print(y.eval(feed_dict={pred: False}))  # ==> [1]
    print(y.eval(feed_dict={pred: True}))   # ==> [2]
```
```python
pred = tf.constant(False)
x = tf.Variable([1])

def update_x_2():
    assign_x_2 = tf.assign(x, [2])
    with tf.control_dependencies([assign_x_2]):
        return tf.identity(x)

y = tf.cond(pred, update_x_2, lambda: tf.identity(x))

with tf.Session() as session:
    session.run(tf.initialize_all_variables())
    print(y.eval())
```
This will print the result `[1]`.
This answer is essentially the same as the one above. What I want to share is that you can put every op you would like to use inside its branch function, because, given your example code, the tensor `x` can be used directly by the `update_x_2` function.
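As a side note for readers on TensorFlow 2.x, where `tf.Session`, `tf.placeholder`, and `tf.assign` are gone from the default API, the same rule carries over when `tf.cond` runs inside a `tf.function`: create the side effect inside the branch function. A rough sketch, assuming TF 2.x:

```python
import tensorflow as tf  # assumes TF 2.x

x = tf.Variable([1])

def update_x_2():
    x.assign([2])  # side effect created inside the branch function
    return tf.identity(x)

@tf.function
def f(pred):
    return tf.cond(pred, update_x_2, lambda: tf.identity(x))

print(f(tf.constant(False)).numpy())  # ==> [1]
print(f(tf.constant(True)).numpy())   # ==> [2]
```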