I am new to TensorFlow and I am not able to understand the difference between a variable and a constant. I get the idea that we use variables for equations and constants for direct values, but why does only code #1 work, and not code #2 and #3? Please also explain in which cases we have to run our graph first (a) and then our variable (b), i.e.
(a) session.run(model) (b) print(session.run(y))
and in which cases I can directly execute this command, i.e.
print(session.run(y))
Code #1 :
x = tf.constant(35, name='x')
y = tf.Variable(x + 5, name='y')
model = tf.global_variables_initializer()
with tf.Session() as session:
    session.run(model)
    print(session.run(y))
Code #2 :
x = tf.Variable(35, name='x')
y = tf.Variable(x + 5, name='y')
model = tf.global_variables_initializer()
with tf.Session() as session:
    session.run(model)
    print(session.run(y))
Code #3 :
x = tf.constant(35, name='x')
y = tf.constant(x + 5, name='y')
model = tf.global_variables_initializer()
with tf.Session() as session:
    session.run(model)
    print(session.run(y))
TensorFlow is an open-source Python library developed by Google for building machine learning models and deep learning neural networks. constant() is used to create a Tensor from tensor-like objects such as lists.
A variable is a state or value that can be modified by performing operations on it. In TensorFlow, variables are created using the Variable() constructor. The Variable() constructor expects an initial value for the variable, which can be a Tensor of any kind or shape.
tf.placeholder is used for inputs that will be provided externally to the computation at run-time (e.g. training data). tf.Variable is used for inputs that are part of the computation and are going to be modified by the computation (e.g. the weights of a neural network).
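The placeholder/Variable distinction above can be sketched as follows. This is a minimal example, written against the tf.compat.v1 API so it also runs under TensorFlow 2.x; under TensorFlow 1.x the same calls exist as tf.placeholder, tf.Variable, and tf.Session directly.

```python
import tensorflow as tf

tf1 = tf.compat.v1
tf1.disable_eager_execution()

x = tf1.placeholder(tf.float32, shape=[], name='x')  # supplied at run-time
w = tf1.Variable(2.0, name='w')                      # state held in the graph
y = w * x

with tf1.Session() as session:
    session.run(tf1.global_variables_initializer())
    # The placeholder gets its value through feed_dict on every run.
    result = session.run(y, feed_dict={x: 5.0})
    print(result)  # 10.0
```

Each call to session.run can feed a different value for x, while w keeps its state between runs.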
In TensorFlow, the difference between constants and variables is that when you declare a constant, its value can't be changed later (and the initialization must be with a value, not with an operation).
When you declare a Variable, however, you can change its value later with the tf.assign() method (and the initialization can be achieved with a value or with an operation).
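The tf.assign() point can be sketched like this. Again a minimal example using the tf.compat.v1 API so it also runs under TensorFlow 2.x; under 1.x the same calls exist as tf.Variable and tf.assign.

```python
import tensorflow as tf

tf1 = tf.compat.v1
tf1.disable_eager_execution()

v = tf1.Variable(35, name='v')
update = tf1.assign(v, v + 5)  # an op that overwrites v's value when run

with tf1.Session() as session:
    session.run(tf1.variables_initializer([v]))
    before = session.run(v)  # 35
    session.run(update)
    after = session.run(v)   # 40
    print(before, after)
```

Trying the same overwrite on a tf.constant is not possible: a constant's value is baked into the graph at construction time.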
The function tf.global_variables_initializer() initialises all variables in your graph with their declared initial values, but it runs those initializations with no guaranteed ordering, so it doesn't work properly when dependencies exist between variables.
Your first code (#1) works properly because there are no dependencies between variable initializations, and the constant is constructed with a value.
The second code (#2) doesn't work because of the unordered behaviour of tf.global_variables_initializer(). You can fix it using tf.variables_initializer() as follows:
x = tf.Variable(35, name='x')
model_x = tf.variables_initializer([x])
y = tf.Variable(x + 5, name='y')
model_y = tf.variables_initializer([y])
with tf.Session() as session:
    session.run(model_x)
    session.run(model_y)
    print(session.run(y))
The third code (#3) doesn't work because you are trying to initialize a constant with an operation, which isn't possible. To solve it, an appropriate strategy is the one used in (#1).
Regarding your last question: you need to run (a) session.run(model) before (b) print(session.run(y)) whenever there are variables in your computation graph.
I will point out the difference when using eager execution.
As of TensorFlow 2.0.b1, Variables and Constants trigger different behaviours when using tf.GradientTape. Strangely, the official documentation is not explicit enough about it.
Let's look at the example code from https://www.tensorflow.org/versions/r2.0/api_docs/python/tf/GradientTape
x = tf.constant(3.0)
with tf.GradientTape(persistent=True) as g:
    g.watch(x)
    y = x * x
    z = y * y
dz_dx = g.gradient(z, x)  # 108.0 (4*x^3 at x = 3)
dy_dx = g.gradient(y, x)  # 6.0
del g  # Drop the reference to the tape
You had to watch x, which is a Constant. GradientTape does NOT automatically watch constants in the context. Additionally, it can watch only one tensor per GradientTape. If you want to get gradients of multiple Constants, you need to nest GradientTapes. For example,
x = tf.constant(3.0)
x2 = tf.constant(3.0)
with tf.GradientTape(persistent=True) as g:
    g.watch(x)
    with tf.GradientTape(persistent=True) as g2:
        g2.watch(x2)
        y = x * x
        y2 = y * x2
dy_dx = g.gradient(y, x)       # 6
dy2_dx2 = g2.gradient(y2, x2)  # 9
del g, g2  # Drop the reference to the tapes
On the other hand, Variables are automatically watched by GradientTape:
"By default GradientTape will automatically watch any trainable variables that are accessed inside the context." Source: https://www.tensorflow.org/versions/r2.0/api_docs/python/tf/GradientTape
So the above will look like,
x = tf.Variable(3.0)
x2 = tf.Variable(3.0)
with tf.GradientTape(persistent=True) as g:
    y = x * x
    y2 = y * x2
dy_dx = g.gradient(y, x)      # 6
dy2_dx2 = g.gradient(y2, x2)  # 9
del g  # Drop the reference to the tape
print(dy_dx)
print(dy2_dx2)
Of course, you can turn off the automatic watching by passing watch_accessed_variables=False. The examples may not be very practical, but I hope this clears up someone's confusion.
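A short sketch of that flag (assumes TensorFlow 2.x eager mode): with watch_accessed_variables=False the tape ignores even Variables unless you watch them explicitly, so the gradient with respect to an unwatched Variable comes back as None.

```python
import tensorflow as tf

x = tf.Variable(3.0)
z = tf.Variable(4.0)

with tf.GradientTape(watch_accessed_variables=False) as g:
    g.watch(x)          # only x is tracked; z is ignored
    y = x * x + z * z

dy = g.gradient(y, [x, z])
print(dy[0])  # gradient w.r.t. x is 6.0
print(dy[1])  # None: z was never watched
```

This is useful when a computation touches many Variables but you only want gradients with respect to a few of them.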