I have read about tf.get_variable from this question and also a bit from the documentation available on the TensorFlow website. However, I am still not clear on it and was unable to find an answer online. How does tf.get_variable work? For example:
    var1 = tf.Variable(3., dtype=tf.float64)
    var2 = tf.get_variable("var1", [], dtype=tf.float64)
Does it mean that var2 is another variable with initialization similar to var1? Or is var2 an alias for var1 (I tried, and it doesn't seem to be)?
How are var1 and var2 related?
How is a variable constructed when the variable we are getting doesn't really exist?
As far as I know, Variable is the default operation for making a variable, and get_variable is mainly used for weight sharing.
We use the tf.Variable() function to create a Variable and define the value it will be initialized with. We then have to explicitly perform an initialization operation by running tf.global_variables_initializer() in a session, which allocates the memory for the Variable and sets its initial values.
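A minimal sketch of this create-then-initialize pattern under the TF1 graph/session model (the variable name here is illustrative):

    import tensorflow as tf

    # Define the variable and its initial value; nothing is allocated yet.
    w = tf.Variable(3., dtype=tf.float64)

    # Op that allocates memory for all variables and sets their initial values.
    init = tf.global_variables_initializer()

    with tf.Session() as sess:
        sess.run(init)          # initialization actually happens here
        print(sess.run(w))      # 3.0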
TensorFlow variables represent tensors whose values can be changed by running operations on them. assign() is a method of the Variable class used to assign a new tf.Tensor to the variable. The new value must have the same shape and dtype as the old Variable value.
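A small sketch of assign() under the same session model (names illustrative):

    import tensorflow as tf

    v = tf.Variable(1., dtype=tf.float64)
    update = v.assign(2.)   # the new value must match v's shape and dtype

    with tf.Session() as sess:
        sess.run(tf.global_variables_initializer())
        print(sess.run(v))   # 1.0
        sess.run(update)     # run the assign op to change the value
        print(sess.run(v))   # 2.0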
tf.get_variable(name) creates a new variable called name in the TensorFlow graph (or appends a suffix such as _1 if name already exists in the current scope).
In your example, you're creating a Python variable called var1. The name of that variable in the TensorFlow graph is not var1, but Variable:0. Every node you define has its own name, which you can specify or let TensorFlow assign a default (and always unique) one. You can see the name by accessing the name property of the Python variable (i.e., print(var1.name)).
On your second line, you're defining a Python variable var2 whose name in the TensorFlow graph is var1.
The script

    import tensorflow as tf

    var1 = tf.Variable(3., dtype=tf.float64)
    print(var1.name)
    var2 = tf.get_variable("var1", [], dtype=tf.float64)
    print(var2.name)

in fact prints:

    Variable:0
    var1:0
If, instead, you want to define a variable (node) called var1 in the TensorFlow graph and then get a reference to that node, you cannot simply use tf.get_variable("var1"), because it will create a new, different variable called var1_1.
This script

    var1 = tf.Variable(3., dtype=tf.float64, name="var1")
    print(var1.name)
    var2 = tf.get_variable("var1", [], dtype=tf.float64)
    print(var2.name)

prints:

    var1:0
    var1_1:0
If you want to create a reference to the node var1, you first:

1. Have to replace tf.Variable with tf.get_variable. Variables created with tf.Variable can't be shared, while those created with tf.get_variable can.
2. Have to know the scope var1 is in, and allow the reuse of that scope when declaring the reference.
Looking at the code is the best way to understand:

    import tensorflow as tf

    # var1 = tf.Variable(3., dtype=tf.float64, name="var1")
    var1 = tf.get_variable(initializer=tf.constant_initializer(3.),
                           dtype=tf.float64, name="var1", shape=())
    current_scope = tf.contrib.framework.get_name_scope()
    print(var1.name)
    with tf.variable_scope(current_scope, reuse=True):
        var2 = tf.get_variable("var1", [], dtype=tf.float64)
        print(var2.name)

outputs:

    var1:0
    var1:0
If you ask tf.get_variable() for a name that has already been defined in a scope without reuse enabled, TensorFlow throws an exception. This is why it is convenient to use tf.get_variable() instead of tf.Variable(): with reuse enabled, tf.get_variable() returns the existing variable with the same name if it exists, and otherwise creates a variable with the specified shape and initializer.
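A short sketch of both behaviors (the scope and variable names are illustrative):

    import tensorflow as tf

    with tf.variable_scope("scope"):
        a = tf.get_variable("x", shape=(), dtype=tf.float64)

    # Without reuse, requesting the same name again raises a ValueError:
    # with tf.variable_scope("scope"):
    #     tf.get_variable("x", shape=(), dtype=tf.float64)  # ValueError

    # With reuse, the existing variable is returned:
    with tf.variable_scope("scope", reuse=True):
        b = tf.get_variable("x", dtype=tf.float64)

    print(a is b)  # True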