I'm unsure about the practical differences between the 4 variations below (they all evaluate to the same value). My understanding is that if I call `tf.add()`, it will create an operation on the graph, and otherwise it might not. If I don't create the `tf.constant()` objects at the beginning, I believe the constants will be created implicitly when doing the addition; but for `tf.add(a, b)` vs `a + b` where `a` and `b` are both Tensors (#1 and #3), I can see no difference besides the default naming (the former is `Add` and the latter is `add`). Can anyone shed some light on the differences between those, and when should one use each?
## 1
a = tf.constant(1)
b = tf.constant(1)
x = tf.add(a, b)
with tf.Session() as sess:
    x.eval()
## 2
a = 1
b = 1
x = tf.add(a, b)
with tf.Session() as sess:
    x.eval()
## 3
a = tf.constant(1)
b = tf.constant(1)
x = a + b
with tf.Session() as sess:
    x.eval()
## 4
a = 1
b = tf.constant(1)
x = a + b
with tf.Session() as sess:
    x.eval()
The most important difference between Variables and Tensors is mutability. The values in a `tf.Variable` object can be updated (e.g., with the `assign()` method), whereas the values of a Tensor cannot be updated; you can only create a new Tensor object holding the new values.
TensorFlow, as the name indicates, is a framework to define and run computations involving tensors. A tensor is a generalization of vectors and matrices to potentially higher dimensions; internally, TensorFlow represents tensors as n-dimensional arrays with a uniform element type (called a `dtype`).
A `tf.Variable`, by contrast, represents a tensor whose value can be changed by running ops on it: specific ops allow you to read and modify the values of this tensor.
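The mutability contrast above can be sketched in plain Python (this is a conceptual illustration, not TensorFlow's actual implementation): a `Variable`-like object exposes an `assign()` that mutates it in place, while a `Tensor`-like object offers no setter, so "updating" it means constructing a new object.

```python
# Conceptual sketch of the mutability difference (not TensorFlow code).
class Tensor:
    """Immutable: the value is fixed at construction time."""
    def __init__(self, value):
        self._value = value

    @property
    def value(self):
        return self._value  # read-only: no setter is provided


class Variable:
    """Mutable: the value can be updated in place, like tf.Variable.assign()."""
    def __init__(self, value):
        self._value = value

    def assign(self, value):
        self._value = value
        return self

    @property
    def value(self):
        return self._value


v = Variable(1)
v.assign(2)               # updates the existing object in place
assert v.value == 2

t = Tensor(1)
t2 = Tensor(t.value + 1)  # a brand-new Tensor; the original is unchanged
assert t.value == 1 and t2.value == 2
```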
The four examples you gave will all give the same result, and generate the same graph (if you ignore that some of the operation names in the graph are different). TensorFlow will convert many different Python objects into `tf.Tensor` objects when they are passed as arguments to TensorFlow operators, such as `tf.add()` here. The `+` operator is just a simple wrapper on `tf.add()`, and the overload is used when either the left-hand or right-hand argument is a `tf.Tensor` (or a `tf.Variable`).
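The mechanism behind that overload is ordinary Python operator dispatch. A minimal sketch (deliberately simplified; the `Tensor`, `add`, and `convert` names here are stand-ins, not TensorFlow's real classes) shows why all four variants in the question reduce to the same operation:

```python
# Sketch of how `+` can dispatch to an add function when either operand
# is a Tensor. Not TensorFlow's implementation, just the dispatch idea.
class Tensor:
    def __init__(self, value, name="Const"):
        self.value = value
        self.name = name

    def __add__(self, other):   # fires for: Tensor + anything
        return add(self, other)

    def __radd__(self, other):  # fires for: plain_value + Tensor
        return add(other, self)


def convert(x):
    """Convert a plain Python value to a Tensor (cf. tf.convert_to_tensor)."""
    return x if isinstance(x, Tensor) else Tensor(x)


def add(a, b, name="add"):
    """Stand-in for tf.add(): convert both arguments, then add."""
    a, b = convert(a), convert(b)
    return Tensor(a.value + b.value, name=name)


# The question's four variants all end up in the same add() call:
x1 = add(Tensor(1), Tensor(1))  # tf.add(a, b) with two Tensors
x2 = add(1, 1)                  # tf.add(1, 1): both ints are converted
x3 = Tensor(1) + Tensor(1)      # __add__ handles the overload
x4 = 1 + Tensor(1)              # __radd__ handles the right-hand Tensor
assert x1.value == x2.value == x3.value == x4.value == 2
```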
Given that you can just pass many Python objects to TensorFlow operators, why would you ever use `tf.constant()`? There are a few reasons:
If you use the same Python object as the argument to multiple different operations, TensorFlow will convert it to a tensor multiple times, and represent each of those tensors in the graph. Therefore, if your Python object is a large NumPy array, you may run out of memory if you make too many copies of that array's data. In that case, you may wish to convert the array to a `tf.Tensor` once and reuse that single tensor in each of the ops.
Creating a `tf.constant()` explicitly allows you to set its `name` property, which can be useful for TensorBoard debugging and graph visualization. (Note though that the default TensorFlow ops will attempt to give a meaningful name to each automatically converted tensor, based on the name of the op's argument.)
Creating a `tf.constant()` explicitly allows you to set the exact element type of the tensor. TensorFlow will convert Python `int` objects to `tf.int32`, and `float` objects to `tf.float32`. If you want `tf.int64` or `tf.float64`, you can get this by passing the same value to `tf.constant()` with an explicit `dtype` argument.
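The default-type behavior described in the last point can be sketched as a small inference function (a hypothetical illustration of the rule, not TensorFlow's internal code): Python `int` maps to `int32` and `float` to `float32` unless an explicit `dtype` overrides the default, as `tf.constant(..., dtype=...)` allows.

```python
# Sketch of the default dtype rule described above (illustration only).
def infer_dtype(value, dtype=None):
    if dtype is not None:
        return dtype                 # an explicit dtype always wins
    if isinstance(value, bool):      # bool is a subclass of int in Python,
        return "bool"                # so it must be checked first
    if isinstance(value, int):
        return "int32"               # Python int -> tf.int32 by default
    if isinstance(value, float):
        return "float32"             # Python float -> tf.float32 by default
    raise TypeError(f"unsupported type: {type(value).__name__}")


assert infer_dtype(1) == "int32"
assert infer_dtype(1.0) == "float32"
assert infer_dtype(1, dtype="int64") == "int64"   # explicit dtype overrides
```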
The `tf.constant()` function also offers a useful feature when creating large tensors with a repeated value:
c = tf.constant(17.0, shape=[1024, 1024], dtype=tf.float32)
The tensor `c` above represents 4 * 1024 * 1024 bytes of data, but TensorFlow will represent it compactly in the graph as a single float `17.0` plus shape information that indicates how it should be interpreted. If you have many large, filled constants in your graph, it can be more efficient to create them this way.
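The idea behind that compact representation can be sketched as a tiny class (a conceptual model, not how TensorFlow actually stores constants): the graph only needs to hold one scalar plus a shape, and the full array is materialized only when the elements are actually needed.

```python
# Sketch of a compactly stored filled constant (illustration only).
class FilledConstant:
    def __init__(self, value, shape):
        self.value = value   # the single repeated scalar, e.g. 17.0
        self.shape = shape   # shape information, e.g. [1024, 1024]

    def num_elements(self):
        n = 1
        for dim in self.shape:
            n *= dim
        return n

    def materialize(self):
        """Expand into a nested list; the compact form never stores this."""
        def build(dims):
            if not dims:
                return self.value
            return [build(dims[1:]) for _ in range(dims[0])]
        return build(self.shape)


c = FilledConstant(17.0, [1024, 1024])
# The stored representation is two small fields, not 1024*1024 floats:
assert c.num_elements() == 1024 * 1024
# Materializing a small example shows what the compact form stands for:
assert FilledConstant(17.0, [2, 3]).materialize() == [[17.0] * 3] * 2
```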
They are all the same.
The Python `+` in `a + b` is captured by TensorFlow and actually generates the same op as `tf.add(a, b)` does.
`tf.constant()` allows you more specifics, such as defining the shape, type and name of the created tensor. But again, TensorFlow converts the `a` in your example `a = 1`, and it is equivalent to `tf.constant(1)` (treating the constant as an int value in this case).