I have an image that is 478 x 717 x 3 = 1028178 pixels, with a rank of 1. I verified it by calling tf.shape and tf.rank. When I call image.set_shape([478, 717, 3]), it throws the following error. <pre class="prettyprint"><code>"Shapes %s and %s must have the same rank" % (self, other)) ValueError: Shapes (?,) and (478, 717, 3) must have the same rank </code></pre> I tested again by first casting to 1028178, but the error still exists. <pre class="prettyprint"><code>ValueError: Shapes (1028178,) and (478, 717, 3) must have the same rank </code></pre> Well, that does make sense because one is of rank 1 and the other is of rank 3. However, why is it necessary to throw an error, as the total number of pixels still match. I could of course use tf.reshape and it works, but I think that's not optimal. As stated on the TensorFlow FAQ <blockquote> What is the difference between x.set_shape() and x = tf.reshape(x)? The tf.Tensor.set_shape() method updates the static shape of a Tensor object, and it is typically used to provide additional shape information when this cannot be inferred directly. It does not change the dynamic shape of the tensor. The tf.reshape() operation creates a new tensor with a different dynamic shape. </blockquote> Creating a new tensor involves memory allocation and that could potentially be more costly when more training examples are involved. Is this by design, or am I missing something here?

As far as I know (and I wrote that code), there isn't a bug in <code>Tensor.set_shape()</code>. I think the misunderstanding stems from the confusing name of that method. To elaborate on the FAQ entry you quoted, <code>Tensor.set_shape()</code> is a pure-Python function that improves the shape information for a given <code>tf.Tensor</code> object. By "improves", I mean "makes more specific". Therefore, when you have a <code>Tensor</code> object <code>t</code> with shape <code>(?,)</code>, that is a one-dimensional tensor of unknown length. You can call <code>t.set_shape((1028178,))</code>, and then <code>t</code> will have shape <code>(1028178,)</code> when you call <code>t.get_shape()</code>. This doesn't affect the underlying storage, or indeed anything on the backend: it merely means that subsequent shape inference using <code>t</code> can rely on the assertion that it is a vector of length 1028178. If <code>t</code> has shape <code>(?,)</code>, a call to <code>t.set_shape((478, 717, 3))</code> will fail, because TensorFlow already knows that <code>t</code> is a vector, so it cannot have shape <code>(478, 717, 3)</code>. If you want to make a new Tensor with that shape from the contents of <code>t</code>, you can use <code>reshaped_t = tf.reshape(t, (478, 717, 3))</code>. This creates a new <code>tf.Tensor</code> object in Python; the actual implementation of <code>tf.reshape()</code> does this using a shallow copy of the tensor buffer, so it is inexpensive in practice. One analogy is that <code>Tensor.set_shape()</code> is like a run-time cast in an object-oriented language like Java. For example, if you have a pointer to an <code>Object</code> but know that, in fact, it is a <code>String</code>, you might do the cast <code>(String) obj</code> in order to pass <code>obj</code> to a method that expects a <code>String</code> argument. However, if you have a <code>String</code> <code>s</code> and try to cast it to a <code>java.util.Vector</code>, the compiler will give you an error, because these two types are unrelated.

Clarification on tf.Tensor.set_shape()

Tags:

tensorflow

I have an image that is 478 x 717 x 3 = 1028178 pixels, with a rank of 1. I verified it by calling tf.shape and tf.rank.

When I call image.set_shape([478, 717, 3]), it throws the following error.

"Shapes %s and %s must have the same rank" % (self, other))  ValueError: Shapes (?,) and (478, 717, 3) must have the same rank

I tested again by first casting to 1028178, but the error still exists.

ValueError: Shapes (1028178,) and (478, 717, 3) must have the same rank

Well, that does make sense because one is of rank 1 and the other is of rank 3. However, why is it necessary to throw an error, as the total number of pixels still match.

I could of course use tf.reshape and it works, but I think that's not optimal.

As stated on the TensorFlow FAQ

What is the difference between x.set_shape() and x = tf.reshape(x)?

The tf.Tensor.set_shape() method updates the static shape of a Tensor object, and it is typically used to provide additional shape information when this cannot be inferred directly. It does not change the dynamic shape of the tensor.

The tf.reshape() operation creates a new tensor with a different dynamic shape.

Creating a new tensor involves memory allocation and that could potentially be more costly when more training examples are involved. Is this by design, or am I missing something here?

250

asked Feb 17 '16 08:02

jkschin

1 Answers

As far as I know (and I wrote that code), there isn't a bug in Tensor.set_shape(). I think the misunderstanding stems from the confusing name of that method.

To elaborate on the FAQ entry you quoted, Tensor.set_shape() is a pure-Python function that improves the shape information for a given tf.Tensor object. By "improves", I mean "makes more specific".

Therefore, when you have a Tensor object t with shape (?,), that is a one-dimensional tensor of unknown length. You can call t.set_shape((1028178,)), and then t will have shape (1028178,) when you call t.get_shape(). This doesn't affect the underlying storage, or indeed anything on the backend: it merely means that subsequent shape inference using t can rely on the assertion that it is a vector of length 1028178.

If t has shape (?,), a call to t.set_shape((478, 717, 3)) will fail, because TensorFlow already knows that t is a vector, so it cannot have shape (478, 717, 3). If you want to make a new Tensor with that shape from the contents of t, you can use reshaped_t = tf.reshape(t, (478, 717, 3)). This creates a new tf.Tensor object in Python; the actual implementation of tf.reshape() does this using a shallow copy of the tensor buffer, so it is inexpensive in practice.

One analogy is that Tensor.set_shape() is like a run-time cast in an object-oriented language like Java. For example, if you have a pointer to an Object but know that, in fact, it is a String, you might do the cast (String) obj in order to pass obj to a method that expects a String argument. However, if you have a String s and try to cast it to a java.util.Vector, the compiler will give you an error, because these two types are unrelated.

126

answered Sep 22 '22 06:09

mrry

Related questions
                            
                                The print of string constant is always attached with 'b' inTensorFlow [duplicate]
                            
                                Tensorflow Documentation
                            
                                Printing the loss during TensorFlow training
                            
                                How to train a model in nodejs (tensorflow.js)?
                            
                                Use shared GPU memory with TensorFlow?
                            
                                In TensorFlow, how can I get nonzero values and their indices from a tensor with python?
                            
                                How to convert keras(h5) file to a tflite file?
                            
                                Installing tensorflow with anaconda in windows
                            
                                What do the options in ConfigProto like allow_soft_placement and log_device_placement mean?
                            
                                Tensorflow Different ways to Export and Run graph in C++
                            
                                ValueError: Layer sequential_20 expects 1 inputs, but it received 2 input tensors
                            
                                Tensorflow: When use tf.expand_dims?
                            
                                Custom TensorFlow Keras optimizer
                            
                                What's the difference between Tensor and Variable in Tensorflow
                            
                                Are tf.layers.dense() and tf.contrib.layers.fully_connected() interchangeable?
                            
                                How do you get the name of the tensorflow output nodes in a Keras Model?
                            
                                Should TensorFlow users prefer SavedModel over Checkpoint or GraphDef?
                            
                                Multivariate LSTM with missing values
                            
                                Is there an easy way to get something like Keras model.summary in Tensorflow?
                            
                                Can't save custom subclassed model

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With