I have created a custom layer (called GraphGather) in Keras, yet the output tensor prints as:
```
Tensor("graph_gather/Tanh:0", shape=(?, ?), dtype=float32)
```
For some reason the shape is being returned as (?, ?), which causes the next dense layer to raise the following error:
```
ValueError: The last dimension of the inputs to `Dense` should be defined. Found `None`.
```
The GraphGather layer code is as follows:
```python
class GraphGather(tf.keras.layers.Layer):

    def __init__(self, batch_size, num_mols_in_batch, activation_fn=None, **kwargs):
        self.batch_size = batch_size
        self.num_mols_in_batch = num_mols_in_batch
        self.activation_fn = activation_fn
        super(GraphGather, self).__init__(**kwargs)

    def build(self, input_shape):
        super(GraphGather, self).build(input_shape)

    def call(self, x, **kwargs):
        # some operations (most of call omitted)
        out_tensor = result_of_operations()  # this line is pseudo-code
        if self.activation_fn is not None:
            out_tensor = self.activation_fn(out_tensor)
        return out_tensor

    def compute_output_shape(self, input_shape):
        return (self.num_mols_in_batch, 2 * input_shape[0][-1])
```
I have also tried hardcoding compute_output_shape to be:
```python
def compute_output_shape(self, input_shape):
    return (64, 150)
```
Yet the printed output tensor is still
```
Tensor("graph_gather/Tanh:0", shape=(?, ?), dtype=float32)
```
which causes the ValueError described above.
The compute_output_shape method is supposed to determine a layer's output shape.
However, tf.keras currently uses compute_output_shape to set the output shape only when a layer is dynamic, i.e. when it can only be run eagerly.
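As a hedged illustration of that behavior (the layer below is made up for demonstration and requires TF 2.x, where the `dynamic` argument exists; it is not from the question):

```python
import tensorflow as tf

class TileRows(tf.keras.layers.Layer):
    """Hypothetical layer, used only to illustrate dynamic=True."""

    def __init__(self, num_rows, **kwargs):
        # dynamic=True forces this layer to run eagerly, which is the
        # one case where Keras consults compute_output_shape.
        super(TileRows, self).__init__(dynamic=True, **kwargs)
        self.num_rows = num_rows

    def call(self, x):
        # Collapse the batch to one row, then tile to a fixed row count.
        pooled = tf.reduce_mean(x, axis=0, keepdims=True)
        return tf.tile(pooled, [self.num_rows, 1])

    def compute_output_shape(self, input_shape):
        # Used by Keras for shape inference because the layer is dynamic.
        return (self.num_rows, input_shape[-1])
```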
Many machine learning models are expressible as the composition and stacking of relatively simple layers. TensorFlow provides a large set of common layers, along with easy ways to write your own application-specific layers, either from scratch or as compositions of existing ones.
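For reference, a minimal trainable from-scratch layer follows the standard pattern below (a generic sketch, not code from the question); note the comment on why a defined last input dimension matters:

```python
import tensorflow as tf

class MyDenseLayer(tf.keras.layers.Layer):
    """A from-scratch linear layer with a single trainable kernel."""

    def __init__(self, num_outputs, **kwargs):
        super(MyDenseLayer, self).__init__(**kwargs)
        self.num_outputs = num_outputs

    def build(self, input_shape):
        # The kernel shape depends on the input's last dimension; this is
        # exactly why Dense fails when that dimension is None.
        self.kernel = self.add_weight(
            name="kernel",
            shape=(int(input_shape[-1]), self.num_outputs))
        super(MyDenseLayer, self).build(input_shape)

    def call(self, inputs):
        return tf.matmul(inputs, self.kernel)
```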
I had the same problem. My workaround was to add the following lines to the call method:
```python
input_shape = tf.shape(x)
```
and then:
```python
return tf.reshape(out_tensor, self.compute_output_shape(input_shape))
```
I haven't run into any problems with it yet.
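In context, the workaround slots into the question's call method roughly like this (a sketch; result_of_operations is still the question's pseudo-code placeholder):

```python
def call(self, x, **kwargs):
    # ... omitted operations ...
    out_tensor = result_of_operations()  # pseudo-code placeholder
    if self.activation_fn is not None:
        out_tensor = self.activation_fn(out_tensor)
    # Reshape to the declared output shape so downstream layers
    # see defined dimensions instead of (?, ?).
    input_shape = tf.shape(x)
    return tf.reshape(out_tensor, self.compute_output_shape(input_shape))
```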