A Keras model can used as a Tensorflow function on a Tensor, through the functional API, as described here. So we can do: <pre class="prettyprint"><code>from keras.layers import InputLayer a = tf.placeholder(dtype=tf.float32, shape=(None, 784)) model = Sequential() model.add(InputLayer(input_tensor=a, input_shape=(None, 784))) model.add(Dense(32, activation='relu')) model.add(Dense(10, activation='softmax')) output = model.output </code></pre> Which is a tensor: <pre class="prettyprint"><code><tf.Tensor 'dense_24/Softmax:0' shape=(?, 10) dtype=float32> </code></pre> But, this also works without any <code>InputLayer</code>: <pre class="prettyprint"><code>a = tf.placeholder(dtype=tf.float32, shape=(None, 784)) model = Sequential() model.add(Dense(32, activation='relu', input_shape=(784,))) model.add(Dense(10, activation='softmax')) output = model(a) </code></pre> works, and <code>output</code> has the same shape as before: <pre class="prettyprint"><code><tf.Tensor 'sequential_9/dense_22/Softmax:0' shape=(?, 10) dtype=float32> </code></pre> I assume the first form permits: <ul> <li>to explicitely attach the <code>inputs</code> and <code>outputs</code> as attributes of the model (of the same names), so we can reuse them elsewhere. For example with other TF ops.</li> <li>to transform the tensors given as inputs into Keras inputs, with additional metadata (such as <code>_keras_history</code> as stated in the source code).</li> </ul> But this is not something we cannot do with the second form, so, is there a special usage of the <code>InputLayer</code> (and <code>Input</code> a fortiori) (except for multiple inputs)? Moreover, the <code>InputLayer</code> is tricky because it's using <code>input_shape</code> differently from other keras layers: we specify the batch size (<code>None</code> here), which is not usually the case...

It would seem that <code>InputLayer</code> has some uses: <ul> <li> First, it allows you to give pure tensorflow tensors as is, without specifying their shape. E.g. you could have written <pre class="prettyprint"><code> model.add(InputLayer(input_tensor=a)) </code></pre> This is nice for several obvious reasons, among others less duplication. </li> <li> Second, they allow you to write non-sequential networks with a single input, e.g. <pre class="prettyprint"><code> input / \ / \ / \ conv1 conv2 | | </code></pre> Without <code>InputLayer</code> you would need to explicitly feed <code>conv1</code> and <code>conv2</code> the same tensor, or create an arbitrary identity layer on top of the model. Neither is quite pleasing. </li> <li> Finally, they remove the arbitrary distinction between "layers that are also inputs" and "normal layers". If you use <code>InputLayer</code> you can write code where there is a clear distinction between what layer is the input and what layer does something. This improves code readability and makes refactoring much easier. For example, replacing the first layer becomes just as easy as replacing any other layer, you don't need to think about <code>input_shape</code>. </li> </ul>

What is the advantage of using an InputLayer (or an Input) in a Keras model with Tensorflow tensors?

Tags:

A Keras model can used as a Tensorflow function on a Tensor, through the functional API, as described here.

So we can do:

from keras.layers import InputLayer  a = tf.placeholder(dtype=tf.float32, shape=(None, 784))  model = Sequential() model.add(InputLayer(input_tensor=a, input_shape=(None, 784))) model.add(Dense(32, activation='relu')) model.add(Dense(10, activation='softmax'))  output = model.output

Which is a tensor:

<tf.Tensor 'dense_24/Softmax:0' shape=(?, 10) dtype=float32>

But, this also works without any InputLayer:

a = tf.placeholder(dtype=tf.float32, shape=(None, 784))  model = Sequential() model.add(Dense(32, activation='relu', input_shape=(784,))) model.add(Dense(10, activation='softmax'))  output = model(a)

works, and output has the same shape as before:

<tf.Tensor 'sequential_9/dense_22/Softmax:0' shape=(?, 10) dtype=float32>

I assume the first form permits:

to explicitely attach the inputs and outputs as attributes of the model (of the same names), so we can reuse them elsewhere. For example with other TF ops.
to transform the tensors given as inputs into Keras inputs, with additional metadata (such as _keras_history as stated in the source code).

But this is not something we cannot do with the second form, so, is there a special usage of the InputLayer (and Input a fortiori) (except for multiple inputs)?
Moreover, the InputLayer is tricky because it's using input_shape differently from other keras layers: we specify the batch size (None here), which is not usually the case...

845

asked Jul 20 '17 14:07

Phylliade

1 Answers

It would seem that InputLayer has some uses:

First, it allows you to give pure tensorflow tensors as is, without specifying their shape. E.g. you could have written
```
  model.add(InputLayer(input_tensor=a)) 
```
This is nice for several obvious reasons, among others less duplication.
Second, they allow you to write non-sequential networks with a single input, e.g.
```
      input        / \       /   \      /     \   conv1   conv2     |       | 
```
Without InputLayer you would need to explicitly feed conv1 and conv2 the same tensor, or create an arbitrary identity layer on top of the model. Neither is quite pleasing.
Finally, they remove the arbitrary distinction between "layers that are also inputs" and "normal layers". If you use InputLayer you can write code where there is a clear distinction between what layer is the input and what layer does something. This improves code readability and makes refactoring much easier. For example, replacing the first layer becomes just as easy as replacing any other layer, you don't need to think about input_shape.

167

answered Oct 03 '22 19:10

Jonas Adler

Related questions
                            
                                Why is it faster to perform float by float matrix multiplication compared to int by int?
                            
                                Can Firebase Cloud Storage rules validate against Firestore data?
                            
                                Is there an XML schema extension for Visual Studio Code?
                            
                                Can't bind to 'icon' since it isn't a known property of 'fa-icon'
                            
                                How to uninstall Elm package?
                            
                                Angular 6 building a library with assets
                            
                                Why examples don't work? (a struggle with imports)
                            
                                What is difference between release notes and changelog?
                            
                                convert io.StringIO to io.BytesIO
                            
                                Variable scope and name resolution in Python
                            
                                Replace wildcards in a binary string avoiding three identical consecutive letters
                            
                                OpenID as a Single Sign On option? [closed]

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With