Why is it that `input_shape` does not include the batch dimension when passed as an argument to the `Dense` layer?

Tags: python, tensorflow, keras

In Keras, why does `input_shape` exclude the batch dimension when passed as an argument to layers like `Dense`, but include the batch dimension when passed to the `build` method of a model?

```python
import tensorflow as tf
from tensorflow.keras.layers import Dense

if __name__ == "__main__":
    model1 = tf.keras.Sequential([Dense(1, input_shape=[10])])
    model1.summary()

    model2 = tf.keras.Sequential([Dense(1)])
    model2.build(input_shape=[None, 10])  # why [None, 10] and not [10]?
    model2.summary()
```

Is this a conscious choice of API design? If it is, why?

asked Nov 04 '20 13:11

Jensun Ravichandran

1 Answer

You can specify the input shape of your model in several different ways, for example by providing one of the following arguments to the first layer of your model:

`batch_input_shape`: A tuple whose first dimension is the batch size.
`input_shape`: A tuple that does not include the batch size, i.e., the batch size is assumed to be `None`, or `batch_size` if that argument is specified.
`input_dim`: A scalar indicating the dimension of the input.

In all these cases, Keras internally stores an attribute `_batch_input_shape`, which it uses to build the model.
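As a rough illustration, the sketch below reuses the two models from the question and checks that the per-sample `input_shape` and the batched shape passed to `build` lead to the same result. It assumes TensorFlow 2.x with `tf.keras`; the variable names `shape1`, `kernel1` and `kernel2` are only for illustration, and recent Keras versions may emit a warning recommending an explicit `Input` object instead of the `input_shape` argument.

```python
import tensorflow as tf
from tensorflow.keras.layers import Dense

# Per-sample shape given to the layer: Keras prepends the batch dimension itself.
model1 = tf.keras.Sequential([Dense(1, input_shape=(10,))])

# Full batched shape given to build(): None leaves the batch size unspecified.
model2 = tf.keras.Sequential([Dense(1)])
model2.build(input_shape=(None, 10))

# Illustrative names: collect the shapes so they can be compared.
shape1 = tuple(model1.input_shape)
kernel1 = tuple(model1.layers[0].kernel.shape)
kernel2 = tuple(model2.layers[0].kernel.shape)
print(shape1, kernel1, kernel2)
```

Both models should end up with a kernel of the same shape, since the batch dimension only describes how many samples arrive at once and does not affect the weights of a `Dense` layer.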

Regarding the build method, my guess is that this is indeed a conscious choice - information about the batch size might be useful to build the model in some (perhaps unthought-of) situations. Therefore, a framework that includes the batch dimension as input to build is more generic and complete than a framework that doesn't. Nonetheless, I agree with you that naming the argument batch_input_shape instead of input_shape would make everything more consistent.

It is also worth mentioning that users rarely need to call the `build` method themselves; Keras calls it internally when needed. Nowadays, it is even possible to omit the `input_shape` argument when creating the model (although methods like `summary` will then not work until the model is built). In this case, Keras infers the input shape from the argument `x` of `fit`.
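A minimal sketch of this deferred building, assuming TensorFlow 2.x and NumPy are installed; the dummy arrays and the names `built_before`, `built_after` and `kernel_shape` are hypothetical and only serve the illustration:

```python
import numpy as np
import tensorflow as tf

# No input shape is given, so the model cannot create its weights yet.
model = tf.keras.Sequential([tf.keras.layers.Dense(1)])
built_before = model.built

# Dummy data: 4 samples with 10 features each, and one target per sample.
x = np.zeros((4, 10), dtype="float32")
y = np.zeros((4, 1), dtype="float32")

model.compile(optimizer="sgd", loss="mse")
model.fit(x, y, epochs=1, verbose=0)  # the input shape is inferred from x here

built_after = model.built
kernel_shape = tuple(model.layers[0].kernel.shape)
print(built_before, built_after, kernel_shape)
```

After the call to `fit`, the model is built and `model.summary()` can be used as usual.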

answered Oct 17 '22 21:10

rvinas
