When creating a Sequential model in Keras, I understand you provide the input shape in the first layer. Does this input shape then make an implicit input layer?
For example, the model below explicitly specifies 2 Dense layers, but is this actually a model with 3 layers consisting of one input layer implied by the input shape, one hidden dense layer with 32 neurons, and then one output layer with 10 possible outputs?
from tensorflow.keras.models import Sequential
from tensorflow.keras.layers import Dense, Activation

model = Sequential([
    Dense(32, input_shape=(784,)),
    Activation('relu'),
    Dense(10),
    Activation('softmax'),
])
All layers can be defined as Dense layers. However, there are two options for defining the input layer: we can use the InputLayer() class to define the input layer of a Keras Sequential model explicitly, or we can use the Dense() class with the input_shape argument, which adds the input layer behind the scenes.
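Here is a minimal sketch of both options (assuming the tensorflow.keras import path; the layer sizes are only illustrative):

from tensorflow.keras.models import Sequential
from tensorflow.keras.layers import InputLayer, Dense

# Option 1: an explicit InputLayer
model_a = Sequential([
    InputLayer(input_shape=(784,)),
    Dense(32, activation='relu'),
    Dense(10, activation='softmax'),
])

# Option 2: input_shape on the first Dense layer;
# the InputLayer is created behind the scenes
model_b = Sequential([
    Dense(32, activation='relu', input_shape=(784,)),
    Dense(10, activation='softmax'),
])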
In Keras, the input layer itself is not a layer, but a tensor. It's the starting tensor you send to the first hidden layer. This tensor must have the same shape as your training data.
It is generally recommended to use the Keras Functional model via Input (which creates an InputLayer) rather than using InputLayer directly. When using InputLayer with the Keras Sequential model, it can be skipped by moving the input_shape argument to the first layer after the InputLayer.
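As a small sketch of that recommendation (assuming the tensorflow.keras API), Input gives you back a tensor describing the expected shape, which the first real layer then consumes:

from tensorflow.keras import Input
from tensorflow.keras.layers import Dense

inputs = Input(shape=(784,))   # a symbolic tensor, not a layer object
print(inputs.shape)            # (None, 784) -- the batch dimension is left open
x = Dense(32, activation='relu')(inputs)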
Keras is able to handle multiple inputs (and even multiple outputs) via its functional API. Learn more about 3 ways to create a Keras model with TensorFlow 2.0 (Sequential, Functional, and Model Subclassing).
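As a rough sketch of a multi-input model (assuming the tensorflow.keras API; the input names and sizes here are invented purely for illustration):

from tensorflow.keras import Input, Model
from tensorflow.keras.layers import Dense, concatenate

image_features = Input(shape=(784,), name='image_features')
metadata = Input(shape=(10,), name='metadata')

x = concatenate([image_features, metadata])   # merge the two input branches
x = Dense(32, activation='relu')(x)
outputs = Dense(1, activation='sigmoid')(x)

model = Model(inputs=[image_features, metadata], outputs=outputs)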
Well, it actually is an implicit input layer indeed, i.e. your model is an example of a "good old" neural net with three layers - input, hidden, and output. This is more explicitly visible in the Keras Functional API (check the example in the docs), in which your model would be written as:
from tensorflow.keras.layers import Input, Dense
from tensorflow.keras.models import Model

inputs = Input(shape=(784,))                  # input layer
x = Dense(32, activation='relu')(inputs)      # hidden layer
outputs = Dense(10, activation='softmax')(x)  # output layer
model = Model(inputs, outputs)
Actually, this implicit input layer is the reason why you have to include an input_shape argument only in the first (explicit) layer of the model in the Sequential API; in subsequent layers, the input shape is inferred from the output of the previous ones (see the comments in the source code of core.py).
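As a small sketch of that inference (assuming the tensorflow.keras API), only the first layer carries input_shape, and model.summary() reports the shapes worked out for the rest:

from tensorflow.keras.models import Sequential
from tensorflow.keras.layers import Dense

model = Sequential([
    Dense(32, activation='relu', input_shape=(784,)),
    Dense(10, activation='softmax'),   # input shape inferred as (None, 32)
])

model.summary()   # output shapes: (None, 32) and (None, 10)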
You may also find the documentation on tf.contrib.keras.layers.Input enlightening.
It depends on your perspective :-)
Rewriting your code in line with more recent Keras tutorial examples, you would probably use:
model = Sequential()
model.add(Dense(32, activation='relu', input_dim=784))
model.add(Dense(10, activation='softmax'))
...which makes it much more explicit that you only have 2 Keras layers. And this is exactly what you do have (in Keras, at least) because the "input layer" is not really a (Keras) layer at all: it's only a place to store a tensor, so it may as well be a tensor itself.
Each Keras layer is a transformation that outputs a tensor, possibly of a different size/shape to the input. So while there are 3 identifiable tensors here (input, outputs of the two layers), there are only 2 transformations involved corresponding to the 2 Keras layers.
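A quick check of this (a sketch that assumes the two-layer model built just above):

print(len(model.layers))                         # 2 -- the implicit input is not listed
print([layer.name for layer in model.layers])    # only the two Dense layers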
On the other hand, graphically, you might represent this network with 3 (graphical) layers of nodes, and two sets of lines connecting the layers of nodes. Graphically, it's a 3-layer network. But "layers" in this graphical notation are bunches of circles that sit on a page doing nothing, whereas layers in Keras transform tensors and do actual work for you. Personally, I would get used to the Keras perspective :-)
Note finally that, for fun and/or simplicity, I substituted input_dim=784 for input_shape=(784,) to avoid the syntax that Python uses both to confuse newcomers and to create a 1-D tuple: (<value>,).
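For illustration, here is a sketch of the two equivalent spellings of the same 784-dimensional input (assuming tensorflow.keras):

from tensorflow.keras.layers import Dense

layer_a = Dense(32, activation='relu', input_dim=784)
layer_b = Dense(32, activation='relu', input_shape=(784,))   # note the trailing comma: a 1-element tuple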