I am trying to implement a denoising autoencoder with an LSTM layer in between. The architecture goes following. <pre class="prettyprint"><code>FC layer -> FC layer -> LSTM cell -> FC layer -> FC layer. </code></pre> I am unable to understand how my input dimension should be to implement this architecture? I tried the following code <pre class="prettyprint"><code>batch_size = 1 model = Sequential() model.add(Dense(5, input_shape=(1,))) model.add(Dense(10)) model.add(LSTM(32)) model.add(Dropout(0.3)) model.add(Dense(5)) model.add(Dense(1)) model.compile(loss='mean_squared_error', optimizer='adam') model.fit(trainX, trainY, nb_epoch=100, batch_size=batch_size, verbose=2) </code></pre> My trainX is [650,20,1] vector. It is a time series data in with only one feature. I am getting following error <pre class="prettyprint"><code>ValueError Traceback (most recent call last) <ipython-input-20-1248a33f6518> in <module>() 3 model.add(Dense(5, input_shape=(1,))) 4 model.add(Dense(10)) ----> 5 model.add(LSTM(32)) 6 model.add(Dropout(0.3)) 7 model.add(Dense(5)) /usr/local/lib/python2.7/dist-packages/keras/models.pyc in add(self, layer) 330 output_shapes=[self.outputs[0]._keras_shape]) 331 else: --> 332 output_tensor = layer(self.outputs[0]) 333 if isinstance(output_tensor, list): 334 raise TypeError('All layers in a Sequential model ' /usr/local/lib/python2.7/dist-packages/keras/engine/topology.pyc in __call__(self, x, mask) 527 # Raise exceptions in case the input is not compatible 528 # with the input_spec specified in the layer constructor. --> 529 self.assert_input_compatibility(x) 530 531 # Collect input shapes to build layer. /usr/local/lib/python2.7/dist-packages/keras/engine/topology.pyc in assert_input_compatibility(self, input) 467 self.name + ': expected ndim=' + 468 str(spec.ndim) + ', found ndim=' + --> 469 str(K.ndim(x))) 470 if spec.dtype is not None: 471 if K.dtype(x) != spec.dtype: ValueError: Input 0 is incompatible with layer lstm_10: expected ndim=3, found ndim=2 </code></pre>

The dense layer can take sequences as input and it will apply the same dense layer on every vector (last dimension). Example : You have a 2D tensor input that represents a sequence <code>(timesteps, dim_features)</code>, if you apply a dense layer to it with new_dim outputs, the tensor that you will have after the layer will be a new sequence <code>(timesteps, new_dim)</code> If you have a 3D tensor <code>(n_lines, n_words, embedding_dim)</code> that can be a document, with <code>n_lines</code> lines, <code>n_words</code> words per lines and <code>embedding_dim</code> dimensions for each word, applying a dense layer to it with new_dim outputs will get you a new doc tensor (3D) with shape <code>(n_lines, n_words, new_dim)</code> You can see here the dimensions input and output that you can feed and get with the Dense() layer.

Add dense layer before LSTM layer in keras or Tensorflow?

Tags:

neural-network

deep-learning

keras

lstm

keras-layer

I am trying to implement a denoising autoencoder with an LSTM layer in between. The architecture goes following.

FC layer -> FC layer -> LSTM cell -> FC layer -> FC layer.

I am unable to understand how my input dimension should be to implement this architecture?

I tried the following code

batch_size = 1
model = Sequential()
model.add(Dense(5, input_shape=(1,)))
model.add(Dense(10))
model.add(LSTM(32))
model.add(Dropout(0.3))
model.add(Dense(5))
model.add(Dense(1))
model.compile(loss='mean_squared_error', optimizer='adam')
model.fit(trainX, trainY, nb_epoch=100, batch_size=batch_size, verbose=2)

My trainX is [650,20,1] vector. It is a time series data in with only one feature.

I am getting following error

ValueError                                Traceback (most recent call last)
<ipython-input-20-1248a33f6518> in <module>()
      3 model.add(Dense(5, input_shape=(1,)))
      4 model.add(Dense(10))
----> 5 model.add(LSTM(32))
      6 model.add(Dropout(0.3))
      7 model.add(Dense(5))

/usr/local/lib/python2.7/dist-packages/keras/models.pyc in add(self, layer)
    330                  output_shapes=[self.outputs[0]._keras_shape])
    331         else:
--> 332             output_tensor = layer(self.outputs[0])
    333             if isinstance(output_tensor, list):
    334                 raise TypeError('All layers in a Sequential model '

/usr/local/lib/python2.7/dist-packages/keras/engine/topology.pyc in __call__(self, x, mask)
    527             # Raise exceptions in case the input is not compatible
    528             # with the input_spec specified in the layer constructor.
--> 529             self.assert_input_compatibility(x)
    530 
    531             # Collect input shapes to build layer.

/usr/local/lib/python2.7/dist-packages/keras/engine/topology.pyc in assert_input_compatibility(self, input)
    467                                          self.name + ': expected ndim=' +
    468                                          str(spec.ndim) + ', found ndim=' +
--> 469                                          str(K.ndim(x)))
    470             if spec.dtype is not None:
    471                 if K.dtype(x) != spec.dtype:

ValueError: Input 0 is incompatible with layer lstm_10: expected ndim=3, found ndim=2

984

asked Mar 10 '17 09:03

Nilay Thakor

1 Answers

The dense layer can take sequences as input and it will apply the same dense layer on every vector (last dimension). Example :

You have a 2D tensor input that represents a sequence (timesteps, dim_features), if you apply a dense layer to it with new_dim outputs, the tensor that you will have after the layer will be a new sequence (timesteps, new_dim)

If you have a 3D tensor (n_lines, n_words, embedding_dim) that can be a document, with n_lines lines, n_words words per lines and embedding_dim dimensions for each word, applying a dense layer to it with new_dim outputs will get you a new doc tensor (3D) with shape (n_lines, n_words, new_dim)

You can see here the dimensions input and output that you can feed and get with the Dense() layer.

121

answered Sep 29 '22 07:09

Nassim Ben

Related questions
                            
                                Keras Training warm_start
                            
                                Why are Embeddings in PyTorch implemented as Sparse Layers?
                            
                                Keras KerasClassifier gridsearch TypeError: can't pickle _thread.lock objects
                            
                                Is fit_generator in Keras supposed to reset the generator after each epoch?
                            
                                Does normalizing images by dividing by 255 leak information between train and test set?
                            
                                Neural network is not giving the expected output after training in Python
                            
                                Why in preprocessing image data, we need to do zero-centered data?
                            
                                Why plot_model in Keras does not plot the model correctly?
                            
                                model.predict_classes is deprecated - What to use instead?
                            
                                Early stopping in Bert Trainer instances
                            
                                Looking for interesting topic from neural networks area [closed]
                            
                                How to use neural networks to solve "soft" solutions?
                            
                                Neural network library for Python? [closed]
                            
                                Delta rule vs. gradient descent?
                            
                                Neural Network Diverging instead of converging
                            
                                (Python) Gaussian Bernoulli RBM on computing P(v|h)
                            
                                What makes GPUs so efficient in neural network computations?
                            
                                Neural Network composed of multiple activation functions
                            
                                LSTM/RNN many to one
                            
                                How to use keras for binary classification?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With