I am using the deep learning library Keras and trying to stack multiple LSTM layers, with no luck. Below is my code:
model = Sequential()
model.add(LSTM(100, input_shape=(time_steps, vector_size)))
model.add(LSTM(100))
The above code raises an error at the third line:

Exception: Input 0 is incompatible with layer lstm_28: expected ndim=3, found ndim=2
The input X is a tensor of shape (100, 250, 50). I am running Keras on the TensorFlow backend.
The Solution. Add return_sequences=True to every LSTM layer except the last one, so that each layer's output tensor has ndim=3 (i.e. batch size, timesteps, hidden state). Setting this flag to True tells Keras that the LSTM should return its hidden state for every time step rather than only the final one, which produces the 3D output the next LSTM layer expects.
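A minimal sketch of the fix applied to the code from the question; the time_steps and vector_size values are assumed from the stated input shape (100, 250, 50):

from keras.models import Sequential
from keras.layers import LSTM

time_steps, vector_size = 250, 50  # assumed from the input shape (100, 250, 50)

model = Sequential()
# return_sequences=True: this layer outputs (batch, time_steps, 100), i.e. ndim=3
model.add(LSTM(100, return_sequences=True, input_shape=(time_steps, vector_size)))
# the last LSTM keeps the default return_sequences=False and outputs (batch, 100)
model.add(LSTM(100))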
"Stacking LSTM hidden layers makes the model deeper, more accurately earning the description as a deep learning technique ... The additional hidden layers are understood to recombine the learned representation from prior layers and create new representations at high levels of abstraction.
The original LSTM model comprises a single hidden LSTM layer followed by a standard feedforward output layer. The Stacked LSTM extends this model with multiple hidden LSTM layers, where each layer contains multiple memory cells.
Implementing Stacked LSTMs in Keras. Each LSTM memory cell requires a 3D input. When an LSTM processes one input sequence of time steps, each memory cell will output a single value for the whole sequence as a 2D array. We can demonstrate this below with a model whose single hidden LSTM layer is also the output layer.
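Here is a minimal sketch of that demonstration; the three-step input sequence and its values are made up for illustration:

from numpy import array
from keras.models import Sequential
from keras.layers import LSTM

# a model whose single hidden LSTM layer is also the output layer
model = Sequential()
model.add(LSTM(1, input_shape=(3, 1)))
model.compile(optimizer='adam', loss='mse')

# one sequence of three time steps, one feature per step
data = array([0.1, 0.2, 0.3]).reshape((1, 3, 1))

# prediction has shape (1, 1): a single value for the whole sequence (2D)
print(model.predict(data).shape)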
To stack LSTM layers, we need to change the configuration of the prior LSTM layer to output a 3D array as input for the subsequent layer. We can do this by setting the return_sequences argument on the layer to True (defaults to False). This will return one output for each input time step and provide a 3D array.
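Changing the toy model above to set return_sequences=True is a quick sketch of the difference in output shape:

from numpy import array
from keras.models import Sequential
from keras.layers import LSTM

model = Sequential()
model.add(LSTM(1, return_sequences=True, input_shape=(3, 1)))
model.compile(optimizer='adam', loss='mse')

data = array([0.1, 0.2, 0.3]).reshape((1, 3, 1))

# prediction now has shape (1, 3, 1): one output per time step (3D)
print(model.predict(data).shape)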
We can continue to add hidden LSTM layers as long as the prior LSTM layer provides a 3D output as input for the subsequent layer; for example, below is a Stacked LSTM with 4 hidden layers.
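Here is a sketch of such a 4-layer Stacked LSTM; the layer width of 50 units and the (3, 1) input shape are arbitrary choices for illustration:

from keras.models import Sequential
from keras.layers import LSTM

model = Sequential()
# every layer except the last returns the full sequence (3D output)
model.add(LSTM(50, return_sequences=True, input_shape=(3, 1)))
model.add(LSTM(50, return_sequences=True))
model.add(LSTM(50, return_sequences=True))
# the final LSTM layer returns a single 2D output
model.add(LSTM(50))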
You need to add return_sequences=True to the first layer so that its output tensor has ndim=3 (i.e. batch size, timesteps, hidden state).
Please see the following example:
# expected input data shape: (batch_size, timesteps, data_dim)
model = Sequential()
model.add(LSTM(32, return_sequences=True,
               input_shape=(timesteps, data_dim)))  # returns a sequence of vectors of dimension 32
model.add(LSTM(32, return_sequences=True))  # returns a sequence of vectors of dimension 32
model.add(LSTM(32))  # returns a single vector of dimension 32
model.add(Dense(10, activation='softmax'))
From: https://keras.io/getting-started/sequential-model-guide/ (search for "stacked lstm")