How to implement a deep bidirectional LSTM with Keras?

Tags:

I am trying to implement a LSTM based speech recognizer. So far I could set up bidirectional LSTM (i think it is working as a bidirectional LSTM) by following the example in Merge layer. Now I want to try it with another bidirectional LSTM layer, which make it a deep bidirectional LSTM. But I am unable to figure out how to connect the output of the previously merged two layers into a second set of LSTM layers. I don't know whether it is possible with Keras. Hope someone can help me with this.

Code for my single layer bidirectional LSTM is as follows

left = Sequential()
left.add(LSTM(output_dim=hidden_units, init='uniform', inner_init='uniform',
               forget_bias_init='one', return_sequences=True, activation='tanh',
               inner_activation='sigmoid', input_shape=(99, 13)))
right = Sequential()
right.add(LSTM(output_dim=hidden_units, init='uniform', inner_init='uniform',
               forget_bias_init='one', return_sequences=True, activation='tanh',
               inner_activation='sigmoid', input_shape=(99, 13), go_backwards=True))

model = Sequential()
model.add(Merge([left, right], mode='sum'))

model.add(TimeDistributedDense(nb_classes))
model.add(Activation('softmax'))

sgd = SGD(lr=0.1, decay=1e-5, momentum=0.9, nesterov=True)
model.compile(loss='categorical_crossentropy', optimizer=sgd)
print("Train...")
model.fit([X_train, X_train], Y_train, batch_size=1, nb_epoch=nb_epoches, validation_data=([X_test, X_test], Y_test), verbose=1, show_accuracy=True)

Dimensions of my x and y values are as follows.

(100, 'train sequences')
(20, 'test sequences')
('X_train shape:', (100, 99, 13))
('X_test shape:', (20, 99, 13))
('y_train shape:', (100, 99, 11))
('y_test shape:', (20, 99, 11))

323

asked Feb 03 '16 05:02

udani

3 Answers

Well, I got the answer for the issue posted on the Keras issues. Hope this would be useful to anyone who look for this kind of approach. How to implement deep bidirectional -LSTM

183

answered Sep 23 '22 14:09

udani

model.add(Bidirectional(LSTM(64)))

Keras example

answered Sep 24 '22 14:09

rosefun

You can use keras.layers.wrappers.Bidirectional. Official manual can be referenced here, https://keras.io/layers/wrappers/#bidirectional

answered Sep 24 '22 14:09

Tom

Related questions
                            
                                Reproducibility and performance in PyTorch
                            
                                Building custom Caffe layer in python
                            
                                How to evolve weights of a neural network in Neuroevolution?
                            
                                Tensorflow Sequence to sequence model using the seq2seq API ( ver 1.1 and above)
                            
                                Inputs to eager execution function cannot be Keras symbolic tensors
                            
                                Save model every 10 epochs tensorflow.keras v2
                            
                                What is "unk" in the pretrained GloVe vector files (e.g. glove.6B.50d.txt)?
                            
                                Deep learning for image classification [closed]
                            
                                Difference between 1 LSTM with num_layers = 2 and 2 LSTMs in pytorch
                            
                                Implementing dropout from scratch
                            
                                Why doesn't my Deep Q Network master a simple Gridworld (Tensorflow)? (How to evaluate a Deep-Q-Net)
                            
                                How to use Batch Normalization correctly in tensorflow?
                            
                                Understanding Gradient Policy Deriving
                            
                                How to select batch size automatically to fit GPU?
                            
                                Does bias in the convolutional layer really make a difference to the test accuracy?
                            
                                How to understand masked multi-head attention in transformer
                            
                                caffe with multi-label images
                            
                                Understanding stateful LSTM [closed]
                            
                                How to decode encoded data from deep autoencoder in Keras (unclarity in tutorial)
                            
                                Keras for implement convolution neural network

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How to implement a deep bidirectional LSTM with Keras?

Tags:

deep-learning

keras

lstm

udani

People also ask

3 Answers

udani

rosefun

Tom

Recent Activity

Donate For Us