Bi-directional LSTM for variable-length sequence in Tensorflow

Tags:

I want to train a bi-directional LSTM in tensorflow to perform a sequence classification problem (sentiment classification).

Because sequences are of variable lengths, batches are normally padded with vectors of zero. Normally, I use the sequence_length parameter in the uni-directional RNN to avoid training on the padding vectors.

How can this be managed with bi-directional LSTM. Does the "sequence_length" parameter work automatically starts from an advanced position in the sequence for the backward direction?

Thank you

810

asked Mar 21 '17 19:03

Ramy Baly

1 Answers

bidirectional_dynamic_rnn also has a sequence_length parameter that takes care of sequences of variable lengths.

https://www.tensorflow.org/api_docs/python/tf/nn/bidirectional_dynamic_rnn (mirror):

sequence_length: An int32/int64 vector, size [batch_size], containing the actual lengths for each of the sequences.

You can see an example here: https://github.com/Franck-Dernoncourt/NeuroNER/blob/master/src/entity_lstm.py

answered Sep 29 '22 02:09

Franck Dernoncourt

Related questions
                            
                                Implementing a batch dependent loss in Keras
                            
                                How to configure tensorflow legacy/train.py model.cpk output interval
                            
                                Tensorflow-Deeplearning - Correlation between input and output
                            
                                How to implement Beholder (Tensorboard plugin) for Keras?
                            
                                Keras predict loop memory leak using tf.data.Dataset but not with a numpy array
                            
                                How a robust background removal is implemented?
                            
                                How to convert tf.contrib to Tensorflow 2.0
                            
                                Why does Tensorflow 2 give a warning (but still work anyway) when the input is a pandas dataframe?
                            
                                Getting Model Explanations with Tensorflow Serving and SavedModel Estimators
                            
                                Inputting an obscure file type into tensorflow
                            
                                How to store result of an operation (like TOPK) per epoch in keras
                            
                                error when using Mirrored strategy in Tensorflow
                            
                                Keras custom loss function to ignore false negatives of a specific class during semantic segmentation?
                            
                                Layer names for pretrained inception v3 model (tensorflow) [duplicate]
                            
                                Embedding lookup table doesn't mask padding value
                            
                                How to detect which variable is 'nonetype' in tensorflow
                            
                                How to use textsum?
                            
                                Computer restarts with large mini batches in TensorFlow
                            
                                Difference in matrix multiplication tensorflow vs numpy
                            
                                Training TensorFlow model with summary operations is much slower than without summary operations

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Bi-directional LSTM for variable-length sequence in Tensorflow

Tags:

tensorflow

lstm

bidirectional

Ramy Baly

People also ask

1 Answers

Franck Dernoncourt

Recent Activity

Donate For Us