I am using features of variable-length videos to train a one-layer LSTM. Video lengths range from 10 to 35 frames, and I am using a batch size of 1. I have the following code:
lstm_model = LSTMModel(4096, 4096, 1, 64)
for step, (video_features, label) in enumerate(data_loader):
    # Variable is deprecated since PyTorch 0.4; tensors can be fed to the model directly
    bx = video_features.view(-1, len(video_features), len(video_features[0]))  # examples = 1x12x4096, 1x5x4096
    output = lstm_model(bx)
The LSTM model is:
class LSTMModel(nn.Module):
    def __init__(self, input_size, hidden_size, num_layers, num_classes):
        super(LSTMModel, self).__init__()
        self.l1 = nn.LSTM(input_size=input_size, hidden_size=hidden_size,
                          num_layers=num_layers, batch_first=True)
        self.out = nn.Linear(hidden_size, num_classes)

    def forward(self, x):
        r_out, (h_n, h_c) = self.l1(x, None)  # None represents zero initial hidden state
        out = self.out(r_out[:, -1, :])
        return out
I just want to ask: is this the right way to train an LSTM with variable-size input? The code runs and the loss decreases, but I am not sure it is correct, because I have not used LSTMs in PyTorch before.
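As a quick sanity check (a minimal sketch using smaller dimensions than the 4096 above, so it runs fast), you can verify that the model produces one prediction per video regardless of how many frames the video has:

```python
import torch
import torch.nn as nn

class LSTMModel(nn.Module):
    def __init__(self, input_size, hidden_size, num_layers, num_classes):
        super(LSTMModel, self).__init__()
        self.l1 = nn.LSTM(input_size=input_size, hidden_size=hidden_size,
                          num_layers=num_layers, batch_first=True)
        self.out = nn.Linear(hidden_size, num_classes)

    def forward(self, x):
        r_out, (h_n, h_c) = self.l1(x, None)  # None -> zero initial hidden/cell state
        return self.out(r_out[:, -1, :])      # classify from the last time step

model = LSTMModel(input_size=16, hidden_size=32, num_layers=1, num_classes=4)
for seq_len in (10, 23, 35):            # variable "video" lengths
    x = torch.randn(1, seq_len, 16)     # batch of 1, as in the question
    y = model(x)
    print(y.shape)                      # torch.Size([1, 4]) every time
```

Because the batch size is 1, each forward pass can have a different sequence length; the output shape depends only on `num_classes`.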
Yes, your code is correct and will always work for a batch size of 1. But if you want to use a batch size other than 1, you'll need to pack your variable-size inputs into a batch with `pack_padded_sequence`, and then unpack them after the LSTM. You can find more details in my answer to a similar question.
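A minimal sketch of that packing approach (the lengths and dimensions here are illustrative, not taken from the question): pad the variable-length sequences into one batch, pack them, run the LSTM, and read each sequence's last *valid* hidden state from `h_n` so the padding never influences the prediction:

```python
import torch
import torch.nn as nn
from torch.nn.utils.rnn import pad_sequence, pack_padded_sequence

# Three "videos" of different lengths, each frame a 16-dim feature vector
seqs = [torch.randn(n, 16) for n in (12, 5, 9)]
lengths = torch.tensor([s.size(0) for s in seqs])

padded = pad_sequence(seqs, batch_first=True)   # shape (3, 12, 16), zero-padded
packed = pack_padded_sequence(padded, lengths,
                              batch_first=True, enforce_sorted=False)

lstm = nn.LSTM(input_size=16, hidden_size=32, batch_first=True)
out_layer = nn.Linear(32, 4)

packed_out, (h_n, c_n) = lstm(packed)
# h_n[-1] holds the hidden state at each sequence's true last step,
# already restored to the original batch order
logits = out_layer(h_n[-1])                     # torch.Size([3, 4])
print(logits.shape)
```

Note that with packing you take the final state from `h_n` rather than from `r_out[:, -1, :]`, since for the shorter sequences the last padded positions of the output are not meaningful.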
P.S. You should post such questions to Code Review.