
Stateful LSTM fails to predict due to batch_size issue

I am able to successfully train my stateful LSTM using Keras. My batch size is 60, and the number of samples in every input I send to the network is divisible by the batch size. Here is my snippet:

import keras
from keras.models import Sequential
from keras.layers import LSTM, Dropout, Dense, Activation

model = Sequential()
# batch_input_shape already fixes the input shape, so a separate
# input_shape argument is redundant.
model.add(LSTM(80, batch_input_shape=(60, trainx.shape[1], trainx.shape[2]),
               stateful=True, return_sequences=True))
model.add(Dropout(0.15))
model.add(LSTM(40, return_sequences=False))
model.add(Dense(40))
model.add(Dropout(0.3))
model.add(Dense(output_dim=1))
model.add(Activation("linear"))
# Assign the optimizer and pass the instance to compile; otherwise the
# custom learning rate is discarded and the default "rmsprop" is used.
optimizer = keras.optimizers.RMSprop(lr=0.005, rho=0.9, epsilon=1e-08, decay=0.0)
model.compile(loss="mse", optimizer=optimizer)

My training line, which runs successfully:

  model.fit(trainx[:3000,:],trainy[:3000],validation_split=0.1,shuffle=False,nb_epoch=9,batch_size=60)

Now I try to predict on the test set, whose size is again divisible by 60, but I get this error:

ValueError: In a stateful network, you should only pass inputs with a number of samples that can be divided by the batch size. Found: 240 samples. Batch size: 32.

Can anyone tell me what is wrong above? I am confused; I have tried so many things, but nothing helps.

Harshit asked Jul 14 '17 13:07


People also ask

What is Batch_size in LSTM?

From experience, a batch size of 64 is often a good default. In some cases you may instead choose 32, 64, or 128; these common choices are powers of two. Note that batch-size fine-tuning should be guided by observed performance.

What is the advantage of stateful RNNs over stateless RNNs?

Setting an RNN to be stateful means that it can build a state across its training sequence and even maintain that state when doing predictions. The benefits of using stateful RNNs are smaller network sizes and/or lower training times.

What is stateful LSTM?

All RNN and LSTM models are stateful in theory; these models are meant to remember the entire sequence for prediction or classification tasks. In practice, however, you train on batches with the backpropagation algorithm, and gradients cannot backpropagate between batches.

What is stateful and stateless LSTM?

Stateless works best when the sequences you're learning aren't dependent on one another; sentence-level prediction of the next word is a good example of when to use stateless. The stateless configuration resets LSTM cell memory after every batch, whereas the stateful configuration preserves it across batches until you reset it explicitly, as the sketch below illustrates.
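
To make the stateful/stateless distinction concrete, here is a minimal self-contained training sketch (toy random data; all sizes and names are illustrative, using the same Keras-1-era API as the question):

import numpy as np
from keras.models import Sequential
from keras.layers import LSTM, Dense

batch_size = 60
# Toy data: 600 samples, 10 timesteps, 1 feature (all illustrative).
x = np.random.rand(600, 10, 1)
y = np.random.rand(600, 1)

model = Sequential()
# Stateful layers require a fixed batch size, hence batch_input_shape.
model.add(LSTM(32, batch_input_shape=(batch_size, 10, 1), stateful=True))
model.add(Dense(1))
model.compile(loss="mse", optimizer="rmsprop")

# State carries across batches within an epoch, so train one epoch at a
# time and reset the state at each epoch boundary.
for epoch in range(5):
    model.fit(x, y, batch_size=batch_size, shuffle=False,
              nb_epoch=1, verbose=0)  # `epochs=1` in Keras 2
    model.reset_states()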


1 Answer

I suspect the reason for the error is that you did not specify the batch size in model.predict. As you can see in the documentation for the predict method, the default parameters are

model.predict(self, x, batch_size=32, verbose=0)

which is why 32 appears in your error message. So you need to specify batch_size=60 in model.predict, as in the sketch below.
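
A minimal sketch of the fixed call (assuming the test inputs are in an array named testx, a name carried over from the question's trainx):

# Pass the training batch size; stateful layers require every batch,
# including prediction batches, to have exactly this many samples.
predictions = model.predict(testx, batch_size=60)

Since the network is stateful, you may also want to call model.reset_states() before predicting so that leftover training state does not leak into the test sequence.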

Miriam Farber answered Oct 24 '22 12:10