Use keras(TensorFlow) to build a Conv2D+LSTM model

Tags:

The data are 10 videos and each videos split into 86 frames and each frame has 28*28 pixels,

video_num = 10
frame_num = 86
pixel_num = 28*28

I want to use Conv2D+LSDM to build the Model, and at each time_steps(=frame_num=86) send the pixels data (=INPUT_SIZE=28*28) in the model.So the following is my code about the Model

BATCH_SIZE = 2 (just try)
TIME_STEPS=frame_num (=86)
INPUT_SIZE=pixel_num (=28*28)

model = Sequential()
model.add(InputLayer(batch_input_shape=(BATCH_SIZE, TIME_STEPS,     
INPUT_SIZE)))
print (model.output_shape)

model.add(TimeDistributed(Conv2D(64,(1,3),strides=(1,1), padding='same', 
data_format='channels_last')))  ##always the error here
print (model.output_shape)

model.add(TimeDistributed(MaxPooling2D(pool_size=(2,2),padding='same')))
print (model.output_shape)

model.add(TimeDistributed(Conv2D(64,(1,3),strides=(1,1), 
data_format='channels_last', padding='same')))
print (model.output_shape)

model.add(TimeDistributed(MaxPooling2D(pool_size=(2,2),padding='same')))
print (model.output_shape)

model.add(TimeDistributed(Flatten()))
print (model.output_shape)

model.add(TimeDistributed(Dense(4096, activation='relu')))
print (model.output_shape)

model.add(LSTM(100, stateful=True, return_sequences=True))
print (model.output_shape)

model.add(Dense(1, activation='sigmoid'))
print (model.output_shape)

the following figure shows the error from command line

https://imgur.com/a/yAPQO says "list index out of range"

I think that error is about the input shape in TimeDistributed() which gets the input from upper layer(InputLayer()), but I have no idea how to fix the error. I have tried to remove the InputLayer(), and use

TimeDistributed(Conv2D(...), input_shape=(TIME_STEPS, INPUT_SIZE))

as the first layer, but also get the same error...

If anyone know about this error, please share your idea, I will be very appreciate. Also, I still didn't very clear about the difference between batch_input_shape and input_shape, did anyone use these two before? Thanks.

243

asked Nov 24 '17 09:11

Edward Chang

1 Answers

A Conv2D layer requires four dimensions, not three:

(batch_size, height, width, channels).

And the TimeDistributed will require an additional dimension:

(batch_size, frames, height, width, channels)

So, if you're really going to work with TimeDistributed+Conv2D, you need 5 dimensions. Your input_shape=(86,28,28,3), or your batch_input_shape=(batch_size,86,28,28,3), where I assumed you've got an RGB video (3 color channels).

Usually, you just pass an input shape to the TimeDistributed.

model.add(TimeDistributed(Dense(....), input_shape=(86,28,28,3))

You will need the batch_input_shape only in the case of using stateful=True LSTM's. Then you just replace the input_shape with the batch_input_shape.

Notice that only the convolutional 2D layers will see images in terms of height and width. When you add the LSTM's, you will need to reshape the data to bring height, width and channels into a single dimension.

For a shape (frames, h, w, ch):

model.add(Reshape((frames,h*w*ch)))

And you should not use TimeDistributed with these LSTMs, only with the convolutional layers.

Your approach of using model.add(TimeDistributed(Flatten())) is ok instead of the reshape.

Notice also that Keras has recently implemented a ConvLSTM2D layer, which might be useful in your case: https://keras.io/layers/recurrent/#convlstm2d

188

answered Nov 10 '22 00:11

Daniel Möller

Related questions
                            
                                Default Argument decorator python
                            
                                Pandas SQL equivalent for 'not equal' clause
                            
                                O(n) solution for finding maximum sum of differences python 3.x?
                            
                                Keras Extremely High Loss
                            
                                How to know from python if Windows path limit has been removed
                            
                                Python exit from all running threads on truthy condition
                            
                                Splitting list of dictionary into sublists after the occurence of particular key of dictionary
                            
                                Data Normalization with tensorflow tf-transform
                            
                                hog() got an unexpected keyword argument 'visualize'
                            
                                Comparing two pandas series for floating point near-equality?
                            
                                Python : upload my own files into my drive using Pydrive library
                            
                                Generate URLs for Flask test client with url_for function
                            
                                django.urls.exceptions.NoReverseMatch: Reverse for 'sign_up' not found. 'sign_up' is not a valid view function or pattern name
                            
                                Abstract matrix multiplication with variables
                            
                                transparent background in gif using Python Imageio
                            
                                How do i take picture from client side(html) and save it to server side(Python)
                            
                                Scraping text in h3 and div tags using beautifulSoup, Python
                            
                                Boto3 create_image for AMI creation - Save ONLY the root volume
                            
                                Reverse for 'password_reset_done' not found. 'password_reset_done' is not a valid view function or pattern name
                            
                                Pandas- Dividing a column by another column conditional on if values are greater than 0?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Use keras(TensorFlow) to build a Conv2D+LSTM model

Tags:

python

keras

lstm

Edward Chang

People also ask

1 Answers

Daniel Möller

Recent Activity

Donate For Us