 

How do you pass video features from a CNN to an LSTM?

After you pass a video frame through a convnet and get an output feature map, how do you pass that data into an LSTM? Also, how do you pass multiple frames through the CNN to the LSTM?
In other words, I want to process video frames with a CNN to get their spatial features, then pass those features to an LSTM for temporal processing. How do I connect the LSTM to the video features? For example, if the input frame is 56x56 and, after passing through all of the CNN layers, it comes out as 20 feature maps of 5x5, how are these connected to the LSTM on a frame-by-frame basis? And should they go through a fully connected layer first? Thanks, Jon

asked May 02 '16 by Jon


People also ask

How do I connect CNN to LSTM?

A CNN LSTM can be defined by adding CNN layers on the front end followed by LSTM layers with a Dense layer on the output. It is helpful to think of this architecture as defining two sub-models: the CNN Model for feature extraction and the LSTM Model for interpreting the features across time steps.
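
As a rough illustration, below is a minimal sketch of that two-sub-model arrangement in Keras (an assumed framework, not named above); layer sizes and shapes are placeholders.

    from tensorflow.keras import layers, models

    num_frames, height, width, channels = 10, 56, 56, 1

    model = models.Sequential([
        # CNN sub-model: the same convolution is applied to every frame
        layers.TimeDistributed(layers.Conv2D(16, (3, 3), activation="relu"),
                               input_shape=(num_frames, height, width, channels)),
        layers.TimeDistributed(layers.MaxPooling2D((2, 2))),
        layers.TimeDistributed(layers.Flatten()),
        # LSTM sub-model: interprets the per-frame feature vectors across time
        layers.LSTM(64),
        # Dense layer on the output
        layers.Dense(10, activation="softmax"),
    ])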

What is the difference between ConvLSTM and CNN LSTM?

The ConvLSTM differs from a simple CNN + LSTM in that, in a CNN + LSTM, the convolution (CNN) is applied as a first stage and an LSTM layer is then applied sequentially as a second stage, whereas in a ConvLSTM the convolution is performed inside the recurrent cell itself.
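
For concreteness, here is a hedged sketch of the two arrangements in Keras (an assumed framework; neither is prescribed by this page), with arbitrary placeholder sizes.

    from tensorflow.keras import layers, models

    frames, h, w, c = 10, 56, 56, 1

    # CNN + LSTM: convolution is applied per frame first, then an LSTM
    # runs over the flattened per-frame features.
    cnn_lstm = models.Sequential([
        layers.TimeDistributed(layers.Conv2D(16, (3, 3), activation="relu"),
                               input_shape=(frames, h, w, c)),
        layers.TimeDistributed(layers.Flatten()),
        layers.LSTM(32),
    ])

    # ConvLSTM: the convolution happens inside the recurrent cell itself.
    conv_lstm = models.Sequential([
        layers.ConvLSTM2D(16, (3, 3), input_shape=(frames, h, w, c)),
        layers.Flatten(),
    ])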

Why is LSTM better than CNN?

An LSTM is designed to work differently from a CNN: an LSTM is usually used to process and make predictions given sequences of data, whereas a CNN is designed to exploit "spatial correlation" in data and works well on images and speech.

How does LSTM work in image processing?

This is called the CNN LSTM model, specifically designed for sequence prediction problems with spatial inputs, like images or videos. This architecture involves using Convolutional Neural Network (CNN) layers for feature extraction on input data combined with LSTMs to perform sequence prediction on the feature vectors.
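
One common way to realize this in practice, sketched below under the assumption of Keras and an arbitrarily chosen MobileNetV2 backbone (neither is specified on this page), is to run a CNN over each frame to get a feature vector and then train an LSTM on the resulting sequence of vectors.

    import numpy as np
    from tensorflow.keras import applications, layers, models

    # Stage 1: a CNN used purely as a per-frame feature extractor.
    # (MobileNetV2 is an arbitrary choice; weights=None keeps the sketch
    # self-contained -- in practice you would load pretrained weights.)
    feature_extractor = applications.MobileNetV2(
        input_shape=(96, 96, 3), include_top=False, pooling="avg", weights=None)
    feature_dim = feature_extractor.output_shape[-1]     # 1280 for MobileNetV2

    # Dummy clip: 8 frames of 96x96 RGB, features computed frame by frame.
    clip = np.random.rand(8, 96, 96, 3).astype("float32")
    frame_features = feature_extractor.predict(clip)     # shape (8, 1280)

    # Stage 2: an LSTM that does sequence prediction on the feature vectors.
    sequence_model = models.Sequential([
        layers.LSTM(64, input_shape=(None, feature_dim)),
        layers.Dense(10, activation="softmax"),
    ])
    prediction = sequence_model(frame_features[np.newaxis, ...])  # batch of 1 clip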


1 Answer

Basically, you can flatten each frame's features and feed them into the LSTM, one time step per frame. With a CNN it's the same: you can feed each frame's CNN output into the LSTM as one time step.

Whether to add a fully connected (FC) layer first is up to you.

See the network structure in http://www.eecs.berkeley.edu/Pubs/TechRpts/2014/EECS-2014-180.pdf.

[Figure: network architecture from the linked paper, with per-frame CNN features fed into an LSTM]
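
A minimal sketch of this idea, assuming Keras (the answer doesn't name a framework) and using the shapes from the question: each 56x56 frame is reduced by the CNN to 20 feature maps of 5x5, flattened to 20 * 5 * 5 = 500 features, optionally passed through a fully connected layer, and fed to the LSTM as one time step per frame. The particular conv/pool layers are placeholders chosen only so the shapes work out.

    from tensorflow.keras import layers, models

    num_frames = 16  # arbitrary clip length

    # Per-frame CNN: 56x56x1 -> 5x5x20, then flattened to 500 features.
    per_frame_cnn = models.Sequential([
        layers.Conv2D(20, (5, 5), strides=2, activation="relu",
                      input_shape=(56, 56, 1)),        # -> 26x26x20
        layers.MaxPooling2D((2, 2)),                   # -> 13x13x20
        layers.Conv2D(20, (4, 4), activation="relu"),  # -> 10x10x20
        layers.MaxPooling2D((2, 2)),                   # -> 5x5x20
        layers.Flatten(),                              # -> 500 = 20 * 5 * 5
        # Optional FC layer before the LSTM ("it's up to you")
        layers.Dense(256, activation="relu"),
    ])

    # Apply the CNN to every frame, then run the LSTM over the sequence.
    model = models.Sequential([
        layers.TimeDistributed(per_frame_cnn,
                               input_shape=(num_frames, 56, 56, 1)),
        layers.LSTM(128),                              # temporal processing
        layers.Dense(10, activation="softmax"),        # e.g. 10 action classes
    ])

TimeDistributed is what applies the same per-frame CNN to every frame and hands the LSTM one feature vector per time step; dropping the Dense(256) line gives the no-FC variant.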

answered Sep 28 '22 by Sung Kim