How to connect LSTM layers in Keras, RepeatVector or return_sequences=True?

I'm trying to build an LSTM autoencoder in Keras for time series. The data has shape (5039, 28, 1), so my seq_len is 28 and I have one feature. The first encoder layer has 112 hidden units and the second has 56. To get back to the input shape for the decoder, I had to add a third layer with 28 hidden units (the autoencoder is supposed to reconstruct its input). But I don't know the correct way to connect the LSTM layers. AFAIK, I can either add RepeatVector or set return_sequences=True. Both of my models are shown in the following code. What is the difference, and which approach is the correct one?

First model using return_sequences=True:

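from tensorflow.keras.layers import Input, LSTM, RepeatVector, Reshape
from tensorflow.keras.models import Model

# Encoder: intermediate layers return the full sequence
inputEncoder = Input(shape=(28, 1))
firstEncLayer = LSTM(112, return_sequences=True)(inputEncoder)  # (None, 28, 112)
snd = LSTM(56, return_sequences=True)(firstEncLayer)  # (None, 28, 56)
outEncoder = LSTM(28)(snd)  # (None, 28): last output only

# Turn the 28-dim code into a 28-step sequence with one feature
context = RepeatVector(1)(outEncoder)  # (None, 1, 28)
context_reshaped = Reshape((28, 1))(context)  # (None, 28, 1)

encoder_model = Model(inputEncoder, outEncoder)
# Decoder
firstDecoder = LSTM(112, return_sequences=True)(context_reshaped)  # (None, 28, 112)
outDecoder = LSTM(1, return_sequences=True)(firstDecoder)  # (None, 28, 1)

autoencoder = Model(inputEncoder, outDecoder)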

Second model with RepeatVector:

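from tensorflow.keras.layers import Input, LSTM, RepeatVector, Reshape
from tensorflow.keras.models import Model

# Encoder: each LSTM returns only its last output, and
# RepeatVector(1) turns it back into a one-step sequence
inputEncoder = Input(shape=(28, 1))
firstEncLayer = LSTM(112)(inputEncoder)  # (None, 112)
firstEncLayer = RepeatVector(1)(firstEncLayer)  # (None, 1, 112)
snd = LSTM(56)(firstEncLayer)  # (None, 56)
snd = RepeatVector(1)(snd)  # (None, 1, 56)
outEncoder = LSTM(28)(snd)  # (None, 28)
encoder_model = Model(inputEncoder, outEncoder)

context = RepeatVector(1)(outEncoder)  # (None, 1, 28)
context_reshaped = Reshape((28, 1))(context)  # (None, 28, 1)

# Decoder
firstDecoder = LSTM(112)(context_reshaped)  # (None, 112)
firstDecoder = RepeatVector(1)(firstDecoder)  # (None, 1, 112)
sndDecoder = LSTM(28)(firstDecoder)  # (None, 28)

outDecoder = RepeatVector(1)(sndDecoder)  # (None, 1, 28)
outDecoder = Reshape((28, 1))(outDecoder)  # (None, 28, 1)

autoencoder = Model(inputEncoder, outDecoder)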

asked Aug 08 '18 14:08

Birish

1 Answer

Which one is better depends on the problem you're solving, so you will probably have to try both and see for yourself. Here is the difference between the two approaches.

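Difference between return_sequences=True and RepeatVector

Essentially, return_sequences=True makes an LSTM layer return its output at every time step, giving a 3D tensor of shape (batch, timesteps, units). Without it, the layer returns only the output at the last time step, with shape (batch, units). RepeatVector(n) takes that last output and repeats it n times, producing a tensor of shape (batch, n, units) in which every time step holds the same vector.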
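To make the shapes concrete, here is a minimal sketch that runs a dummy batch through both options. The batch size of 4 and the 56 units are arbitrary choices for illustration, and the tensorflow.keras import path is an assumption (the question does not show its imports).

```python
import numpy as np
from tensorflow.keras.layers import LSTM, RepeatVector

# Dummy batch: 4 sequences, 28 time steps, 1 feature (same layout as the question).
x = np.random.rand(4, 28, 1).astype("float32")

# return_sequences=True: one output vector per time step.
all_steps = LSTM(56, return_sequences=True)(x)

# Default (return_sequences=False): only the output at the last time step.
last_step = LSTM(56)(x)

# RepeatVector(28) copies the 2D (batch, units) tensor 28 times along a new time axis.
repeated = RepeatVector(28)(last_step)

all_steps_shape = tuple(all_steps.shape)
last_step_shape = tuple(last_step.shape)
repeated_shape = tuple(repeated.shape)

print(all_steps_shape)   # (4, 28, 56)
print(last_step_shape)   # (4, 56)
print(repeated_shape)    # (4, 28, 56)

# Every time step of the repeated tensor holds the same vector.
repeated_np = np.asarray(repeated)
copies_identical = bool(np.allclose(repeated_np[:, 0, :], repeated_np[:, -1, :]))
print(copies_identical)  # True
```

Both all_steps and repeated are 3D, so either can feed another LSTM layer. The difference is in the content: all_steps carries a different vector for each time step, while repeated carries 28 copies of a single summary vector.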

answered Oct 05 '22 22:10

thushv89

Tags: tensorflow, deep-learning, keras, lstm, autoencoder
