I know there are a lot of questions on this topic, but I don't understand why, in my case, both options are possible. My input shape to the LSTM is (10, 24, 2) and my hidden_size is 8.
from keras.models import Sequential
from keras.layers import LSTM, Dropout

hidden_size = 8

model = Sequential()
model.add(LSTM(hidden_size, return_sequences=True, stateful=True,
               batch_input_shape=(10, 24, 2)))
model.add(Dropout(0.1))
Why is it possible to either add this line below:
model.add(TimeDistributed(Dense(2))) # Option 1
or this one:
model.add(Dense(2)) # Option 2
Shouldn't Option 2 lead to an error, because a Dense layer expects two-dimensional input?
TimeDistributed(layer, **kwargs): this wrapper allows you to apply a layer to every temporal slice of an input. The input should be at least 3D, and the dimension at index one will be considered the temporal dimension.
The TimeDistributed layer is very useful for working with time-series data or video frames. It lets you apply the same layer to each temporal slice: instead of having several input "models", we use "one model" applied to each input. A GRU or LSTM can then handle the data in "time".
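To make "applied to every temporal slice" concrete, here is a minimal NumPy sketch (not Keras itself; the toy "video" shape and the flatten-plus-projection step are illustrative assumptions): one shared weight matrix is applied independently to each frame along the time axis, which is exactly what TimeDistributed does with its wrapped layer.

```python
import numpy as np

# Toy "video" batch: (batch, frames, height, width) -- an assumed example shape
batch, frames, h, w = 4, 5, 6, 6
features = 3

rng = np.random.default_rng(42)
video = rng.normal(size=(batch, frames, h, w))
W = rng.normal(size=(h * w, features))  # ONE set of weights shared by all frames

# TimeDistributed(layer) applies the same layer to every temporal slice video[:, t]
per_frame = np.stack(
    [video[:, t].reshape(batch, -1) @ W for t in range(frames)],
    axis=1,
)
assert per_frame.shape == (batch, frames, features)
```

The key point is that W is created once and reused for every frame, so the number of parameters does not grow with the number of timesteps.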
In your case, the two models you define are identical. This is because you use return_sequences=True, which means the Dense layer is applied to every timestep, just like TimeDistributed(Dense). If you switch to return_sequences=False, the two models are no longer identical: the TimeDistributed(Dense) version raises an error, because its input is now 2D, while the plain Dense version does not.
A more thorough explanation of a similar situation is also provided here.