I've checked the source code for both functions, and it seems that LSTM() builds the LSTM network in general, while LSTMCell() only returns one cell. However, in most cases people use only one LSTM cell in their program. Does this mean that when you have only one LSTM cell (e.g. in a simple Seq2Seq model), calling LSTMCell() and LSTM() would make no difference?


What's the difference between LSTM() and LSTMCell()?

1 Answer

LSTM is a recurrent layer.
LSTMCell is an object (which happens to be a layer too) that the LSTM layer uses; it contains the calculation logic for one step.

A recurrent layer contains a cell object. The cell contains the core code for the calculations of each step, while the recurrent layer commands the cell and performs the actual recurrent calculations.
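
To make this split concrete, here is a minimal pure-Python sketch of the idea. It is not the Keras implementation: ToyLSTMCell and ToyRecurrentLayer are hypothetical names invented for illustration, the weights are random, and the gate order (input, forget, candidate, output) is an assumption of the sketch. The cell only knows how to compute one step; the layer owns the cell and loops it over the sequence.

```python
import math
import random


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


class ToyLSTMCell:
    """Hypothetical toy cell: holds the weights and the math for ONE time step."""

    def __init__(self, input_dim, units, seed=0):
        rng = random.Random(seed)
        self.units = units
        # One weight row per gate and per unit: 4 gates * units rows in total.
        self.w = [[rng.uniform(-0.5, 0.5) for _ in range(input_dim + units)]
                  for _ in range(4 * units)]
        self.b = [0.0] * (4 * units)

    def step(self, x_t, h_prev, c_prev):
        """Compute the new hidden state h and carry state c for one step."""
        z_in = list(x_t) + list(h_prev)
        z = [sum(w * v for w, v in zip(row, z_in)) + b
             for row, b in zip(self.w, self.b)]
        u = self.units
        i = [sigmoid(v) for v in z[0:u]]            # input gate
        f = [sigmoid(v) for v in z[u:2 * u]]        # forget gate
        g = [math.tanh(v) for v in z[2 * u:3 * u]]  # candidate values
        o = [sigmoid(v) for v in z[3 * u:4 * u]]    # output gate
        c = [fk * ck + ik * gk for fk, ck, ik, gk in zip(f, c_prev, i, g)]
        h = [ok * math.tanh(ck) for ok, ck in zip(o, c)]
        return h, c


class ToyRecurrentLayer:
    """Hypothetical toy layer: owns a cell and runs it over a whole sequence."""

    def __init__(self, cell):
        self.cell = cell

    def run(self, sequence):
        # Start from zero states, then feed each step's state into the next.
        h = [0.0] * self.cell.units
        c = [0.0] * self.cell.units
        for x_t in sequence:
            h, c = self.cell.step(x_t, h, c)
        return h  # hidden state after the last time step


cell = ToyLSTMCell(input_dim=2, units=3)
layer = ToyRecurrentLayer(cell)
sequence = [[0.1, 0.2], [0.3, 0.4], [0.5, 0.6]]  # 3 time steps, 2 features each
final_h = layer.run(sequence)
print(len(final_h))  # one value per unit
```

Note that the loop over time steps lives only in the layer, while the cell is reused unchanged at every step.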

Usually, people use LSTM layers in their code, or they use RNN layers containing an LSTMCell.

The two are almost the same: an LSTM layer is an RNN layer that uses an LSTMCell, as you can check in the source code.
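
As a rough check of that equivalence, the sketch below builds both variants and copies the weights from one to the other. It assumes TensorFlow 2.x with its bundled Keras (tf.keras); the shapes, the random input, and the tolerance are arbitrary choices for illustration.

```python
import numpy as np
import tensorflow as tf

units = 4
# Assumed toy input: (batch, time steps, features)
x = np.random.rand(2, 5, 3).astype("float32")

lstm = tf.keras.layers.LSTM(units)
rnn = tf.keras.layers.RNN(tf.keras.layers.LSTMCell(units))

# Call each layer once so that its weights are created.
y_lstm = lstm(x)
_ = rnn(x)

# Both layers hold the same set of weights (kernel, recurrent kernel, bias),
# so the values can be copied across directly.
rnn.set_weights(lstm.get_weights())
y_rnn = rnn(x)

# With identical weights the two layers should agree up to float rounding.
same = np.allclose(np.asarray(y_lstm), np.asarray(y_rnn), atol=1e-5)
print(tuple(y_lstm.shape), tuple(y_rnn.shape), same)
```

In practice the LSTM layer is usually the more convenient choice, and it can use faster fused kernels on a GPU; RNN with an LSTMCell is mainly useful when you want to swap in or customize the cell.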

About the number of cells:

Although its name suggests that LSTMCell is a single cell, it is actually an object that manages all the units (what we might think of as the individual cells). In the same source code, you can see that the units argument is passed when creating an instance of LSTMCell.
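
A small sketch of this point, again assuming TensorFlow 2.x with tf.keras and arbitrary toy sizes: a single LSTMCell created with units=4 processes one time step for all 4 units at once, and its weight matrices are sized for all units and all four gates.

```python
import tensorflow as tf

units, features, batch = 4, 3, 2
cell = tf.keras.layers.LSTMCell(units)

x_t = tf.zeros((batch, features))  # input for ONE time step
h = tf.zeros((batch, units))       # previous hidden state
c = tf.zeros((batch, units))       # previous carry (cell) state

# One call is one step; the cell returns the output and the new states.
out, (h_next, c_next) = cell(x_t, [h, c])

out_shape = tuple(out.shape)                    # (batch, units)
kernel_shape = tuple(cell.kernel.shape)         # (features, 4 * units)
recurrent_shape = tuple(cell.recurrent_kernel.shape)  # (units, 4 * units)
print(out_shape, kernel_shape, recurrent_shape)
```

The factor of 4 in the weight shapes comes from the four gate computations that share one matrix, not from there being four cells.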

answered Oct 24 '22 18:10

Daniel Möller


Tags:

machine-learning

keras

narutatsuri
