I have been reading papers about LSTM and checking its implementations. There is one point that is not clear to me.
Most of the papers mention that the weight matrices from the cell to the gate vectors should be diagonal (e.g., Alex Graves, 2013, p. 5), but I haven't seen this in any implementation.
For example, this implementation [1][2]. Another example is from the MILA lab [3].
Are these people implementing it incorrectly, or am I missing something?
One popular LSTM variant, introduced by Gers & Schmidhuber (2000), adds "peephole connections." This means that we let the gate layers look at the cell state: every gate receives the cell state as an additional input alongside the current input and previous hidden state.
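For reference, the peephole gates in the notation of Graves (2013) look roughly like this (exact symbols vary between papers); the peephole weight matrices W_ci, W_cf, W_co are the ones constrained to be diagonal, so each gate unit only "peeks" at its own cell unit:

    i_t = sigmoid(W_xi x_t + W_hi h_{t-1} + W_ci c_{t-1} + b_i)
    f_t = sigmoid(W_xf x_t + W_hf h_{t-1} + W_cf c_{t-1} + b_f)
    c_t = f_t * c_{t-1} + i_t * tanh(W_xc x_t + W_hc h_{t-1} + b_c)
    o_t = sigmoid(W_xo x_t + W_ho h_{t-1} + W_co c_t + b_o)
    h_t = o_t * tanh(c_t)

(Here * denotes elementwise multiplication.)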
Comparing how the two units work, a GRU uses fewer trainable parameters, and therefore less memory, and executes faster than an LSTM, whereas an LSTM tends to be more accurate on larger datasets.
The weight matrix W contains separate weights for the current input vector and the previous hidden state for each gate. Just like a plain recurrent neural network, an LSTM generates an output at each time step, and this output is used to train the network with gradient descent.
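A minimal NumPy sketch of that idea (names and shapes are illustrative, not any particular library's API): a single stacked matrix W holds both the input and recurrent weights for all four gate blocks, applied to the concatenation of x_t and h_{t-1}.

    import numpy as np

    def sigmoid(z):
        return 1.0 / (1.0 + np.exp(-z))

    def lstm_step(x_t, h_prev, c_prev, W, b):
        # W has shape (4 * hidden, input + hidden): one row block per gate
        # (input, forget, cell candidate, output); within each block there are
        # separate columns for x_t and for h_prev.
        z = W @ np.concatenate([x_t, h_prev]) + b   # all four pre-activations at once
        i, f, g, o = np.split(z, 4)                 # slice out the per-gate parts
        i, f, o = sigmoid(i), sigmoid(f), sigmoid(o)
        g = np.tanh(g)
        c_t = f * c_prev + i * g                    # new cell state
        h_t = o * np.tanh(c_t)                      # output at this time step
        return h_t, c_t

    # Tiny usage example with input size 10 and hidden size 20.
    x_t, h_prev, c_prev = np.zeros(10), np.zeros(20), np.zeros(20)
    W, b = np.zeros((4 * 20, 10 + 20)), np.zeros(4 * 20)
    h_t, c_t = lstm_step(x_t, h_prev, c_prev, W, b)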
The Gated Recurrent Unit (GRU) is a type of recurrent neural network (RNN) that, in certain cases, has advantages over Long Short-Term Memory (LSTM). A GRU uses less memory and is faster than an LSTM; however, an LSTM is more accurate on datasets with longer sequences.
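As a rough back-of-the-envelope comparison of the parameter counts (ignoring framework-specific extras such as separate recurrent biases), an LSTM has four gate blocks and a GRU has three, each of size hidden x (input + hidden) plus a bias:

    # Hypothetical sizes: input size k, hidden size d.
    k, d = 128, 256
    lstm_params = 4 * (d * (k + d) + d)   # 4 gate blocks -> 394,240
    gru_params  = 3 * (d * (k + d) + d)   # 3 gate blocks -> 295,680
    print(lstm_params, gru_params)        # the GRU needs roughly 25% fewer parameters here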
The TensorFlow implementation does use a diagonal matrix (see the linked source). Note that what this means in practice is that the peepholes only go from each cell unit to itself, so you end up doing elementwise vector multiplies.
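Here is a minimal NumPy sketch of that point (the variable names are illustrative, not the actual TensorFlow code): multiplying the cell state by a diagonal peephole matrix gives exactly the same result as an elementwise multiply with the vector of its diagonal entries, which is what implementations actually store and compute.

    import numpy as np

    rng = np.random.default_rng(0)
    hidden = 5
    c = rng.standard_normal(hidden)       # cell state
    w_ci = rng.standard_normal(hidden)    # peephole weights stored as a vector

    # "Diagonal matrix" formulation from the papers ...
    W_ci = np.diag(w_ci)
    full = W_ci @ c

    # ... is exactly the elementwise multiply used in implementations.
    elementwise = w_ci * c

    assert np.allclose(full, elementwise)

So an implementation that keeps the peephole weights as a vector and does elementwise multiplies is not ignoring the papers; it is simply the efficient way to apply a diagonal matrix.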