Keras simple RNN implementation

I ran into a problem when trying to compile a network with one recurrent layer. There seems to be an issue with the dimensionality of the first layer, and so with my understanding of how RNN layers work in Keras.

My code sample is:

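from keras.models import Sequential
from keras.layers import Dense, SimpleRNN

model = Sequential()
# Dense layer fed 2D input: (batch, 2) -> (batch, 8)
model.add(Dense(8,
                input_dim = 2,
                activation = "tanh",
                use_bias = False))
# Adding this layer raises the ValueError shown below
model.add(SimpleRNN(2,
                    activation = "tanh",
                    use_bias = False))
model.add(Dense(1,
                activation = "tanh",
                use_bias = False))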

The error is

ValueError: Input 0 is incompatible with layer simple_rnn_1: expected ndim=3, found ndim=2

This error is raised regardless of the input_dim value. What am I missing?

asked Sep 16 '17 16:09

dev1223

1 Answer

That message means that the input going into the RNN layer has 2 dimensions, but an RNN layer expects 3 dimensions.

For an RNN layer, you need inputs shaped like (BatchSize, TimeSteps, FeaturesPerStep). These are the 3 dimensions expected.
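To make the difference between 2 and 3 dimensions concrete, here is a small NumPy sketch. The sizes (5 samples, 3 time steps, 2 features) are made up for illustration and do not come from your data.

```python
import numpy as np

# Hypothetical sizes, chosen only for illustration.
batch_size, time_steps, features = 5, 3, 2

# What an RNN layer expects: (BatchSize, TimeSteps, FeaturesPerStep)
x_seq = np.zeros((batch_size, time_steps, features), dtype="float32")
print(x_seq.ndim, x_seq.shape)    # 3 (5, 3, 2)

# What a first layer with input_dim=2 receives: (BatchSize, Features)
x_flat = np.zeros((batch_size, features), dtype="float32")
print(x_flat.ndim, x_flat.shape)  # 2 (5, 2)
```

The second array has no time axis, which is the situation the error message describes.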

A Dense layer (in Keras 2) can work with either 2 or 3 dimensions. We can see that you're working with 2 because you passed input_dim instead of input_shape=(Steps,Features).

There are many possible ways to solve this, but the most meaningful and logical would be a case where your input data is a sequence with time steps.

Solution 1 - Your training data is a sequence:

If your training data is a sequence, you shape it like (NumberOfSamples, TimeSteps, Features) and pass it to your model. Make sure you use input_shape=(TimeSteps,Features) in the first layer instead of using input_dim.
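A minimal sketch of Solution 1, assuming sequences of 5 time steps with 2 features each (both numbers are hypothetical). I use an explicit Input layer in place of input_shape=(TimeSteps,Features) on the first Dense layer, and the tensorflow.keras import path; both are my assumptions about your setup, so adjust them to your Keras version.

```python
import numpy as np
from tensorflow.keras.layers import Dense, Input, SimpleRNN
from tensorflow.keras.models import Sequential

time_steps, features = 5, 2  # hypothetical sequence length and feature count

model = Sequential([
    Input(shape=(time_steps, features)),              # 3D input: (batch, 5, 2)
    Dense(8, activation="tanh", use_bias=False),      # applied per step -> (batch, 5, 8)
    SimpleRNN(2, activation="tanh", use_bias=False),  # last step only -> (batch, 2)
    Dense(1, activation="tanh", use_bias=False),      # -> (batch, 1)
])
model.compile(optimizer="adam", loss="mse")

# Random data shaped like (NumberOfSamples, TimeSteps, Features)
x = np.random.rand(4, time_steps, features).astype("float32")
y = model.predict(x, verbose=0)
print(y.shape)
```

Because the Dense layer now receives 3D input, it acts on the last axis at every time step and hands a 3D tensor to the RNN layer.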

Solution 2 - You reshape the output of the first dense layer so it has the additional dimension:

model.add(Reshape((TimeSteps,Features)))

Make sure that the product TimeSteps*Features is equal to 8, the output size of your first dense layer.
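A minimal sketch of Solution 2, assuming you split the 8 Dense outputs into 4 time steps of 2 features (any pair whose product is 8 would do; 4 and 2 are just my pick). The explicit Input layer and the tensorflow.keras import path are also assumptions about your setup.

```python
import numpy as np
from tensorflow.keras.layers import Dense, Input, Reshape, SimpleRNN
from tensorflow.keras.models import Sequential

time_steps, features = 4, 2  # hypothetical split; 4 * 2 must equal 8

model = Sequential([
    Input(shape=(2,)),                                # 2D input: (batch, 2)
    Dense(8, activation="tanh", use_bias=False),      # -> (batch, 8)
    Reshape((time_steps, features)),                  # -> (batch, 4, 2)
    SimpleRNN(2, activation="tanh", use_bias=False),  # -> (batch, 2)
    Dense(1, activation="tanh", use_bias=False),      # -> (batch, 1)
])
model.compile(optimizer="adam", loss="mse")

x = np.random.rand(3, 2).astype("float32")
y = model.predict(x, verbose=0)
print(y.shape)
```

Keep in mind that the "time steps" here are produced by reshaping a single dense output, so they are not a real sequence from your data. That is why Solution 1 is the more meaningful option when your data actually is sequential.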

answered Oct 14 '22 12:10

Daniel Möller

Tags: machine-learning, neural-network, keras, recurrent-neural-network, rnn
