Panel data in Keras LSTM

Tags:

I am looking at panel data, which is structured like this:

D = \{(x^{(k)}_{t},y^{(k)}_{t})\,|\, k=1,\dots,N\, , t=t_0,\dots,t_k \}_{k=1}^{N}

where x^{(k)} denotes the k'th sequence, x^{(k)}_{t} denotes the k'th sequences value at time t , furthermore x^{(k)}_{i,t} is the i'th entry in the vector x^{(k)}_{t}. That is x^{(k)}_{t} is the feature vector of the k'th sequence at time t. The sub- and super scripts mean the same things for the label data y^{(k)}_{t}, but here y^{(k)}_{t} \in \{0,1\}.

In plain words: The data set contains individuals observed over time, and for each time point at which an individual is observed, it is recorded whether he bought an item or not ( y\in \{0,1\}).

I would like to use a recurrent neural network with LSTM units from Keras for the task of predicting whether a person will buy an item or not, at a given time point. I have only been able to find examples of RNN's where each sequence has a label value (philipperemy link), not an example where each sequence element has a label value as in the problem I described.

My approach so far, has been to create a tensor with dimensions (samples,timesteps,features) but I cannot figure out how to format the labels, such that keras can match them with the features. It should be something like this (samples,timesteps,1), where the last dimension indicates a single dimension to contain the label value of 0 or 1.

Furthermore some of the approaches that I have come across splits sequences such that subsequences are add to the training data, thus increasing the need for memory tremendously (mlmastery link). This is infeasible in my case, as I have multiple GB's of data, and I would not be able to store it in memory if I added subsequences.

The model I would like to use is something like this:

Click to copy

mod = Sequential()
mod.add(LSTM(30,input_dim=116, return_sequences = True))
mod.add(LSTM(10))
mod.add(Dense(2))

Does anyone have experience working with panel data in keras?

873

asked Mar 09 '17 11:03

Math_kv

1 Answers

Try:

Click to copy

mod = Sequential()
mod.add(LSTM(30, input_shape=(timesteps, features), return_sequences = True))
mod.add(LSTM(10, return_sequences = True))
mod.add(TimeDistributed(Dense(1, activation='sigmoid')))
# In newest Keras version you can change the line above to mod.add(Dense(1, ..))

mod.compile(loss='binary_crossentropy', optimizer='rmsprop')

answered Oct 22 '22 14:10

Marcin Możejko

Related questions
                            
                                Can you reverse a PyTorch neural network and activate the inputs from the outputs?
                            
                                Why does almost every Activation Function Saturate at Negative Input Values in a Neural Network
                            
                                How do filters run across an RGB image, in first layer of a CNN?
                            
                                Resilient backpropagation neural network - question about gradient
                            
                                Is it possible for Encog or Neuroph to run on Android?
                            
                                PyBrain neuron manipulation
                            
                                Conceptual issues on training neural network wih particle swarm optimization
                            
                                Training Algorithm to train this data
                            
                                Gradient checking in backpropagation
                            
                                How do we get/define filters in convolutional neural networks?
                            
                                How to use keras for XOR
                            
                                Caret Neural Network Error: "missing values in resampled performance measures"
                            
                                Changing the solver parameters in Caffe through pycaffe
                            
                                Multilayer-perceptron, visualizing decision boundaries (2D) in Python
                            
                                Multiple accuracy layers in Caffe
                            
                                XOR gate with a neural network
                            
                                MPSCNN Weight Ordering
                            
                                How is Growing Neural Gas used for clustering?
                            
                                Min-Max normalization Layer in Caffe
                            
                                Keras correct input shape for multilayer perceptron

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Panel data in Keras LSTM

Tags:

neural-network

keras

lstm

panel-data

Math_kv

People also ask

1 Answers

Marcin Możejko

Recent Activity

Donate For Us