Setting correct input for RNN

In a database there are time-series data with records:

device - timestamp - temperature - min limit - max limit
device - timestamp - temperature - min limit - max limit
device - timestamp - temperature - min limit - max limit
...

For every device there are 4 hours of time-series data (at 5-minute intervals) recorded before an alarm was raised, and 4 hours of time-series data (again at 5-minute intervals) that did not raise any alarm. This graph better describes how the data is represented for every device:

[graph image not included]

I need to use the RNN class in Python for alarm prediction. We define an alarm as the temperature going below the min limit or above the max limit.

After reading the official documentation from TensorFlow here, I'm having trouble understanding how to set the input to the model. Should I normalise the data beforehand, and if so, how?

Reading the answers here also didn't give me a clear view of how to transform my data into an acceptable format for the RNN model.

Any help on what the X and Y in model.fit should look like for my case?

If you see any other issue with this problem, feel free to comment on it.

PS. I have already set up Python in Docker with TensorFlow, Keras, etc., in case this information helps.


asked Aug 03 '20 10:08

GeorgeGeorgitsis

1 Answer

You can start from the snippet you mention in the question.

Any help on what the X and Y in model.fit should look like for my case?

X should be a 3-D NumPy array of shape [num samples, sequence length, D], where D is the number of values per timestamp. I suppose D=1 in your case, because you only pass the temperature value. With 4 hours of data at 5-minute intervals, the sequence length is 48.

y should be a vector of target values (as in the snippet), one per sample: either binary (alarm/not_alarm) or continuous (e.g. max temperature deviation). In the latter case you'd need to replace the sigmoid activation with something else, such as a linear output.
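As a rough illustration of those shapes, here is a minimal sketch that stacks equal-length temperature windows into X and y. The helper name build_inputs, the constant SEQ_LEN, and the synthetic data are assumptions made for the example, not something taken from the question's database.

```python
import numpy as np

# Assumption: each sample is one 4-hour window sampled every 5 minutes,
# i.e. 4 * 60 / 5 = 48 temperature readings.
SEQ_LEN = 48


def build_inputs(windows, labels):
    """Stack per-window readings into arrays shaped for model.fit.

    windows: sequence of windows, each holding SEQ_LEN temperature readings
    labels:  sequence of 0/1 flags (1 = the window preceded an alarm)
    """
    X = np.asarray(windows, dtype=np.float32)      # (num_samples, SEQ_LEN)
    if X.ndim != 2 or X.shape[1] != SEQ_LEN:
        raise ValueError(f"expected windows of length {SEQ_LEN}, got shape {X.shape}")
    X = X[..., np.newaxis]                         # (num_samples, SEQ_LEN, 1), so D = 1
    y = np.asarray(labels, dtype=np.float32)       # (num_samples,)
    if y.shape != (X.shape[0],):
        raise ValueError("need exactly one label per window")
    return X, y


# Synthetic stand-in data: 6 windows of made-up temperatures.
rng = np.random.default_rng(0)
windows = rng.normal(loc=5.0, scale=1.0, size=(6, SEQ_LEN))
labels = [1, 0, 1, 0, 1, 0]

X, y = build_inputs(windows, labels)
print(X.shape, y.shape)  # (6, 48, 1) (6,)
```

If you later decide to feed the min and max limits as extra features, only the last axis changes (D=3); the first two axes stay the same.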

Should I normalise the data beforehand

Yes, it's essential to preprocess your raw data. I see two crucial things to do here:

1. Normalise the temperature values with min-max scaling or standardization (wiki, sklearn preprocessing). I'd also add a bit of smoothing.
2. Drop some fraction of the last timestamps from every time series to avoid an information leak, so the model cannot simply read the alarm off the final readings.
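The preprocessing steps above could look roughly like this in plain NumPy. This is only a sketch: the function name preprocess, the choice of standardization over min-max scaling, the moving-average smoother, and the default values (dropping the last 6 readings, a window of 3) are all assumptions picked for illustration and would need tuning on real data.

```python
import numpy as np


def preprocess(X, mean, std, drop_last=6, smooth_window=3):
    """Trim, standardize and smooth windows of shape (num_samples, seq_len, 1).

    mean, std:     statistics computed on the training set only
    drop_last:     number of final readings removed from every window
    smooth_window: moving-average width; 1 disables smoothing
    """
    X = np.asarray(X, dtype=np.float64)
    if drop_last > 0:
        X = X[:, :-drop_last, :]          # cut the readings closest to the alarm
    X = (X - mean) / std                  # standardization (zero mean, unit variance)
    if smooth_window > 1:
        kernel = np.ones(smooth_window) / smooth_window
        # Moving average along the time axis; "valid" mode shortens each
        # series by smooth_window - 1 readings.
        X = np.apply_along_axis(
            lambda series: np.convolve(series, kernel, mode="valid"), 1, X
        )
    return X.astype(np.float32)


# Synthetic stand-in data: 4 windows of 48 made-up readings each.
rng = np.random.default_rng(0)
X_raw = rng.normal(loc=5.0, scale=1.0, size=(4, 48, 1))
mean, std = X_raw.mean(), X_raw.std()

X_clean = preprocess(X_raw, mean, std)
print(X_clean.shape)  # (4, 40, 1)
```

Compute mean and std on the training split and reuse them for validation and test data; otherwise the normalization itself leaks information.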

Finally, I'd say that this task is more complex than it seems. You might want to find either a good starter tutorial on time-series classification or a course on machine learning in general. I believe you can find a better method than an RNN.

answered Nov 10 '22 22:11

roman


Tags: machine-learning, neural-network, lstm, normalization, recurrent-neural-network