Keras LSTM predicted timeseries squashed and shifted

Tags:

I'm trying to get some hands on experience with Keras during the holidays, and I thought I'd start out with the textbook example of timeseries prediction on stock data. So what I'm trying to do is given the last 48 hours worth of average price changes (percent since previous), predict what the average price chanege of the coming hour is.

However, when verifying against the test set (or even the training set) the amplitude of the predicted series is way off, and sometimes is shifted to be either always positive or always negative, i.e., shifted away from the 0% change, which I think would be correct for this kind of thing.

I came up with the following minimal example to show the issue:

df = pandas.DataFrame.from_csv('test-data-01.csv', header=0)
df['pct'] = df.value.pct_change(periods=1)

seq_len=48
vals = df.pct.values[1:] # First pct change is NaN, skip it
sequences = []
for i in range(0, len(vals) - seq_len):
    sx = vals[i:i+seq_len].reshape(seq_len, 1)
    sy = vals[i+seq_len]
    sequences.append((sx, sy))

row = -24
trainSeqs = sequences[:row]
testSeqs = sequences[row:]

trainX = np.array([i[0] for i in trainSeqs])
trainy = np.array([i[1] for i in trainSeqs])

model = Sequential()
model.add(LSTM(25, batch_input_shape=(1, seq_len, 1)))
model.add(Dense(1))
model.compile(loss='mse', optimizer='adam')
model.fit(trainX, trainy, epochs=1, batch_size=1, verbose=1, shuffle=True)

pred = []
for s in trainSeqs:
    pred.append(model.predict(s[0].reshape(1, seq_len, 1)))
pred = np.array(pred).flatten()

plot(pred)
plot([i[1] for i in trainSeqs])
axis([2500, 2550,-0.03, 0.03])

As you can see, I create training and testing sequences, by selecting the last 48 hours, and the next step into a tuple, and then advancing 1 hour, repeating the procedure. The model is a very simple 1 LSTM and 1 dense layer.

I would have expected the plot of individual predicted points to overlap pretty nicely the plot of training sequences (after all this is the same set they were trained on), and sort of match for the test sequences. However I get the following result on training data:

Orange: true data
Blue: predicted data

enter image description here

Any idea what might be going on? Did I misunderstand something?

Update: to better show what I mean by shifted and squashed I also plotted the predicted values by shifting it back to match the real data and multiplied to match the amplitude.

plot(pred*12-0.03)
plot([i[1] for i in trainSeqs])
axis([2500, 2550,-0.03, 0.03])

enter image description here

As you can see the prediction nicely fits the real data, it's just squashed and offset somehow, and I can't figure out why.

272

asked Dec 30 '17 14:12

cdecker

2 Answers

~~I presume you are overfitting~~, since the dimensionality of your data is 1, and a LSTM with 25 units seems rather complex for such a low-dimensional dataset. Here's a list of things that I would try:

Decreasing the LSTM dimension.
Adding some form of regularization to combat overfitting. For example, dropout might be a good choice.
Training for more epochs or changing the learning rate. The model might need more epochs or bigger updates to find the appropriate parameters.

UPDATE. Let me summarize what we discussed in the comments section.

Just for clarification, the first plot doesn't show the predicted series for a validation set, but for the training set. Therefore, my first overfitting interpretation might be inaccurate. I think an appropriate question to ask would be: is it actually possible to predict the future price change from such a low-dimensional dataset? Machine learning algorithms aren't magical: they'll find patterns in the data only if they exist.

If the past price change alone is indeed not very informative of the future price change then:

Your model will learn to predict the mean of the price changes (probably something around 0), since that's the value that produces the lowest loss in absence of informative features.
The predictions might appear to be slightly "shifted" because the price change at timestep t+1 is slightly correlated with the price change at timestep t (but still, predicting something close to 0 is the safest choice). That is indeed the only pattern that I, as an inexpert, would be able to observe (i.e. that the value at timestep t+1 is sometimes similar to the one at timestep t).

If values at timesteps t and t+1 happened to be more correlated in general, then I presume that the model would be more confident about this correlation and the amplitude of the prediction would be bigger.

answered Nov 13 '22 03:11

rvinas

Increase the number of epochs. You can use EarlyStopping to avoid overfitting.
How's you data scaled? Time series are very sensitive to outliers in the data. Try MinMax((0.1, 0.9)) for example and then RobustScaler is also a good choice.
I'm not sure than LSTM(seq_len) is really necessary until you have a lot of data. Why not to try the smaller dimension?

Try all of this and try to overfit (mse should be around zero on a real dataset). Then apply regularizations.

UPDATE

Let me explain you why did you get by

plot(pred*12-0.03)

a good fit.

Ok, let we consider LSTM layer as black box and forget about it. It returns us 25 values - that's all. This value goes forward to the Dense layer, where we apply to the vector of 25 values function:

y = w * x + b

Here w and b - vectors that are defined by NN and in the beginning are usually near zero. x - your values after LSTM layer and y - target (single value).

While you have just 1 epoch: w and b are not fitted at all to your data (they are around zero actually). But what if you apply

plot(pred*12-0.03)

to your predicted value? You (someway) apply to the target variable w and b. Now w and b are single values, not vectors, and they are applied to single value. But they do (almost) the same work as Dense layer.

So, increase the number of epochs to get better fitting.

UPDATE2 By the way, I see some outliers in the data. You can try also to use MAE as loss/accuracy metrics.

answered Nov 13 '22 03:11

avchauzov

Related questions
                            
                                Using coverage, how do I test this line?
                            
                                Errno 2 using python shutil.py No such file or directory for file destination
                            
                                Increasing speed of a pure Numpy/Scipy convolutional neural network implementation
                            
                                Python futurize without replacing / with old_div
                            
                                where is the ./configure of TensorFlow and how to enable the GPU support?
                            
                                What does "dict-like" mean in Python?
                            
                                csv: writer.writerows() splitting my string inputs
                            
                                Should variable names have adjectives before or after the noun? [closed]
                            
                                Generating random vectors of Euclidean norm <= 1 in Python?
                            
                                Tox installs the wrong version of pip to it's virtual env
                            
                                Pandas setting multi-index on rows, then transposing to columns
                            
                                Why does Python's set difference method take time with an empty set?
                            
                                Python Data Frame: cumulative sum of column until condition is reached and return the index
                            
                                Automatic scroll down to bottom of result in ipython notebook
                            
                                Python 3.x cannot serialize Decimal() to JSON
                            
                                'No module named requests' even if I installed requests with pip
                            
                                Is there a way to set title/name of a thread in Python? [duplicate]
                            
                                Why does numpy integer subtraction produce a float64?
                            
                                Tool to automatically expand YAML merges?
                            
                                Flask Config File - 'DEBUG=True' Do Nothing

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Keras LSTM predicted timeseries squashed and shifted

Tags:

python

machine-learning

keras

lstm

time-series

cdecker

People also ask

2 Answers

rvinas

avchauzov

Recent Activity

Donate For Us