Python ARIMA model, predicted values are shifted

Tags:

I am new to Python ARIMA implementation. I have a data at 15 min frequency for few months. In my attempt to follow the Box-Jenkins method to fit a timeseries model. I ran into an issue towards the end. The ACF-PACF graph for the time series (ts) and the difference series (ts_diff) are given. I used ARIMA (5,1,2) and finally I plotted the fitted values(green) and original values(blue). As you can from figure, there is a clear shift(by one) in values. What am I doing wrong?

Is the prediction bad? Any insight will be helpful.

388

asked Feb 24 '16 05:02

Seetha Pothapragada

3 Answers

This is a standard property of one-step ahead prediction or forecasting.

The information used for the forecast is the history up to and including the previous period. A peak, for example, at a period will affect the forecast for the next period, but cannot influence the forecast for the peak period. This makes the forecasts appear shifted in the plot.

A two-step ahead forecast would give the impression of a shift by two periods.

172

answered Oct 04 '22 10:10

Josef

Just to confirm, I am doing this right then? Here is the code I used.

from statsmodels.tsa.arima_model import ARIMA
model = sm.tsa.ARIMA(ts, order=(5, 1, 2))
model = model.fit()
results_ARIMA=model.predict(typ='levels')
concatenated = pd.concat([ts, results_ARIMA], axis=1, keys=['original', 'predicted'])
concatenated.head(10)
    original    predicted
login_time      
1970-01-01 20:00:00 2   NaN
1970-01-01 20:15:00 6   2.000186
1970-01-01 20:30:00 9   4.552971
1970-01-01 20:45:00 7   7.118973
1970-01-01 21:00:00 1   7.099769
1970-01-01 21:15:00 4   3.624975
1970-01-01 21:30:00 0   3.867454
1970-01-01 21:45:00 4   1.618120
1970-01-01 22:00:00 9   2.997275
1970-01-01 22:15:00 8   6.300015

answered Oct 04 '22 11:10

Seetha Pothapragada

In the model you specify (5, 1, 2), you set d = 1. This means that you are differencing the data by 1, or in other words, performing a shift of your entire range of time-related observations so as to minimize the residuals of the fitted model.

Sometimes, setting d to 1 will result in a ACF / PACF plot with fewer and / or less dramatic spikes (i.e. less extreme residuals). In such cases, if you use the model you have fitted to predict future values, your predictions will deviate less dramatically from the observations you have if you apply differencing.

Differencing is accomplished through Y(differenced) = Y(t) - Y(t-d), where Y(t) refers to observed value Y at timeindex t, and d refers to the order of differencing you apply. When you use differencing, your entire range of observations basically shifts to the right. This means you lose some data at the left edge of your time series. How many time points you lose depends on the order of differencing d you use. This is where your observed shift comes from.

This page may offer a more elaborate explanation (make sure to click around a bit and explore the other pages on there if you want a treatment of the whole process of fitting an ARIMA model).

Hope this helps (or at least puts your mind at ease about the shift)!

Bests,

Evert

answered Oct 04 '22 11:10

Evert van Doorn

Related questions
                            
                                Get current user Async in Tornado
                            
                                changing x-axis tick labels using ggplot
                            
                                Dynamic order in django-mptt
                            
                                How to 'raw text' a variable in Python?
                            
                                Python create zip file
                            
                                Multiply two pandas series with mismatched indices
                            
                                Yielding a value from a coroutine in Python, a.k.a. convert callback to generator
                            
                                Odoo computed fields: works without store=True, doesn't work with store=True
                            
                                Python - Node.js (V8) runtime is not available on this system
                            
                                How to specify Python interpreter version in VIM?
                            
                                pattern for saving newline-delimited json (aka linejson, jsonlines, .jsonl files) with python
                            
                                Grouping list combinations for round-robin tournament
                            
                                Serializer validated_data is empty even when is_valid is True
                            
                                OpenCV with AWS Lambda
                            
                                How do I automatically kill a process that uses too much memory with Python?
                            
                                EOF occurred in violation of protocol with Python ftplib
                            
                                Python Selenium Send Keys Giving Warning about size
                            
                                Django Pagination too slow with large dataset
                            
                                SQLAlchemy Declarative: How to merge models and existing business logic classes
                            
                                Python Pyinstaller 3.1 Intel MKL FATAL ERROR: Cannot load mkl_intel_thread.dll

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Python ARIMA model, predicted values are shifted

Tags:

python

statsmodels

Seetha Pothapragada

People also ask

3 Answers

Josef

Seetha Pothapragada

Evert van Doorn

Recent Activity

Donate For Us