Detecting seasonality without two full periods of data

Tags:

time-series

I have the following dataset (df) with 20 months:

There should be seasonality, and I want to estimate this and remove it. I attempted this using the below code:

df <- ts(df$price, frequency = 12, start = c(2016,8))
decompose_df <- decompose( , "additive")
adjust_df<- df- decompose_df $seasonal
plot(adjust_df)

But as I only have 20 months and not two full periods of data, I get the below error:

Error in decompose(df, "additive") : 
time series has no or less than 2 periods

Is there a way I can test and remove this seasonality? Even though I only have 20 periods when I need 24.

215

asked Jul 13 '18 16:07

1 Answers

It's not possible using the usual methods of decomposition because they estimate seasonality using at least as many degrees of freedom as there are seasonal periods. As @useR has pointed out, you need at least two observations per seasonal period to be able to distinguish seasonality from noise.

However, if you are willing to assume that the seasonality is relatively smooth, then you can estimate it using fewer degrees of freedom. For example, you can approximate the seasonal pattern using Fourier terms with a few parameters.

df <- ts(c(
2735.869,2857.105,2725.971,2734.809,2761.314,2828.224,2830.284,2758.149,
2774.943,2782.801,2861.970,2878.688,3049.229,3029.340,3099.041,3071.151,
3075.576,3146.372,3005.671,3149.381), start=c(2016,8), frequency=12)

library(forecast)
library(ggplot2)
decompose_df <- tslm(df ~ trend + fourier(df, 2))
trend <- coef(decompose_df)[1] + coef(decompose_df)['trend']*seq_along(df)
components <- cbind(
  data = df,
  trend = trend,  
  season = df - trend - residuals(decompose_df),
  remainder = residuals(decompose_df)
)
autoplot(components, facet=TRUE)

enter image description here

You can adjust the order of the Fourier terms as required. I've used 2 here. For monthly data, the maximum you can use is 6, but that will give a model with 13 degrees of freedom which is way too many with only 20 observations. If you don't know about Fourier terms for seasonality, see https://otexts.org/fpp2/useful-predictors.html#fourier-series.

Now we can remove the seasonal component to get the seasonally adjusted data.

adjust_df <- df - components[,'season']
autoplot(df, series="Data") + autolayer(adjust_df, series="Seasonally adjusted")

enter image description here

171

answered Oct 27 '22 05:10

Rob Hyndman

Related questions
                            
                                add mean line to ggplot
                            
                                Rotating text of secondary axis labels
                            
                                Understanding lookahead in R regexp
                            
                                Error in vapply ... values must be length 1, but FUN(X[[11]]) result is length 0
                            
                                How to pop up the graphics window from Rscript?
                            
                                whole earth polygon for world map in ggplot2 (and sf)
                            
                                Add columns to a reactive data frame in Shiny and update them
                            
                                length of 'dimnames' [2] not equal to array extent when using corrplot function from a matrix read from a csv file
                            
                                rmarkdown pdf font not available
                            
                                How to delete all "Values" in RStudio Environment?
                            
                                lme4::glmer vs. Stata's melogit command
                            
                                Can we use Base R to find the 95% of the area under a curve?
                            
                                Split vector into dataframe at every nth element
                            
                                r Shiny action button and data table output
                            
                                How to merge lists with identical column names to get their union
                            
                                ggplot italicize part of caption AND divide text over two lines
                            
                                Dealing with large numbers in R [Inf] and Python
                            
                                Extract column name in mutate_if call
                            
                                Combining plots created by R base, lattice, and ggplot2
                            
                                Harnessing .f list names with purrr::pmap

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With