Fable: Extracting the p,d,q specification from an ARIMA model

Tags: r, arima, fable-r

I've been using the tidy forecasting package fable (which has been so useful).

I was wondering whether there is an easy way to extract the p, d, q values from a mable.

Using the data from this guide as an example: https://www.mitchelloharawild.com/blog/fable/

library(tidyverse)
library(tsibble)
library(fable)

tourism_state <- tourism %>% 
  group_by(State) %>% 
  summarise(Trips = sum(Trips))

fit <- tourism_state %>% 
  model(arima = ARIMA(Trips))

> fit
# A mable: 8 x 2
# Key:     State [8]
  State                                 arima
  <chr>                               <model>
1 ACT                          <ARIMA(0,1,1)>
2 New South Wales    <ARIMA(0,1,1)(0,1,1)[4]>
3 Northern Territory <ARIMA(1,0,1)(0,1,1)[4]>
4 Queensland                   <ARIMA(2,1,2)>
5 South Australia    <ARIMA(1,0,1)(0,1,1)[4]>
6 Tasmania           <ARIMA(0,0,3)(2,1,0)[4]>
7 Victoria           <ARIMA(0,1,1)(0,1,1)[4]>
8 Western Australia            <ARIMA(0,1,3)>

I know the specifications are stored under model[[1]]$fit$spec, but I can't figure out a way to extract them when I have a large list of models.

Ideally I'd like

  State                                 arima       p     d       q
  <chr>                               <model>
1 ACT                          <ARIMA(0,1,1)>       0     1       1
2 New South Wales    <ARIMA(0,1,1)(0,1,1)[4]>       0     1       1
3 Northern Territory <ARIMA(1,0,1)(0,1,1)[4]>       1     0       1
4 Queensland                   <ARIMA(2,1,2)>       
5 South Australia    <ARIMA(1,0,1)(0,1,1)[4]>       and so on....
6 Tasmania           <ARIMA(0,0,3)(2,1,0)[4]>
7 Victoria           <ARIMA(0,1,1)(0,1,1)[4]>
8 Western Australia            <ARIMA(0,1,3)>

Thanks!

asked Aug 13 '20 10:08

usually_confused

1 Answer

What about this?

# specifically needed libraries from the tidyverse
library(dplyr)
library(purrr)

fit %>%
  mutate(map_dfr(arima, c("fit", "spec")))

#> # A mable: 8 x 10
#> # Key:     State [8]
#>   State                                 arima     p     d     q     P     D     Q constant period
#>   <chr>                               <model> <int> <int> <int> <int> <int> <int> <lgl>     <dbl>
#> 1 ACT                          <ARIMA(0,1,1)>     0     1     1     0     0     0 FALSE         4
#> 2 New South Wales    <ARIMA(0,1,1)(0,1,1)[4]>     0     1     1     0     1     1 FALSE         4
#> 3 Northern Territory <ARIMA(1,0,1)(0,1,1)[4]>     1     0     1     0     1     1 FALSE         4
#> 4 Queensland                   <ARIMA(2,1,2)>     2     1     2     0     0     0 FALSE         4
#> 5 South Australia    <ARIMA(1,0,1)(0,1,1)[4]>     1     0     1     0     1     1 FALSE         4
#> 6 Tasmania           <ARIMA(0,0,3)(2,1,0)[4]>     0     0     3     2     1     0 FALSE         4
#> 7 Victoria           <ARIMA(0,1,1)(0,1,1)[4]>     0     1     1     0     1     1 FALSE         4
#> 8 Western Australia            <ARIMA(0,1,3)>     0     1     3     0     0     0 FALSE         4

It works on R >= 4.0 and dplyr >= 1.0.

The arima column is a list. We can use map to extract data from lists.

map itself returns a list, but map_dfr returns a data frame, which mutate interprets as a new set of columns to add to the original data frame.

Note that with this code the output and the input keep the same class (mable).
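To make the mechanism easier to see without fitting any models, here is a minimal sketch on mock data. The `mock_models` list and its values are hypothetical stand-ins for the `arima` column (not real fable objects); the only assumption is that each element carries a one-row data frame at `fit$spec`, as in the output above. It also shows one way to keep only p, d and q, as asked in the question.

```r
library(dplyr)
library(purrr)

# Hypothetical stand-ins for the fitted models in the `arima` column:
# each element is a nested list holding a one-row data frame at fit$spec.
mock_models <- list(
  list(fit = list(spec = tibble(p = 0L, d = 1L, q = 1L, P = 0L, D = 0L, Q = 0L))),
  list(fit = list(spec = tibble(p = 2L, d = 1L, q = 2L, P = 0L, D = 0L, Q = 0L)))
)

# A character vector passed as .f is treated as a path, i.e. x[["fit"]][["spec"]];
# map_dfr() then row-binds the extracted one-row data frames.
specs <- map_dfr(mock_models, c("fit", "spec"))

# An unnamed data frame inside mutate() is unpacked into new columns (dplyr >= 1.0).
# select() keeps only the non-seasonal orders asked for in the question.
result <- tibble(State = c("ACT", "Queensland")) %>%
  mutate(select(specs, p, d, q))

print(result)
```

On the real mable the same idea would presumably be written as `fit %>% mutate(select(map_dfr(arima, c("fit", "spec")), p, d, q))`, though I have only checked the mock version here. Also note that `map_dfr` is marked as superseded in purrr 1.0.0; it still works, and `map()` followed by `list_rbind()` is the suggested replacement there.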

answered Sep 30 '22 18:09

Edo
