I am using R for some statistical analysis of time series. I have tried Googling around, but I can't seem to find any definitive answers. Can any one who knows more please point me in the right direction? Example: Let's say I want to do a linear regression of two time series. The time series contain daily data, but there might be gaps here and there so the time series are not regular. Naturally I only want to compare data points where both time series have data. This is what I do currently to read the csv files into a data frame: <pre class="prettyprint"><code>library(zoo) apples <- read.csv('/Data/apples.csv', as.is=TRUE) oranges <- read.csv('/Data/oranges.csv', as.is=TRUE) apples$date <- as.Date(apples$date, "%d/%m/%Y") oranges$date <- as.Date(oranges$date, "%d/%m/%Y") zapples <- zoo(apples$close,apples$date) zoranges <- zoo(oranges$close,oranges$date) zdata <- merge(zapples, zoranges, all=FALSE) data <- as.data.frame(zdata) </code></pre> Is there a slicker way of doing this? Also, how can I slice the data, e.g., select the entries in <code>data</code> with dates within a certain period?

Try something along these lines. This assumes that the dates are in column 1. The dyn package can be used to transform <code>lm</code>, <code>glm</code> and many similar regression type functions to ones that accept zoo series. Write <code>dyn$lm</code> in place of <code>lm</code> as shown: <pre class="prettyprint"><code>library(dyn) # also loads zoo fmt <- "%d/%m/%Y" zapples <- read.zoo('apples.csv', header = TRUE, sep = ",", format = fmt) zoranges <- read.zoo('oranges.csv', header = TRUE, sep = ",", format = fmt) zdata <- merge(zapples, zoranges) dyn$lm(..whatever.., zdata) </code></pre> You don't need <code>all = FALSE</code> since <code>lm</code> will ignore rows with NAs under the default setting of its <code>na.action</code> argument. The <code>window.zoo</code> function can be used to slice data. Depending on what you want to do you might also want to look at the xts and quantmod packages.

What is the best practice of handling time series in R?

Tags:

r

time-series

I am using R for some statistical analysis of time series. I have tried Googling around, but I can't seem to find any definitive answers. Can any one who knows more please point me in the right direction?

Example:

Let's say I want to do a linear regression of two time series. The time series contain daily data, but there might be gaps here and there so the time series are not regular. Naturally I only want to compare data points where both time series have data. This is what I do currently to read the csv files into a data frame:

library(zoo)
apples <- read.csv('/Data/apples.csv', as.is=TRUE)
oranges <- read.csv('/Data/oranges.csv', as.is=TRUE)
apples$date <- as.Date(apples$date, "%d/%m/%Y")
oranges$date <- as.Date(oranges$date, "%d/%m/%Y")
zapples <- zoo(apples$close,apples$date)
zoranges <- zoo(oranges$close,oranges$date)
zdata <- merge(zapples, zoranges, all=FALSE)
data <- as.data.frame(zdata)

Is there a slicker way of doing this?

Also, how can I slice the data, e.g., select the entries in data with dates within a certain period?

837

asked Feb 11 '11 02:02

c00kiemonster

1 Answers

Try something along these lines. This assumes that the dates are in column 1. The dyn package can be used to transform lm, glm and many similar regression type functions to ones that accept zoo series. Write dyn$lm in place of lm as shown:

library(dyn) # also loads zoo
fmt <- "%d/%m/%Y"
zapples <- read.zoo('apples.csv', header = TRUE, sep = ",", format = fmt)
zoranges <- read.zoo('oranges.csv', header = TRUE, sep = ",", format = fmt)
zdata <- merge(zapples, zoranges)
dyn$lm(..whatever.., zdata)

You don't need all = FALSE since lm will ignore rows with NAs under the default setting of its na.action argument.

The window.zoo function can be used to slice data.

Depending on what you want to do you might also want to look at the xts and quantmod packages.

answered Nov 15 '22 20:11

G. Grothendieck

Related questions
                            
                                Spread with duplicate identifiers (using tidyverse and %>%) [duplicate]
                            
                                `purrr::map` to any type
                            
                                Remove rows with the same value across all columns
                            
                                Remove specific last character from string
                            
                                Error with H2O in R - can't connect to local host
                            
                                How to Transpose (t) in the Tidyverse Using Tidyr
                            
                                R: Remove duplicates from a dataframe based on categories in a column
                            
                                Show content for menuItem when menuSubItems exist in Shiny Dashboard
                            
                                Reducing spacing between lines when using atop
                            
                                How to include NA data in a table
                            
                                Dynamic variable names in R regressions
                            
                                How to recode a range of rows in between two specific values
                            
                                How to trim white spaces when trimws is not working?
                            
                                How to draw a point in polar coordinates with negative r?
                            
                                "Hmisc" package or namespace failed to load - no package called 'latticeExtra'
                            
                                Is it possible to draw the axis line first, before the data?
                            
                                Correlation clustering in R
                            
                                Getting the contents of a library interactively in R
                            
                                predict.svm does not predict new data
                            
                                Changing user agent string in a http request in R

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With