I would like to get rolling average for each of the numeric variables that I have. Using data.table package, I know how to compute for a single variable. But how should I revise the code so it can process multiple variables at a time rather than revising the variable name and repeat this procedure for several times? Thanks. Suppose I have other numeric variables named as "V2", "V3", and "V4". <pre class="prettyprint"><code>require(data.table) setDT(data) setkey(data,Receptor,date) data[ , `:=` ('RollConc' = rollmean(AvgConc, 48, align="left", na.pad=TRUE)) , by=Receptor] </code></pre> A copy of my sample data can be found at: https://drive.google.com/file/d/0B86_a8ltyoL3OE9KTUstYmRRbFk/view?usp=sharing I would like to get 5-hour rolling means for "AvgConc","TotDep","DryDep", and "WetDep" by each receptor.

From your description you want something like this, which is similar to one example that can be found in one of the data.table vignettes: <pre class="prettyprint"><code>library(data.table) set.seed(42) DT <- data.table(x = rnorm(10), y = rlnorm(10), z = runif(10), g = c("a", "b"), key = "g") library(zoo) DT[, paste0("ravg_", c("x", "y")) := lapply(.SD, rollmean, k = 3, na.pad = TRUE), by = g, .SDcols = c("x", "y")] </code></pre>

Now, one can use the <code>frollmean</code> function in the <code>data.table</code> package for this. <pre class="prettyprint"><code>library(data.table) xy <- c("x", "y") DT[, (xy):= lapply(.SD, frollmean, n = 3, fill = NA, align="center"), by = g, .SDcols = xy] </code></pre> Here, I am replacing the x and y columns by the rolling average. <hr> <pre class="prettyprint"><code># Data set.seed(42) DT <- data.table(x = rnorm(10), y = rlnorm(10), z = runif(10), g = c("a", "b"), key = "g") </code></pre>

rolling average to multiple variables in R using data.table package

Tags:

r

data.table

moving-average

I would like to get rolling average for each of the numeric variables that I have. Using data.table package, I know how to compute for a single variable. But how should I revise the code so it can process multiple variables at a time rather than revising the variable name and repeat this procedure for several times? Thanks.

Suppose I have other numeric variables named as "V2", "V3", and "V4".

require(data.table)
setDT(data)
setkey(data,Receptor,date)
data[ , `:=` ('RollConc' = rollmean(AvgConc, 48, align="left", na.pad=TRUE)) , by=Receptor]

A copy of my sample data can be found at: https://drive.google.com/file/d/0B86_a8ltyoL3OE9KTUstYmRRbFk/view?usp=sharing

I would like to get 5-hour rolling means for "AvgConc","TotDep","DryDep", and "WetDep" by each receptor.

602

asked Jul 17 '15 18:07

Vicki1227

2 Answers

From your description you want something like this, which is similar to one example that can be found in one of the data.table vignettes:

library(data.table)
set.seed(42)
DT <- data.table(x = rnorm(10), y = rlnorm(10), z = runif(10), g = c("a", "b"), key = "g")
library(zoo)
DT[, paste0("ravg_", c("x", "y")) := lapply(.SD, rollmean, k = 3, na.pad = TRUE), 
   by = g, .SDcols = c("x", "y")]

answered Oct 15 '22 18:10

Roland

Now, one can use the frollmean function in the data.table package for this.

library(data.table)    
xy <- c("x", "y")
DT[, (xy):= lapply(.SD, frollmean, n = 3, fill = NA, align="center"), 
                                   by = g, .SDcols =  xy]

Here, I am replacing the x and y columns by the rolling average.

# Data
set.seed(42)
DT <- data.table(x = rnorm(10), y = rlnorm(10), z = runif(10), 
                                g = c("a", "b"), key = "g")

answered Oct 15 '22 19:10

Suren

Related questions
                            
                                How to show the progress of code in parallel computation in R?
                            
                                Vectorisation of for loop with multiple conditions
                            
                                How to change the histogram borderline thickness in ggplot geom_histogram()
                            
                                How to do str_extract with base R?
                            
                                How to convert searchTwitter results (from library(twitteR)) into a data.frame?
                            
                                R + ggplot2: how to hide missing dates from x-axis?
                            
                                Repeat headers when using xtable with longtable option
                            
                                Filtering a data frame
                            
                                sort and output records with SAS and R
                            
                                Problems installing R on Linux CentOS 6.2
                            
                                select last observation from longitudinal data
                            
                                How to rotate the axis labels in ggplot2?
                            
                                Select rows from data.frame ending with a specific character string in R
                            
                                R- Collapse rows and sum the values in the column
                            
                                Is there an fread analog for reading from stdin?
                            
                                Reshape multiple categorical variables to binary response variables
                            
                                "Error in int_abline...plot.new has not been called yet"
                            
                                How to order a column by group in R
                            
                                Create group number for contiguous runs of equal values
                            
                                Remove trailing and leading spaces and extra internal whitespace with one gsub call

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With