Calculate cumsum from the end towards the beginning

Tags: r, reverse, cumsum

I'm trying to calculate the cumsum starting from the last row towards the first for each group.

Sample data:

t1 <- data.frame(var = "a", val = c(0,0,0,0,1,0,0,0,0,1,0,0,0,0,0))
t2 <- data.frame(var = "b", val = c(0,0,0,0,1,0,0,1,0,0,0,0,0,0,0))
ts <- rbind(t1, t2)

Desired output (cumulative sum computed within each var group, from the last row up to the first):

ts <- data.frame(var = c("a", "a", "a", "a", "a", "a", "a", "a", "a", "a", "a", "a", "a", "a", "a",
                           "b", "b", "b", "b", "b", "b", "b", "b", "b", "b", "b", "b", "b", "b", "b"), 
                 val = c(2,2,2,2,2,1,1,1,1,1,0,0,0,0,0,2,2,2,2,2,1,1,1,0,0,0,0,0,0,0))


asked May 18 '18 14:05

adl

2 Answers

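Promoting my comment to an answer: reverse each group's values, take the cumulative sum, then reverse the result back into the original order. Using ave: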

ts$val2 <- ave(ts$val, ts$var, FUN = function(x) rev(cumsum(rev(x))))

gives:

> ts
   var val val2
1    a   0    2
2    a   0    2
3    a   0    2
4    a   0    2
5    a   1    2
6    a   0    1
7    a   0    1
8    a   0    1
9    a   0    1
10   a   1    1
11   a   0    0
12   a   0    0
13   a   0    0
14   a   0    0
15   a   0    0
16   b   0    2
17   b   0    2
18   b   0    2
19   b   0    2
20   b   1    2
21   b   0    1
22   b   0    1
23   b   1    1
24   b   0    0
25   b   0    0
26   b   0    0
27   b   0    0
28   b   0    0
29   b   0    0
30   b   0    0

Or with dplyr or data.table:

library(dplyr)
ts %>% 
  group_by(var) %>%
  mutate(val2 = rev(cumsum(rev(val))))

library(data.table)
setDT(ts)[, val2 := rev(cumsum(rev(val))), by = var]
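
To see why the double reversal works, here is a minimal sketch on a single small vector. The vector x is made up for illustration and is not part of the question's data:

```r
# Hypothetical toy vector, not taken from the question's data
x <- c(0, 0, 1, 0, 1, 0)

rev(x)                              # the values read from the end: 0 1 0 1 0 0
cumsum(rev(x))                      # running total from the last element: 0 1 1 2 2 2
rev_cumsum <- rev(cumsum(rev(x)))   # flipped back into the original order
rev_cumsum                          # 2 2 2 1 1 0

# The same result without reversing: total minus the forward cumsum,
# plus x so that each position still counts its own value
alt <- sum(x) - cumsum(x) + x
all.equal(rev_cumsum, alt)
```

The second form is only an equivalent arithmetic identity, not something the grouped solutions above depend on; either expression can be used as the FUN in ave or inside mutate.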


answered Oct 19 '22 13:10

Jaap

An option without explicitly reversing the vector:

ave(ts$val, ts$var, FUN = function(x) Reduce(sum, x, right = TRUE, accumulate = TRUE))

 [1] 2 2 2 2 2 1 1 1 1 1 0 0 0 0 0 2 2 2 2 2 1 1 1 0 0 0 0 0 0 0

Or the same approach with dplyr:

library(dplyr)
# note: this overwrites val with the reverse cumulative sum
ts %>%
  group_by(var) %>%
  mutate(val = Reduce(sum, val, right = TRUE, accumulate = TRUE))
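
As a rough check of what Reduce is doing here, the sketch below runs it on one small vector and compares it with the explicit rev/cumsum/rev form. The vector x is a made-up example, not the question's data:

```r
# Hypothetical toy vector, not taken from the question's data
x <- c(0, 0, 1, 0, 1, 0)

# right = TRUE folds from the last element towards the first;
# accumulate = TRUE keeps every intermediate sum instead of only the final one
from_reduce <- Reduce(sum, x, right = TRUE, accumulate = TRUE)

# The explicit double reversal, for comparison
from_rev <- rev(cumsum(rev(x)))

from_reduce
all.equal(from_reduce, from_rev)
```

Both give the same values. Reduce calls the function once per element at the R level, so on long vectors the cumsum version is likely to be faster, though that is worth measuring on your own data.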

answered Oct 19 '22 13:10

tmfmnk
