Cumulative total by group

Tags:

r

data.table

For the following dataset:

d = data.frame(date = as.Date(as.Date('2015-01-01'):as.Date('2015-04-10'), origin = "1970-01-01"),
               group = rep(c('A','B','C','D'), 25), value = sample(1:100))
head(d)
         date group value
1: 2015-01-01     A     4
2: 2015-01-02     B    32
3: 2015-01-03     C    46
4: 2015-01-04     D    40
5: 2015-01-05     A    93
6: 2015-01-06     B    10

.. can anyone advise a more elegant way to calculate a cumulative total of values by group than this data.table) method?

library(data.table)
setDT(d)
d.cast = dcast.data.table(d, group ~ date, value.var = 'value', fun.aggregate = sum)
c.sum = d.cast[, as.list(cumsum(unlist(.SD))), by = group]

.. which is pretty clunky and yields a flat matrix that needs dplyr::gather or reshape2::melt to reformat.

Surely R can do better than this??

439

asked May 22 '15 14:05

geotheory

2 Answers

If you just want cumulative sums per group, then you can do

transform(d, new=ave(value,group,FUN=cumsum))

with base R.

163

answered Sep 22 '22 05:09

MrFlick

This should work

library(dplyr)
d %>% 
  group_by(group) %>% 
  arrange(date) %>% 
  mutate(Total = cumsum(value))

answered Sep 20 '22 05:09

Akhil Nair

Related questions
                            
                                ggplot2: How to specify multiple fill colors for points that are connected by lines of different colors
                            
                                how to generate random numbers with sequence in R
                            
                                how to draw arrow in ggplot2 with annotation
                            
                                Change thickness of a marker in ggplot2
                            
                                How can I shorten x-axis label text in ggplot?
                            
                                What is the most useful output format for graphs? [closed]
                            
                                Loop through netcdf files and run calculations - Python or R
                            
                                reading multiple csv files in R [duplicate]
                            
                                R: Compare all the columns pairwise in matrix
                            
                                error with scale_x_labels in ggplot2
                            
                                How can I summarizing data statistics using R
                            
                                Captions on tables in pdf documents generated by rmarkdown
                            
                                How do I evaluate columns inside data.table with different conditions
                            
                                Change the order of elements in vector in R
                            
                                Visualizing time series in spirals using R or Python?
                            
                                Creating variable in R data frame depending on another data frame
                            
                                Convert List of Vectors into Data Frame of Counts [duplicate]
                            
                                Remove duplicate column pairs, sort rows based on 2 columns [duplicate]
                            
                                How to construct a named list (a SEXP) to be returned from the C function called with .Call()?
                            
                                Changing node/vertice opacity in iGraph in R

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With