Adding another grouping with dplyr

Tags: r, dplyr

I would like to mutate a data frame twice, grouping by two overlapping sets of columns (the second set adds one column to the first), i.e.:

df <- df %>% group_by(a, b) %>% mutate(x = sum(d))
df <- df %>% group_by(a, b, c) %>% mutate(y = sum(e))

Is there a faster/more elegant way to do this? I was hoping to be able to do something like:

df <- df %>%
    group_by(a, b) %>%
    mutate(x = sum(d)) %>%
    group_by(c) %>%
    mutate(y = sum(e))

Or perhaps save a variable with the first group_by applied and then use it twice.

asked Oct 29 '15 18:10

Sam Brightman

1 Answer

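Use add=TRUE in the second group_by to add c to the existing groups, so that the data is grouped by all three variables (a, b, c) as in the OP's example. Note that in dplyr 1.0.0 and later this argument is spelled .add, and add is deprecated: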

 df %>%
   group_by(a, b) %>%
   mutate(x = sum(d)) %>%
   group_by(c, add=TRUE) %>%
   mutate(y = sum(e))

According to the documentation (?group_by):

By default, when add = FALSE, group_by will override existing groups. To instead add to the existing groups, use add = TRUE
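
To see the difference between overriding and adding groups, here is a minimal sketch on a made-up data frame. The data values are hypothetical (only the column names come from the question), and it assumes dplyr 1.0.0 or later, where the argument is named .add; on older versions, substitute add = TRUE.

```r
library(dplyr)

# Hypothetical toy data; only the column names follow the question.
df <- data.frame(
  a = c(1, 1, 1, 2),
  b = c(1, 1, 1, 1),
  c = c("u", "u", "v", "u"),
  d = c(1, 2, 3, 4),
  e = c(10, 20, 30, 40)
)

grouped <- df %>% group_by(a, b)

# Without .add, the second group_by replaces the existing groups.
replaced <- grouped %>% group_by(c)
group_vars(replaced)
# "c"

# With .add = TRUE, c is appended to the existing groups.
extended <- grouped %>% group_by(c, .add = TRUE)
group_vars(extended)
# "a" "b" "c"
```

This is also why the chained version in the question does not do what was hoped: its second group_by(c) groups by c alone.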

The same result can be obtained with a single group_by call, but only by using a non-dplyr function such as base R's ave for the finer grouping:

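 df %>%
   group_by(a, b) %>%
   # ave() takes its function via the named FUN argument; unnamed arguments are treated as grouping variables
   mutate(x = sum(d), y = ave(e, c, FUN = sum))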
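
As a quick check that the two approaches agree, the following sketch runs both on a made-up data frame and compares the results. The data values are hypothetical, the code assumes dplyr 1.0.0 or later (hence .add rather than add), and ave is called with FUN = sum since it only accepts its function through the named FUN argument.

```r
library(dplyr)

# Hypothetical toy data; only the column names follow the question.
df <- data.frame(
  a = c(1, 1, 1, 2),
  b = c(1, 1, 1, 1),
  c = c("u", "u", "v", "u"),
  d = c(1, 2, 3, 4),
  e = c(10, 20, 30, 40)
)

# Two grouping steps: x per (a, b), then y per (a, b, c).
two_step <- df %>%
  group_by(a, b) %>%
  mutate(x = sum(d)) %>%
  group_by(c, .add = TRUE) %>%
  mutate(y = sum(e)) %>%
  ungroup()

# One grouping step: ave() splits e by c inside each (a, b) group.
one_step <- df %>%
  group_by(a, b) %>%
  mutate(x = sum(d), y = ave(e, c, FUN = sum)) %>%
  ungroup()

# Both comparisons should report TRUE.
all.equal(two_step$x, one_step$x)
all.equal(two_step$y, one_step$y)
```

For this toy data, x is 6, 6, 6, 4 and y is 30, 30, 30, 40 under either approach.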

answered Oct 23 '22 04:10

akrun
