Programmatically dropping a `group_by` field in dplyr

Tags:

r

dplyr

I'm writing functions that take in a data.frame and then do some operations. I need to add and remove variables from the group_by criteria in order to get where I want to go.

If I want to add a group_by criterion to a df, that's pretty easy:

library(tidyverse)
set.seed(42)
n <- 10
input <- data.frame(a = 'a', 
                    b = 'b' , 
                    vals = 1
)

grouped <- input %>%
  group_by(a)

grouped
#> # A tibble: 1 x 3
#> # Groups:   a [1]
#>   a     b      vals
#>   <fct> <fct> <dbl>
#> 1 a     b        1.

## add a group (dplyr >= 1.0.0 renamed this argument to .add):
grouped %>% 
  group_by(b, add=TRUE)
#> # A tibble: 1 x 3
#> # Groups:   a, b [1]
#>   a     b      vals
#>   <fct> <fct> <dbl>
#> 1 a     b        1.

## drop a group?

But how do I programmatically drop the grouping by b which I added, yet keep all other groupings the same?

asked May 08 '18 15:05

JD Long

1 Answer

Here's an approach that uses tidyeval so that bare column names can be used as the function arguments. I'm not sure if it makes sense to convert the bare column names to text (as I've done below) or if there's a more elegant way to work directly with the bare column names.

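drop_groups = function(data, ...) {

  # Current grouping variables and the groups to drop, both as character vectors
  groups = map_chr(groups(data), rlang::quo_text)
  drop = map_chr(quos(...), rlang::quo_text)

  # Warn about requested groups that the data is not actually grouped by
  if(any(!drop %in% groups)) {
    warning(paste("Input data frame is not grouped by the following groups:", 
                  paste(drop[!drop %in% groups], collapse=", ")))
  }

  # Regroup by whatever remains after removing the dropped groups
  data %>% group_by_at(setdiff(groups, drop))

}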

d = mtcars %>% group_by(cyl, vs, am)

groups(d %>% drop_groups(vs, cyl))

[[1]]
am

groups(d %>% drop_groups(a, vs, b, c))

[[1]]
cyl

[[2]]
am

Warning message:
In drop_groups(., a, vs, b, c) :
  Input data frame is not grouped by the following groups: a, b, c
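The string-based idea above can also be sketched for the case where the groups to drop are already held as a character vector. This is only a sketch, assuming dplyr 1.0.0 or later (for `across()` and `all_of()`); the function name `drop_groups_chr` and its `drop` argument are hypothetical names chosen for illustration, not part of dplyr.

```r
library(dplyr)

# Hypothetical helper: drop the grouping variables named in the character
# vector `drop`, keeping all other groupings in their original order.
drop_groups_chr <- function(data, drop) {
  current <- group_vars(data)   # current grouping variables as strings

  # Warn about requested groups that the data is not grouped by
  not_grouped <- setdiff(drop, current)
  if (length(not_grouped) > 0) {
    warning("Input data frame is not grouped by the following groups: ",
            paste(not_grouped, collapse = ", "))
  }

  keep <- setdiff(current, drop)

  # Nothing left to group by: return an ungrouped data frame
  if (length(keep) == 0) {
    return(ungroup(data))
  }

  # group_by() replaces the existing grouping by default
  data %>% group_by(across(all_of(keep)))
}

d <- mtcars %>% group_by(cyl, vs, am)
group_vars(drop_groups_chr(d, c("vs", "cyl")))
#> [1] "am"
```

Taking strings rather than bare names avoids tidyeval inside the helper, which can be convenient when the group names are computed elsewhere in the calling function.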

UPDATE: The approach below works directly with quosured column names, without converting them to strings. I'm not sure which approach is "preferred" in the tidyeval paradigm, or whether there is yet another, more desirable method.

drop_groups2 = function(data, ...) {

  groups = map(groups(data), quo)
  drop = quos(...)

  if(any(!drop %in% groups)) {
    warning(paste("Input data frame is not grouped by the following groups:", 
                  paste(drop[!drop %in% groups], collapse=", ")))
  }

  data %>% group_by(!!!setdiff(groups, drop))

}
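On the question of whether there is a more direct method: assuming dplyr 1.0.0 or later, `ungroup()` accepts column names and removes only those variables from the grouping, so a custom helper may not be needed. The sketch below reuses the `mtcars` example from above; how `ungroup()` treats names that are not grouping variables can differ between dplyr versions, so that case is not shown.

```r
library(dplyr)

d <- mtcars %>% group_by(cyl, vs, am)

# Remove only `vs` and `cyl` from the grouping; `am` stays grouped
res <- d %>% ungroup(vs, cyl)

group_vars(res)
#> [1] "am"
```

Calling `ungroup()` with no column names still removes all grouping, as before.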

answered Nov 08 '22 08:11

eipi10
