I want to use use the <code>dplyr::group_by</code> function inside another function, but I do not know how to pass the arguments to this function. Can someone provide a working example? <pre class="prettyprint"><code>library(dplyr) data(iris) iris %.% group_by(Species) %.% summarise(n = n()) # ## Source: local data frame [3 x 2] ## Species n ## 1 virginica 50 ## 2 versicolor 50 ## 3 setosa 50 mytable0 <- function(x, ...) x %.% group_by(...) %.% summarise(n = n()) mytable0(iris, "Species") # OK ## Source: local data frame [3 x 2] ## Species n ## 1 virginica 50 ## 2 versicolor 50 ## 3 setosa 50 mytable1 <- function(x, key) x %.% group_by(as.name(key)) %.% summarise(n = n()) mytable1(iris, "Species") # Wrong! # Error: unsupported type for column 'as.name(key)' (SYMSXP) mytable2 <- function(x, key) x %.% group_by(key) %.% summarise(n = n()) mytable2(iris, "Species") # Wrong! # Error: index out of bounds </code></pre>

For programming, <code>group_by_</code> is the counterpart to <code>group_by</code>: <pre class="prettyprint"><code>library(dplyr) mytable <- function(x, ...) x %>% group_by_(...) %>% summarise(n = n()) mytable(iris, "Species") # or iris %>% mytable("Species") </code></pre> which gives: <pre class="prettyprint"><code> Species n 1 setosa 50 2 versicolor 50 3 virginica 50 </code></pre> Update At the time this was written dplyr used <code>%.%</code> which is what was originally used above but now <code>%>%</code> is favored so have changed above to that to keep this relevant. Update 2 regroup is now deprecated, use group_by_ instead. Update 3 <code>group_by_(list(...))</code> now becomes <code>group_by_(...)</code> in new version of dplyr as per Roberto's comment. Update 4 Added minor variation suggested in comments. Update 5: With rlang/tidyeval it is now possible to do this: <pre class="prettyprint"><code>library(rlang) mytable <- function(x, ...) { group_ <- syms(...) x %>% group_by(!!!group_) %>% summarise(n = n()) } mytable(iris, "Species") </code></pre> or passing <code>Species</code> unevaluated, i.e. no quotes around it: <pre class="prettyprint"><code>library(rlang) mytable <- function(x, ...) { group_ <- enquos(...) x %>% group_by(!!!group_) %>% summarise(n = n()) } mytable(iris, Species) </code></pre> Update 6: There is now a {{...}} notation that works if there is just one grouping variable: <pre class="prettyprint"><code>mytable <- function(x, group) { x %>% group_by({{group}}) %>% summarise(n = n()) } mytable(iris, Species) </code></pre>

UPDATE: As of dplyr 0.7.0 you can use tidy eval to accomplish this. See http://dplyr.tidyverse.org/articles/programming.html for more details. <pre class="prettyprint"><code>library(tidyverse) data("iris") my_table <- function(df, group_var) { group_var <- enquo(group_var) # Create quosure df %>% group_by(!!group_var) %>% # Use !! to unquote the quosure summarise(n = n()) } my_table(iris, Species) > my_table(iris, Species) # A tibble: 3 x 2 Species n <fctr> <int> 1 setosa 50 2 versicolor 50 3 virginica 50 </code></pre>

dplyr: How to use group_by inside a function?

Tags:

r

dplyr

tidyeval

nse

I want to use use the dplyr::group_by function inside another function, but I do not know how to pass the arguments to this function.

Can someone provide a working example?

library(dplyr) data(iris) iris %.% group_by(Species) %.% summarise(n = n()) #  ## Source: local data frame [3 x 2] ##      Species  n ## 1  virginica 50 ## 2 versicolor 50 ## 3     setosa 50  mytable0 <- function(x, ...) x %.% group_by(...) %.% summarise(n = n()) mytable0(iris, "Species") # OK ## Source: local data frame [3 x 2] ##      Species  n ## 1  virginica 50 ## 2 versicolor 50 ## 3     setosa 50  mytable1 <- function(x, key) x %.% group_by(as.name(key)) %.% summarise(n = n()) mytable1(iris, "Species") # Wrong! # Error: unsupported type for column 'as.name(key)' (SYMSXP)  mytable2 <- function(x, key) x %.% group_by(key) %.% summarise(n = n()) mytable2(iris, "Species") # Wrong! # Error: index out of bounds

678

asked Feb 16 '14 17:02

Emilio Torres Manzanera

2 Answers

For programming, group_by_ is the counterpart to group_by:

library(dplyr)  mytable <- function(x, ...) x %>% group_by_(...) %>% summarise(n = n()) mytable(iris, "Species") # or iris %>% mytable("Species")

which gives:

     Species  n 1     setosa 50 2 versicolor 50 3  virginica 50

Update At the time this was written dplyr used %.% which is what was originally used above but now %>% is favored so have changed above to that to keep this relevant.

Update 2 regroup is now deprecated, use group_by_ instead.

Update 3 group_by_(list(...)) now becomes group_by_(...) in new version of dplyr as per Roberto's comment.

Update 4 Added minor variation suggested in comments.

Update 5: With rlang/tidyeval it is now possible to do this:

library(rlang) mytable <- function(x, ...) {   group_ <- syms(...)   x %>%      group_by(!!!group_) %>%      summarise(n = n()) } mytable(iris, "Species")

or passing Species unevaluated, i.e. no quotes around it:

library(rlang) mytable <- function(x, ...) {   group_ <- enquos(...)   x %>%      group_by(!!!group_) %>%      summarise(n = n()) } mytable(iris, Species)

Update 6: There is now a {{...}} notation that works if there is just one grouping variable:

mytable <- function(x, group) {   x %>%      group_by({{group}}) %>%      summarise(n = n()) } mytable(iris, Species)

130

answered Oct 03 '22 05:10

G. Grothendieck

UPDATE: As of dplyr 0.7.0 you can use tidy eval to accomplish this.

See http://dplyr.tidyverse.org/articles/programming.html for more details.

library(tidyverse) data("iris")  my_table <- function(df, group_var) {   group_var <- enquo(group_var)      # Create quosure   df %>%      group_by(!!group_var) %>%        # Use !! to unquote the quosure     summarise(n = n()) }  my_table(iris, Species)  > my_table(iris, Species) # A tibble: 3 x 2      Species     n       <fctr> <int> 1     setosa    50 2 versicolor    50 3  virginica    50

answered Oct 03 '22 05:10

Brad Cannell

Related questions
                            
                                Techniques for finding near duplicate records
                            
                                Include files R?
                            
                                What is the difference between cat and print?
                            
                                When should I use setDT() instead of data.table() to create a data.table?
                            
                                R Shiny set DataTable column width
                            
                                R knitr: Possible to programmatically modify chunk labels?
                            
                                No non-missing arguments warning when using min or max in reshape2
                            
                                Get a list of the data sets in a particular package
                            
                                reshape vs. reshape2 in R
                            
                                extracting standardized coefficients from lm in R
                            
                                How to get the name of the calling function inside the called routine?
                            
                                What are Replacement Functions in R?
                            
                                Sort matrix according to first column in R
                            
                                Set R plots x axis to show at y=0
                            
                                Reading data from PDF files into R
                            
                                Solution. How to install_github when there is a proxy
                            
                                Extract matrix column values by matrix column name
                            
                                How to slice data from a middle index until the end without using `length` in R (like you can in python)?
                            
                                Adjust Transparency (alpha) of stat_smooth lines, not just transparency of Confidence Interval
                            
                                lambda-like functions in R?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With