dplyr: what's the difference between the group_by and group_by_ functions?

Tags:

dplyr

I can't figure out what group_by_(), the underscore version of group_by(), is for.

From the group_by help:

by_cyl <- group_by(mtcars, cyl)  
summarise(by_cyl, mean(disp), mean(hp))

yields the expected:

Source: local data frame [3 x 3]
  cyl mean(disp)  mean(hp)
1   4   105.1364  82.63636
2   6   183.3143 122.28571
3   8   353.1000 209.21429

but this:

by_cyl <- group_by_(mtcars, cyl)

yields an error:

"Error in as.lazy_dots(list(...)) : object 'cyl' not found"

So my question is: what does the underscore version do? And under what circumstances would I want to use it rather than the "regular" one?

Thanks

494

asked Feb 23 '15 04:02

1 Answer

The dplyr Non-Standard Evaluation vignette helps here: http://cran.r-project.org/web/packages/dplyr/vignettes/nse.html

Note: the link above is now out of date, but the same information can be found in the package's GitHub repository: https://github.com/tidyverse/dplyr/blob/34423af89703b0772d59edcd0f3485295b629ab0/vignettes/nse.Rmd

Dplyr uses non-standard evaluation (NSE) in all of the most important single table verbs: filter(), mutate(), summarise(), arrange(), select() and group_by(). NSE is important not only to save you typing, but for database backends, is what makes it possible to translate your R code to SQL. However, while NSE is great for interactive use it’s hard to program with. This vignette describes how you can opt out of NSE in dplyr, and instead rely only on SE (along with a little quoting).
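As a rough illustration of the distinction the vignette draws, here is a base R sketch. `capture_expr()` is a hypothetical helper written for this answer, not part of dplyr, and dplyr's own capture machinery is more involved. The point is only that an NSE function looks at the expression you typed, while standard evaluation looks up its value first.

```r
# Hypothetical helper: returns the text of its argument without evaluating it.
capture_expr <- function(x) deparse(substitute(x))

# NSE: works even though no object called `cyl` exists in the workspace.
nse_result <- capture_expr(cyl)

# SE: an ordinary function evaluates its argument, so R looks for an
# object called `cyl` and signals an error when there is none.
se_result <- tryCatch(identity(cyl), error = function(e) conditionMessage(e))

print(nse_result)
print(se_result)
```

This is why `group_by_(mtcars, cyl)` in the question fails: the underscore version evaluates `cyl` in the standard way and finds no such object.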

...

Every function in dplyr that uses NSE also has a version that uses SE. There’s a consistent naming scheme: the SE is the NSE name with _ on the end. For example, the SE version of summarise() is summarise_(), the SE version of arrange() is arrange_(). These functions work very similarly to their NSE cousins, but the inputs must be “quoted”.
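Applied to the question, "quoted" means the column has to be passed as a string, a one-sided formula, or a `quote()`d expression, which are the quoting styles the vignette goes on to describe. The sketch below lists those legacy calls as comments only, because the underscore verbs were deprecated in dplyr 0.7.0. It then gives a runnable equivalent that assumes dplyr 1.0.0 or later. `mean_by()` and its argument names are hypothetical, chosen for illustration.

```r
library(dplyr)

# Legacy SE calls (deprecated since dplyr 0.7.0), shown for reference only:
#   group_by_(mtcars, "cyl")
#   group_by_(mtcars, ~cyl)
#   group_by_(mtcars, quote(cyl))

# Hypothetical helper: the grouping and value columns arrive as strings,
# which is the programming use case the underscore verbs were aimed at.
mean_by <- function(data, group_col, value_col) {
  data %>%
    group_by(across(all_of(group_col))) %>%
    summarise(mean_value = mean(.data[[value_col]]), .groups = "drop")
}

by_cyl <- mean_by(mtcars, "cyl", "disp")
print(by_cyl)
```

Calling `mean_by(mtcars, "cyl", "disp")` should give the same per-cylinder means as the `mean(disp)` column in the question's output, under the column name `mean_value`. That answers the "when would I use it" part: reach for the SE style (or its modern replacement) when column names are held in variables, for example inside your own functions.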

137

answered Oct 20 '22 14:10

r.bot
