specify dplyr column names [duplicate]

How can I pass a column name to dplyr when I do not know the name in advance and want to supply it through a variable?

e.g. this works:

require(dplyr)
df <- as.data.frame(matrix(1:9, ncol = 3, nrow = 3))
df$group <- c("A", "B", "A")
gdf <- df %.% group_by(group) %.% summarise(m1 = mean(V1), m2 = mean(V2), m3 = mean(V3))

But this does not:

require(dplyr)
someColumn <- "group"
df <- as.data.frame(matrix(1:9, ncol = 3, nrow = 3))
df$group <- c("A", "B", "A")
gdf <- df %.% group_by(someColumn) %.% summarise(m1 = mean(V1), m2 = mean(V2), m3 = mean(V3))

asked Jan 27 '14 19:01

user3241888

2 Answers

I just gave a similar answer over at Group by multiple columns in dplyr, using string vector input, but for good measure: functions that allow you to operate on columns using strings have been added to dplyr. These have the same name as the regular dplyr functions, but end in an underscore. The functions are described in detail in this vignette.

Given df and someColumn from the OP, this now works a treat:

gdf <- df %>% group_by_(someColumn) %>% summarise(m1=mean(V1),m2=mean(V2),m3=mean(V3))

Note that it is group_by_, rather than group_by, and the %>% operator is used as %.% is deprecated.
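For readers on a recent dplyr: the underscore verbs such as group_by_ were themselves deprecated later on (from dplyr 0.7.0, as far as I know). Here is a minimal sketch of one current way to do the same thing, assuming dplyr 1.0 or later. It uses the .data pronoun to look the column up by the string held in someColumn; df and someColumn are rebuilt from the question so the snippet runs on its own.

```r
library(dplyr)

someColumn <- "group"
df <- as.data.frame(matrix(1:9, ncol = 3, nrow = 3))
df$group <- c("A", "B", "A")

# .data[[someColumn]] looks up the column whose name is stored in someColumn
gdf <- df %>%
  group_by(.data[[someColumn]]) %>%
  summarise(m1 = mean(V1), m2 = mean(V2), m3 = mean(V3))
```

This should give one row per value of the group column, the same grouping as the group_by_ call above.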

answered Sep 30 '22 20:09

edward

Here's an answer to this straightforward question, obtained by picking through Hadley's solution in the duplicate he linked.

gdf <- df %.% regroup(lapply(someColumn, as.symbol)) %.% summarise(m1 = mean(V1), m2 = mean(V2), m3 = mean(V3))

FWIW, my use case involved grouping by one variable column and one constant column. The solution to that is:

gdf <- df %.% regroup(lapply(c('constant_column', someColumn), as.symbol)) %.% summarise(m1 = mean(V1), m2 = mean(V2), m3 = mean(V3))
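regroup is no longer part of current dplyr, so here is a rough equivalent of the two-column case, assuming dplyr 1.0 or later. The constant_column values and the groupCols name are made up for illustration; across(all_of(...)) selects the grouping columns from a character vector.

```r
library(dplyr)

someColumn <- "group"
df <- as.data.frame(matrix(1:9, ncol = 3, nrow = 3))
df$group <- c("A", "B", "A")
df$constant_column <- "x"  # hypothetical column whose name is known in advance

# Mix a fixed column name with one supplied through a variable
groupCols <- c("constant_column", someColumn)

gdf <- df %>%
  group_by(across(all_of(groupCols))) %>%
  summarise(m1 = mean(V1), m2 = mean(V2), m3 = mean(V3), .groups = "drop")
```

all_of() raises an error if any name in groupCols is missing from df, which is usually what you want when the names come from a variable.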

Finally, the posted eval solution doesn't work. It just creates a new column in which every value is whatever someColumn evaluates to.
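A small sketch of that failure mode, assuming a current dplyr (older versions may behave differently or raise an error instead). Passing the variable straight to group_by treats its value as data rather than as a column name, and wrapping it in eval() should behave the same way because evaluating a string just returns the string.

```r
library(dplyr)

someColumn <- "group"
df <- as.data.frame(matrix(1:9, ncol = 3, nrow = 3))
df$group <- c("A", "B", "A")

# The string "group" is recycled into a new constant column named someColumn,
# so every row ends up in the same single group.
bad <- df %>%
  group_by(someColumn) %>%
  summarise(m1 = mean(V1))
```

The result has a single row, which shows that the original group column was never used for grouping.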

answered Sep 30 '22 21:09

StatSandwich

Tags: r, group-by, dplyr, columnname