<p>I am working with R Shiny for some exploratory data analysis. I have two checkbox inputs that contain only the user-selected options. The first checkbox input contains only the categorical variables; the second checkbox contains only numeric variables. Next, I apply a <code>groupby</code> on these two selections:</p> <pre class="prettyprint"><code>var1 <- input$variable1 # Checkbox with categorical variables var2 <- input$variable2 # Checkbox with numerical variables v$data <- dataset %>% group_by_(var1) %>% summarize_(Sum = interp(~sum(x), x = as.name(var2))) %>% arrange(desc(Sum)) </code></pre> <p>When only one categorical variable is selected, this <code>groupby</code> works perfectly. When multiple categorical variables are chosen, this <code>groupby</code> returns an array with column names. How do I pass this array of column names to <code>dplyr</code>'s <code>groupby</code>? </p>

<h3>dplyr version >1.0</h3> <p>With more recent versions of <code>dplyr</code>, you should use <code>across</code> along with a tidyselect helper function. See <code>help("language", "tidyselect")</code> for a list of all the helper functions. In this case if you want all columns in a character vector, use <code>all_of()</code></p> <pre class="prettyprint"><code>cols <- c("mpg","hp","wt") mtcars %>% group_by(across(all_of(cols))) %>% summarize(x=mean(gear)) </code></pre> <h3>original answer (older versions of dplyr)</h3> <p>If you have a vector of variable names, you should pass them to the <code>.dots=</code> parameter of <code>group_by_</code>. For example:</p> <pre class="prettyprint"><code>mtcars %>% group_by_(.dots=c("mpg","hp","wt")) %>% summarize(x=mean(gear)) </code></pre>

dplyr - groupby on multiple columns using variable names

Tags:

r

group-by

dplyr

shiny

I am working with R Shiny for some exploratory data analysis. I have two checkbox inputs that contain only the user-selected options. The first checkbox input contains only the categorical variables; the second checkbox contains only numeric variables. Next, I apply a groupby on these two selections:

var1 <- input$variable1      # Checkbox with categorical variables var2 <- input$variable2      # Checkbox with numerical variables  v$data <- dataset %>%   group_by_(var1) %>%   summarize_(Sum = interp(~sum(x), x = as.name(var2))) %>%   arrange(desc(Sum))

When only one categorical variable is selected, this groupby works perfectly. When multiple categorical variables are chosen, this groupby returns an array with column names. How do I pass this array of column names to dplyr's groupby?

216

asked Dec 28 '15 04:12

Neil

1 Answers

dplyr version >1.0

With more recent versions of dplyr, you should use across along with a tidyselect helper function. See help("language", "tidyselect") for a list of all the helper functions. In this case if you want all columns in a character vector, use all_of()

cols <- c("mpg","hp","wt") mtcars %>%     group_by(across(all_of(cols))) %>%     summarize(x=mean(gear))

original answer (older versions of dplyr)

If you have a vector of variable names, you should pass them to the .dots= parameter of group_by_. For example:

mtcars %>%     group_by_(.dots=c("mpg","hp","wt")) %>%     summarize(x=mean(gear))

141

answered Oct 01 '22 01:10

MrFlick

Related questions
                            
                                width and gap of geom_bar (ggplot2)
                            
                                Displaying text below the plot generated by ggplot2
                            
                                Setting up a 3D matrix in R and accessing certain elements
                            
                                R Function for returning ALL factors
                            
                                How do I quickly convert the size element of file.info() from bytes to KB, MB, GB, etc.?
                            
                                How to test when condition returns numeric(0) in R
                            
                                How to read in numbers with a comma as decimal separator?
                            
                                How to preserve base data frame rownames upon filtering in dplyr chain
                            
                                Is Rgraphviz no longer available for R? [duplicate]
                            
                                Exclude columns by names in mutate_at in dplyr
                            
                                Connecting across missing values with geom_line
                            
                                Showing different axis labels using ggplot2 with facet_wrap
                            
                                How expensive is it to compute the eigenvalues of a matrix?
                            
                                How do I put more space between the axis labels and axis title in an R boxplot
                            
                                R equivalent of SELECT DISTINCT on two or more fields/variables
                            
                                geom_bar bars not displaying when specifying ylim
                            
                                Vectorizing a matrix [duplicate]
                            
                                How to subset from a list in R
                            
                                Formatting mouse over labels in plotly when using ggplotly
                            
                                Count the number of non-zero elements of each column

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With