I'd like to use <code>summarise_each()</code> to apply multiple functions to a grouped dataset. However, rather than apply each function to all columns, I'd like to apply each function to particular subsets. I realize I could do this by specifying each column with <code>summarise()</code>, but I have many variables. Is there an alternate solution to either 1) using <code>summarise_each()</code> and then deleting the unneeded columns or 2) saving the <code>group_by()</code> result, performing multiple separate <code>summarise_each()</code> operations and combining the results? If this is not clear, let me know and I can try to illustrate with some example code.

I would suggest the following: here I would like to apply min function to one variable and max function to other. Then I simply merge those with the grouping variable. <pre class="prettyprint"><code>> by_species <- iris %>% group_by(Species) </code></pre> Start with variable for which I want to apply the min function: <blockquote> min_var <- by_species %>% summarise_each(funs(min), Petal.Width) min_var Source: local data frame [3 x 2] </blockquote> <pre class="prettyprint"><code> Species Petal.Width (fctr) (dbl) 1 setosa 0.1 2 versicolor 1.0 3 virginica 1.4 </code></pre> Then the variable for which I want to apply the max function: <blockquote> max_var <- by_species %>% summarise_each(funs(max), Sepal.Width) max_var Source: local data frame [3 x 2] </blockquote> <pre class="prettyprint"><code> Species Sepal.Width (fctr) (dbl) 1 setosa 4.4 2 versicolor 3.4 3 virginica 3.8 </code></pre> Now, we just merge the above two: <blockquote> left_join(min_var,max_var) Joining by: "Species" Source: local data frame [3 x 3] </blockquote> <pre class="prettyprint"><code> Species Petal.Width Sepal.Width (fctr) (dbl) (dbl) 1 setosa 0.1 4.4 2 versicolor 1.0 3.4 3 virginica 1.4 3.8 </code></pre>

dplyr summarise_each() using multiple functions for different column subsets across the same groups

Tags:

r

dplyr

I'd like to use summarise_each() to apply multiple functions to a grouped dataset. However, rather than apply each function to all columns, I'd like to apply each function to particular subsets. I realize I could do this by specifying each column with summarise(), but I have many variables.

Is there an alternate solution to either 1) using summarise_each() and then deleting the unneeded columns or 2) saving the group_by() result, performing multiple separate summarise_each() operations and combining the results?

If this is not clear, let me know and I can try to illustrate with some example code.

831

asked Jan 16 '16 00:01

Cotton.Rockwood

Video Answer

1 Answers

I would suggest the following: here I would like to apply min function to one variable and max function to other. Then I simply merge those with the grouping variable.

> by_species <- iris %>% group_by(Species)

Start with variable for which I want to apply the min function:

min_var <- by_species %>% summarise_each(funs(min), Petal.Width) min_var Source: local data frame [3 x 2]

      Species Petal.Width
       (fctr)       (dbl)
1     setosa         0.1
2 versicolor         1.0
3  virginica         1.4

Then the variable for which I want to apply the max function:

max_var <- by_species %>% summarise_each(funs(max), Sepal.Width) max_var Source: local data frame [3 x 2]

     Species Sepal.Width
      (fctr)       (dbl)
 1     setosa         4.4
 2 versicolor         3.4
 3  virginica         3.8

Now, we just merge the above two:

left_join(min_var,max_var) Joining by: "Species" Source: local data frame [3 x 3]

      Species Petal.Width Sepal.Width
     (fctr)       (dbl)       (dbl)
1     setosa         0.1         4.4
2 versicolor         1.0         3.4
3  virginica         1.4         3.8

137

answered Oct 24 '22 11:10

Rushad Faridi

Related questions
                            
                                Using R and Sensor Accelerometer Data to Detect a Jump
                            
                                Combine plots with grid.arrange and adjust plot size and axis label
                            
                                R list as key for hash
                            
                                R: Enable autocompletion in custom class
                            
                                r strptime (R version 3.2.2 )
                            
                                R kmeans (stats) vs Kmeans (amap)
                            
                                When does the object returned by invisible() cease to be invisible?
                            
                                Equivalent to \Sexpr{} for Python, etc., in knitr + RMarkdown?
                            
                                Is it possible to sample from a conditional density in R given some conditional data?
                            
                                Create ggplot2 plot in memory?
                            
                                Pass column name to function from mutate_each
                            
                                R: How to change plot background color for a specific range in ggvis shiny app
                            
                                force "apply" to return a matrix?
                            
                                What causes this ggplot2 facet bug?
                            
                                Calculating partial correlation adjusted for a categorical variable
                            
                                How to Make RStudio Presentation Self-contained?
                            
                                Regression table in latex from splm
                            
                                how to override the 2GB memory limit when R starts
                            
                                User-specified attributes of data.table get removed
                            
                                Fastest way to apply function to all pairwise combinations of columns

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With