I would like to be able to use <code>dplyr</code>'s split-apply-combine strategy to the apply the <code>summary()</code> command. Take a simple data frame: <pre class="prettyprint"><code>df <- data.frame(class = c('A', 'A', 'B', 'B'), value = c(100, 120, 800, 880)) </code></pre> Ideally we would do something like this: <pre class="prettyprint"><code>df %>% group_by(class) %>% do(summary(.$value)) </code></pre> Unfortunately this does not work. Any ideas?

You can use the SE version of <code>data_frame</code>, that is, <code>data_frame_</code> and perform: <pre class="prettyprint"><code>df %>% group_by(class) %>% do(data_frame_(summary(.$value))) </code></pre> Alternatively, you can use <code>as.list()</code> wrapped by <code>data.frame()</code> with the argument <code>check.names = FALSE</code>: <pre class="prettyprint"><code>df %>% group_by(class) %>% do(data.frame(as.list(summary(.$value)), check.names = FALSE)) </code></pre> Both versions produce: <pre class="prettyprint"><code># Source: local data frame [2 x 7] # Groups: class [2] # # class Min. 1st Qu. Median Mean 3rd Qu. Max. # (fctr) (dbl) (dbl) (dbl) (dbl) (dbl) (dbl) # 1 A 100 105 110 110 115 120 # 2 B 800 820 840 840 860 880 </code></pre>

using dplyr's do() with summary()

Tags:

r

dplyr

summary

I would like to be able to use dplyr's split-apply-combine strategy to the apply the summary() command.

Take a simple data frame:

Click to copy

df <- data.frame(class = c('A', 'A', 'B', 'B'),
                 value = c(100, 120, 800, 880))

Ideally we would do something like this:

Click to copy

df %>%
  group_by(class) %>%
  do(summary(.$value))

Unfortunately this does not work. Any ideas?

803

asked Mar 28 '16 12:03

Bastiaan Quast

1 Answers

You can use the SE version of data_frame, that is, data_frame_ and perform:

Click to copy

df %>%
  group_by(class) %>%
  do(data_frame_(summary(.$value)))

Alternatively, you can use as.list() wrapped by data.frame() with the argument check.names = FALSE:

Click to copy

df %>%
  group_by(class) %>%
  do(data.frame(as.list(summary(.$value)), check.names = FALSE))

Both versions produce:

Click to copy

# Source: local data frame [2 x 7]
# Groups: class [2]
# 
#    class  Min. 1st Qu. Median  Mean 3rd Qu.  Max.
#   (fctr) (dbl)   (dbl)  (dbl) (dbl)   (dbl) (dbl)
# 1      A   100     105    110   110     115   120
# 2      B   800     820    840   840     860   880

answered Nov 15 '22 07:11

JasonAizkalns

Related questions
                            
                                List files on HTTP/FTP server in R
                            
                                Load all files from folder and subfolders
                            
                                Mutate with dplyr using multiple conditions
                            
                                Converting day of week to number in R
                            
                                Concatenate (paste) elements based on indices
                            
                                Group by and select min date with data.table
                            
                                R: error installing packages UBUNTU - Error in dyn.load(file, DLLpath = DLLpath, ...) : unable to load shared object
                            
                                RPostgreSQL - import dataframe into a table
                            
                                Vectorization of a for-loop in R
                            
                                R strsplit doesn't split on "."?
                            
                                Create a two-mode frequency matrix in R
                            
                                Rmarkdown table with cells that have two values
                            
                                Change color of specific tick in ggplot2
                            
                                How to create a conditional dummy in R?
                            
                                Create N random integers with no gaps
                            
                                Reading multiple JSON files in a directory into one Data Frame
                            
                                Find all possible substrings of length n
                            
                                How to include a header based on a condition in knitr
                            
                                Expand Data Frame
                            
                                Add a series of elements in different locations within a vector

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

using dplyr's do() with summary()

Tags:

r

dplyr

summary

Bastiaan Quast

People also ask

1 Answers

JasonAizkalns

Recent Activity

Donate For Us