How can I collapse a data frame by some variables, taking the mean across others?

Tags: r, ggplot2, pivot-table

I need to summarize a data frame by some variables, ignoring the others. This is sometimes referred to as collapsing. For example, if I have a data frame like this:

Widget Type Energy  
egg 1 20  
egg 2 30  
jap 3 50  
jap 1 60

Then collapsing by Widget and taking the mean, with Energy as the dependent variable (Energy ~ Widget), would yield:

Widget Energy  
egg  25  
jap  55

In Excel the closest functionality might be pivot tables. I've worked out how to do it in Python (http://alexholcombe.wordpress.com/2009/01/26/summarizing-data-by-combinations-of-variables-with-python/), and here's an example in R that uses the doBy library to do something closely related (http://www.mail-archive.com/[email protected]/msg02643.html). But is there an easy way to do the above? Even better, is there anything built into the ggplot2 library to create plots that collapse across some variables?

asked Apr 01 '10 04:04

Alex Holcombe

1 Answer

Use aggregate to summarize across a factor (read.table names the unnamed columns V1, V2 and V3):

> df <- read.table(textConnection('
+ egg 1 20
+ egg 2 30
+ jap 3 50
+ jap 1 60'))
> aggregate(df$V3, list(df$V1), mean)
  Group.1  x
1     egg 25
2     jap 55
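
With named columns, the formula interface to aggregate lets you write the collapse in the same Energy ~ Widget form used in the question. This is a minimal sketch, assuming the column names from the question's example; the data frame name widgets is made up for illustration.

```r
# Example data from the question; 'widgets' is an illustrative name.
widgets <- data.frame(
  Widget = c("egg", "egg", "jap", "jap"),
  Type   = c(1, 2, 3, 1),
  Energy = c(20, 30, 50, 60)
)

# Mean Energy within each Widget. Type is ignored because it does not
# appear in the formula.
collapsed <- aggregate(Energy ~ Widget, data = widgets, FUN = mean)
print(collapsed)
```

The result is a data frame with one row per Widget, and its columns keep their original names instead of Group.1 and x.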

For more flexibility, look at the tapply function and the plyr package.
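
As a rough illustration of the tapply route, the sketch below uses only base R and the question's example data. The names widgets, means and by_both are made up for illustration.

```r
# Example data from the question; 'widgets' is an illustrative name.
widgets <- data.frame(
  Widget = c("egg", "egg", "jap", "jap"),
  Type   = c(1, 2, 3, 1),
  Energy = c(20, 30, 50, 60)
)

# One grouping variable: tapply returns a named 1-d array,
# with one mean per group, indexable by group name.
means <- tapply(widgets$Energy, widgets$Widget, mean)
print(means)

# Two grouping variables: the result is a Widget-by-Type matrix,
# with NA for combinations that never occur in the data.
by_both <- tapply(widgets$Energy, list(widgets$Widget, widgets$Type), mean)
print(by_both)
```

Unlike aggregate, tapply returns an array rather than a data frame, which is convenient for lookups such as means[["egg"]] but less convenient for passing on to plotting functions.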

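In ggplot2, use stat_summary to summarize while plotting (in ggplot2 3.3.0 and later the argument is fun; older versions used fun.y):

library(ggplot2)
ggplot(df, aes(V1, V3)) + stat_summary(fun = mean, geom = "bar", width = 0.4)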

answered Nov 03 '22 23:11

Jyotirmoy Bhattacharya
