I have the following sample <code>data.table</code>: <pre class="prettyprint"><code>dtb <- data.table(a=sample(1:100,100), b=sample(1:100,100), id=rep(1:10,10)) </code></pre> I would like to aggregate all columns (a and b, though they should be kept separate) by id using <code>colSums</code>, for example. What is the correct way to do this? The following does not work: <pre class="prettyprint"><code> dtb[,colSums, by="id"] </code></pre> This is just a sample and my table has many columns so I want to avoid specifying all of them in the function name

this is actually what i was looking for and is mentioned in the FAQ: <pre class="prettyprint"><code>dtb[,lapply(.SD,mean),by="id"] </code></pre>

aggregating multiple columns in data.table

Tags:

dataframe

r

aggregate

data.table

I have the following sample data.table:

dtb <- data.table(a=sample(1:100,100), b=sample(1:100,100), id=rep(1:10,10))

I would like to aggregate all columns (a and b, though they should be kept separate) by id using colSums, for example. What is the correct way to do this? The following does not work:

 dtb[,colSums, by="id"]

This is just a sample and my table has many columns so I want to avoid specifying all of them in the function name

463

asked Jul 27 '12 20:07

Alex

1 Answers

this is actually what i was looking for and is mentioned in the FAQ:

dtb[,lapply(.SD,mean),by="id"]

143

answered Oct 22 '22 14:10

Alex

Related questions
                            
                                Argument is of length zero
                            
                                Changing the Color of negative numbers to Red in a table generated with xtable()?
                            
                                heatmap-like plot, but for categorical variables
                            
                                Return the character associated with the specified Ascii code in R
                            
                                Set global thousand separator on knitr
                            
                                Lazy sequences in R
                            
                                Shift values in single column of dataframe up
                            
                                "subset" and "[" on dataframe give slightly different results, why?
                            
                                how to download and display an image from an URL in R?
                            
                                Dataframe within dataframe?
                            
                                How to make part of rmarkdown document without section numbering?
                            
                                R: data.table count !NA per row
                            
                                Exclude function from R package manual
                            
                                Replace a subset of a data frame with dplyr join operations
                            
                                How to remove warning messages in R Markdown document?
                            
                                Timing R code with Sys.time()
                            
                                Is it possible to rotate a plot in R (base graphics)?
                            
                                What is the fastest way to calculate first two principal components in R?
                            
                                ggplot with Strings on x-Axis
                            
                                R saving the output of table() into a data frame

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

aggregating multiple columns in data.table

Tags:

dataframe

r

aggregate

data.table

Alex

People also ask

1 Answers

Alex

Recent Activity

Donate For Us