Say I have a <code>data.frame</code> object: <pre class="prettyprint"><code>df <- data.frame(name=c('black','black','black','red','red'), type=c('chair','chair','sofa','sofa','plate'), num=c(4,5,12,4,3)) </code></pre> Now I want to count the number of rows (observations) of for each combination of <code>name</code> and <code>type</code>. This can be done like so: <pre class="prettyprint"><code>table(df[ , c("name","type")]) </code></pre> or possibly also with <code>plyr</code>, (though I am not sure how). However, how do I get the results incorporated into the original data frame? So that the results will look like this: <pre class="prettyprint"><code>df # name type num count # 1 black chair 4 2 # 2 black chair 5 2 # 3 black sofa 12 1 # 4 red sofa 4 1 # 5 red plate 3 1 </code></pre> where <code>count</code> now stores the results from the aggregation. A solution with <code>plyr</code> could be interesting to learn as well, though I would like to see how this is done with base R.

Using <code>data.table</code>: <pre class="prettyprint"><code>library(data.table) dt = as.data.table(df) # or coerce to data.table by reference: # setDT(df) dt[ , count := .N, by = .(name, type)] </code></pre> For pre-<code>data.table 1.8.2</code> alternative, see edit history. <hr> Using <code>dplyr</code>: <pre class="prettyprint"><code>library(dplyr) df %>% group_by(name, type) %>% mutate(count = n()) </code></pre> Or simply: <pre class="prettyprint"><code>add_count(df, name, type) </code></pre> <hr> Using <code>plyr</code>: <pre class="prettyprint"><code>plyr::ddply(df, .(name, type), transform, count = length(num)) </code></pre>

Count number of rows per group and add result to original data frame

Tags:

r

r-faq

count

aggregate

Say I have a data.frame object:

df <- data.frame(name=c('black','black','black','red','red'),                  type=c('chair','chair','sofa','sofa','plate'),                  num=c(4,5,12,4,3))

Now I want to count the number of rows (observations) of for each combination of name and type. This can be done like so:

table(df[ , c("name","type")])

or possibly also with plyr, (though I am not sure how).

However, how do I get the results incorporated into the original data frame? So that the results will look like this:

df #    name  type num count # 1 black chair   4     2 # 2 black chair   5     2 # 3 black  sofa  12     1 # 4   red  sofa   4     1 # 5   red plate   3     1

where count now stores the results from the aggregation.

A solution with plyr could be interesting to learn as well, though I would like to see how this is done with base R.

578

asked Sep 16 '11 21:09

Uri Laserson

1 Answers

Using data.table:

library(data.table) dt = as.data.table(df)  # or coerce to data.table by reference: # setDT(df)  dt[ , count := .N, by = .(name, type)]

For pre-data.table 1.8.2 alternative, see edit history.

Using dplyr:

library(dplyr) df %>%   group_by(name, type) %>%   mutate(count = n())

Or simply:

add_count(df, name, type)

Using plyr:

plyr::ddply(df, .(name, type), transform, count = length(num))

145

answered Oct 03 '22 01:10

Ramnath

Related questions
                            
                                Extract Month and Year From Date in R
                            
                                How to suppress warnings when plotting with ggplot
                            
                                Transparent equivalent of given color
                            
                                Aggregating by unique identifier and concatenating related values into a string [duplicate]
                            
                                Replace all values in a matrix <0.1 with 0
                            
                                Scale a series between two points
                            
                                How to add new column to an dataframe (to the front not end)?
                            
                                How to sort a character vector where elements contain letters and numbers in R?
                            
                                Seeing if data is normally distributed in R
                            
                                Offline install of R package and dependencies
                            
                                R: numeric 'envir' arg not of length one in predict()
                            
                                Meaning of ~. (tilde dot) argument?
                            
                                Force character vector encoding from "unknown" to "UTF-8" in R
                            
                                R + combine a list of vectors into a single vector
                            
                                How do you print to stderr in R?
                            
                                Overlay histogram with density curve
                            
                                Combine column to remove NA's
                            
                                How to cbind or rbind different lengths vectors without repeating the elements of the shorter vectors?
                            
                                Putting mathematical symbols and subscripts mixed with regular letters [duplicate]
                            
                                making matplotlib graphs look like R by default?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With