Conditionally Count in dplyr

Q: How do I count observations in R?

count() lets you quickly count the unique values of one or more variables: df %>% count(a, b) is roughly equivalent to df %>% group_by(a, b) %>% summarise(n = n()).
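As a rough illustration of that equivalence, here is a minimal sketch using the sample data from the question on this page. The names by_count and by_summarise are made up for the example.

```r
library(dplyr)

# Sample data copied from the question
memberorders <- data.frame(
  MemID = c('A','A','B','B','B','C','C','D'),
  week  = c(1,2,1,4,5,1,4,1),
  value = c(10,20,10,10,2,5,30,3)
)

# Number of rows per member, using count()
by_count <- memberorders %>%
  count(MemID)

# The longer spelling: group, then summarise with n()
by_summarise <- memberorders %>%
  group_by(MemID) %>%
  summarise(n = n())

print(by_count)
print(by_summarise)
```

Both results should hold one row per MemID with the row count in a column named n. Note that count() on its own counts every row in a group; it does not apply a condition such as week <= 2.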

Tags: r, dplyr

I have some member order data that I would like to aggregate by week of order.

This is what the data looks like:

    memberorders = data.frame(MemID = c('A','A','B','B','B','C','C','D'),
                              week  = c(1,2,1,4,5,1,4,1),
                              value = c(10,20,10,10,2,5,30,3))

I'm using dplyr to group_by MemID and summarize "value" for week<=2 and week<=4 (to see how much each member ordered in weeks 1-2 and 1-4). The code I currently have is:

    MemberLTV <- memberorders %>%
      group_by(MemID) %>%
      summarize(sum2 = sum(value[week <= 2]),
                sum4 = sum(value[week <= 4]))

I'm now trying to add two more fields in summarize, count2 and count4, that would count the number of instances of each condition (week <=2 and week <=4).

The desired output is:

    output = data.frame(MemID  = c('A','B','C','D'),
                        sum2   = c(30,10,5,3),
                        sum4   = c(30,20,35,3),
                        count2 = c(2,1,1,1),
                        count4 = c(2,2,2,1))

I'm guessing it's just a little tweak of the sum function but I'm having trouble figuring it out.


asked Apr 27 '15 20:04

SFuj

1 Answer

Try

    library(dplyr)

    memberorders %>%
      group_by(MemID) %>%
      summarise(sum2   = sum(value[week <= 2]),
                sum4   = sum(value[week <= 4]),
                count2 = sum(week <= 2),
                count4 = sum(week <= 4))
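Why this works: a comparison such as week <= 2 yields a logical vector, and sum() treats TRUE as 1 and FALSE as 0, so summing the condition counts the rows that satisfy it. The sketch below runs the answer against the question's sample data; the name result is only for illustration.

```r
library(dplyr)

# Sample data copied from the question
memberorders <- data.frame(
  MemID = c('A','A','B','B','B','C','C','D'),
  week  = c(1,2,1,4,5,1,4,1),
  value = c(10,20,10,10,2,5,30,3)
)

# Summing a logical vector counts its TRUE values
sum(c(TRUE, FALSE, TRUE))  # 2

result <- memberorders %>%
  group_by(MemID) %>%
  summarise(
    sum2   = sum(value[week <= 2]),  # total value ordered in weeks 1-2
    sum4   = sum(value[week <= 4]),  # total value ordered in weeks 1-4
    count2 = sum(week <= 2),         # number of orders in weeks 1-2
    count4 = sum(week <= 4)          # number of orders in weeks 1-4
  )

print(result)
```

On this data the four columns should match the desired output in the question. If week could contain NA, these sums would come back as NA for the affected group; passing na.rm = TRUE to each sum() call is one way to handle that.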


answered Oct 05 '22 07:10

akrun
