Aggregate data.frame for each day

Tags:

aggregate

I have a data.frame dat about car sells (Buy=0 in the data frame) and buys (Buy=1 in the data frame) of a used car seller.

  Date       Buy   Price
29-06-2015    1    5000
29-06-2015    0    8000
29-06-2015    1    10000
30-06-2015    0    3500
30-06-2015    0    12000 
...          ...  ...

What I need is a new, aggregated data.frame that gives me the number of buys and the number of sells per day together with the summed prices of all the buys and sells for that day:

  Date      Buys   Sells   Price_Buys  Price_Sells
29-06-2015    2    1         15000        8000
30-06-2015    0    2           0          15500
...          ...  ...

I tried to use aggregate(dat$Buy, by=list(Date=dat$Date, FUN=sum)). However, I am still struggling how to aggregate the sells as well.

300

asked Jan 28 '16 21:01

4 Answers

This can be done pretty cleanly in dplyr, grouping by date using group_by and then summarizing with summarize:

library(dplyr)
(out <- dat %>%
  group_by(Date) %>%
  summarize(Buys=sum(Buy == 1), Sells=sum(Buy == 0),
            Price_Buys=sum(Price[Buy == 1]), Price_Sells=sum(Price[Buy == 0])))
#         Date  Buys Sells Price_Buys Price_Sells
#       (fctr) (int) (int)      (int)       (int)
# 1 29-06-2015     2     1      15000        8000
# 2 30-06-2015     0     2          0       15500

You can now manipulate this object as you would a normal data frame, e.g. with something like:

out$newvar <- with(out, Sells*Price_Sells - Buys*Price_Buys)
out
# Source: local data frame [2 x 6]
#         Date  Buys Sells Price_Buys Price_Sells newvar
#       (fctr) (int) (int)      (int)       (int)  (int)
# 1 29-06-2015     2     1      15000        8000 -22000
# 2 30-06-2015     0     2          0       15500  31000

answered Oct 04 '22 13:10

Gopala

I would use one of the dpylr solutions myself, but I think it is still noteworthy, that it can also be done with aggregate(), since this is how you started out:

aggregate(cbind(Buys = Buy, Sells = !Buy,
                Price_Buys = Price * Buy, Price_Sells = Price * !Buy) ~ Date,
          data = dat, sum)
##         Date Buys Sells Price_Buys Price_Sells
## 1 29-06-2015    2     1      15000        8000
## 2 30-06-2015    0     2          0       15500

The idea here is to get the sales as !Buy. This will convert Buy to a logical (0 => TRUE, 1 => FALSE) and then apply the NOT-operator (!) to it. In this way, 0 is converted to 1 and 1 is converted to 0. The same trick can be used when calculating the price.

The comparison of this solution to the others should also show you, that dplyr produces much more readable code.

answered Oct 04 '22 13:10

Stibu

Related questions
                            
                                Draw 3x3 square grid in R
                            
                                Change the year in a datetime object in R?
                            
                                R: parse string to a matrix
                            
                                Converting Factor Levels to Numbers
                            
                                FUN-error after running 'tolower' while making Twitter wordcloud
                            
                                column name with brackets or other punctuations for dplyr group_by
                            
                                R: Creating a vector with a specific amount of random numbers
                            
                                ggplot2 sourcing error: X11 library is missing
                            
                                Count values higher than a certain threshold by group
                            
                                r search along a vector and calculate the mean
                            
                                Proper R Markdown Code Organization
                            
                                Test if column name contains string in R
                            
                                Removing one tableGrob when applied to a box plot with a facet_wrap
                            
                                How to delete everything after nth delimiter in R?
                            
                                How can I import SAS format files into R?
                            
                                Dynamically sorting columns in dplyr via passing ordered vector with column names to select
                            
                                Plot 2 tmap objects side-by-side
                            
                                Is there a function to recognize a word?
                            
                                How to combine two rows in R?
                            
                                Why is standard R median function so much slower than a simple C++ alternative?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Aggregate data.frame for each day

Tags:

r

aggregate

jeffrey

People also ask

4 Answers

josliber

David Arenburg

Gopala

Stibu

Recent Activity

Donate For Us