Aggregate data frame by date and apply different functions to corresponding columns?

I have the following data frame "DF" which is part of a much larger one:

             X1  X2            X3 X4 X5
4468 2010-03-24   3  1.000000e+00  1  2
7662 2010-03-24   9  3.000000e+00  2  1
1272 2010-03-25   8  2.000000e+00  1  1
1273 2010-03-26   9  0.000000e+00  1  1
1274 2010-03-27   8  0.000000e+00  1  1
4469 2010-03-28   4  0.000000e+00  1  2
7663 2010-03-28   4  3.000000e+00  3  1
8734 2010-03-28   7  4.000000e+00  2  3
1275 2010-03-29   8  0.000000e+00  1  1

As you can see, the first column contains a date. I want to transform this data frame into a new one, "DF2", with only one row per date and the following column values:

X2, the average 
X3, the sum
X4, the maximum

of all values recorded for that date. X5 is not relevant and can be removed. This would be the result:

             X1  X2            X3 X4
7662 2010-03-24   6  4.000000e+00  2  
1272 2010-03-25   8  2.000000e+00  1  
1273 2010-03-26   9  0.000000e+00  1  
1274 2010-03-27   8  0.000000e+00  1  
8734 2010-03-28   5  7.000000e+00  3  
1275 2010-03-29   8  0.000000e+00  1

Does anyone know how to accomplish this? Help would be much appreciated!

asked May 13 '13 16:05

MB123

1 Answer

DF <- read.table(text="             X1  X2            X3 X4 X5
4468 2010-03-24   3  1.000000e+00  1  2
7662 2010-03-24   9  3.000000e+00  2  1
1272 2010-03-25   8  2.000000e+00  1  1
1273 2010-03-26   9  0.000000e+00  1  1
1274 2010-03-27   8  0.000000e+00  1  1
4469 2010-03-28   4  0.000000e+00  1  2
7663 2010-03-28   4  3.000000e+00  3  1
8734 2010-03-28   7  4.000000e+00  2  3
1275 2010-03-29   8  0.000000e+00  1  1",header=TRUE)

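library(data.table)

# Convert the data frame to a data.table
DT <- as.data.table(DF)

# One row per date: mean of X2, sum of X3, max of X4 (X5 is dropped)
DT[, list(X2 = mean(X2), X3 = sum(X3), X4 = max(X4)), by = X1]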

#            X1 X2 X3 X4
# 1: 2010-03-24  6  4  2
# 2: 2010-03-25  8  2  1
# 3: 2010-03-26  9  0  1
# 4: 2010-03-27  8  0  1
# 5: 2010-03-28  5  7  3
# 6: 2010-03-29  8  0  1
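
If adding a package is not an option, the same per-date summary can probably be done in base R as well. The following is only a sketch of one possible approach, not the method used above: it rebuilds the example data with `data.frame()` (row names omitted), splits the rows by `X1`, summarises each piece, and binds the pieces back together. The name `DF2` is taken from the question.

```r
# Example data from the question, without the original row names
DF <- data.frame(
  X1 = c("2010-03-24", "2010-03-24", "2010-03-25", "2010-03-26", "2010-03-27",
         "2010-03-28", "2010-03-28", "2010-03-28", "2010-03-29"),
  X2 = c(3, 9, 8, 9, 8, 4, 4, 7, 8),
  X3 = c(1, 3, 2, 0, 0, 0, 3, 4, 0),
  X4 = c(1, 2, 1, 1, 1, 1, 3, 2, 1),
  X5 = c(2, 1, 1, 1, 1, 2, 1, 3, 1),
  stringsAsFactors = FALSE
)

# Split the rows by date, reduce each group to a single row,
# then stack the per-date rows into one data frame
per_date <- lapply(split(DF, DF$X1), function(d) {
  data.frame(
    X1 = d$X1[1],     # the date shared by all rows in this group
    X2 = mean(d$X2),  # average
    X3 = sum(d$X3),   # sum
    X4 = max(d$X4),   # maximum
    stringsAsFactors = FALSE
  )
})
DF2 <- do.call(rbind, per_date)
rownames(DF2) <- NULL  # drop the date-based row names added by rbind

DF2
```

On a much larger data frame the data.table version is likely to be faster, since this sketch builds one small data frame per date.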

answered Nov 07 '22 18:11

Roland


Tags: r, max, aggregate, sum, average
