R- Calculate a count of items over time using start and end dates

Tags:

I want to calculate a count of items over time using their Start and End dates.

Some sample data

START <- as.Date(c("2014-01-01", "2014-01-02","2014-01-03","2014-01-03"))
END <- as.Date(c("2014-01-04", "2014-01-03","2014-01-03","2014-01-04"))
df <- data.frame(START,END)
df

gives

       START        END
1 2014-01-01 2014-01-04
2 2014-01-02 2014-01-03
3 2014-01-03 2014-01-03
4 2014-01-03 2014-01-04

A table showing a count of these items across time (based on their Start and End times) is as follows:

DATETIME    COUNT
2014-01-01   1 
2014-01-02   2 
2014-01-03   4 
2014-01-04   2

Can this be done using R, especially using dplyr? Many thanks.

752

asked Oct 10 '14 00:10

Dave M

1 Answers

This would do it. You can change the column names as necessary.

as.data.frame(table(Reduce(c, Map(seq, df$START, df$END, by = 1))))
#         Var1 Freq
# 1 2014-01-01    1
# 2 2014-01-02    2
# 3 2014-01-03    4
# 4 2014-01-04    2

As noted in the comments, Var1 in the above solution is now a factor, and not a date. To keep the date class in the first column, you could do some more work to the above solution, or use plyr::count instead of as.data.frame(table(...))

library(plyr)
count(Reduce(c, Map(seq, df$START, df$END, by = 1)))
#            x freq
# 1 2014-01-01    1
# 2 2014-01-02    2
# 3 2014-01-03    4
# 4 2014-01-04    2

188

answered Nov 04 '22 15:11

Rich Scriven

Related questions
                            
                                How to write and read printable ASCII characters to/from UTF-8 encoding file?
                            
                                Imputation MICE in R still NA left in dataset
                            
                                irregular list of lists to dataframe
                            
                                Import all txt files in folder, concatenate into data frame, use file names as variable in R?
                            
                                Replace integer(0) by NA
                            
                                subsetting data to first occurrence in R
                            
                                Default linetypes in ggplot2?
                            
                                object.size() reports smaller size than .Rdata file
                            
                                Add ggplot annotation outside the panel? Or two titles?
                            
                                Creating a matrix from multiple column vectors
                            
                                `data.table` global search - filter rows given pattern match in `any` column
                            
                                Way to extract data from lm-object before function is applied?
                            
                                Shiny's tabsetPanel not displaying plots in multiple tabs
                            
                                Distance between vectors with missing values
                            
                                Slow data.frame row assignation
                            
                                R shiny display formula
                            
                                Grouped barplot with cut y axis
                            
                                R - Removing tick mark without removing label
                            
                                Passing user specifications as arguments to dplyr within Shiny
                            
                                Calling C code from an R package, within C

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

R- Calculate a count of items over time using start and end dates

Tags:

r

dplyr

duration

Dave M

People also ask

1 Answers

Rich Scriven

Recent Activity

Donate For Us