Aggregating sequential and grouped data in R

Tags:

I have a dataset that looks like this toy example. The data describes the location a person has moved to and the time since this relocation happened. For example, person 1 started out in a rural area, but moved to a city 463 days ago (2nd row), and 415 days ago he moved from this city to a town (3rd row), etc.

set.seed(123)
df <- as.data.frame(sample.int(1000, 10))
colnames(df) <- "time"
df$destination <- as.factor(sample(c("city", "town", "rural"), size = 10, replace = TRUE, prob = c(.50, .25, .25)))
df$user <- sample.int(3, 10, replace = TRUE)
df[order(df[,"user"], -df[,"time"]), ]

The data:

time destination user
 526       rural    1
 463        city    1
 415        town    1
 299        city    1
 179       rural    1
 938        town    2
 229        town    2
 118        city    2
 818        city    3
 195        city    3

I wish to aggregate this data to the format below. That is, to count the types of relocations for each user, and sum it up to one matrix. How do I achieve this (preferably without writing loops)?

from  to     count
city  city   1
city  town   1
city  rural  1
town  city   2
town  town   1
town  rural  0
rural city   1
rural town   0
rural rural  0

466

asked Jul 22 '21 19:07

Joshua

1 Answers

One possible way based on data.table package:

library(data.table)

cases <- unique(df$destination)

setDT(df)[, .(from=destination, to=shift(destination, -1)), by=user
          ][CJ(from=cases, to=cases), .(count=.N), by=.EACHI, on=c("from", "to")]


#      from     to count
#    <char> <char> <int>
# 1:   city   city     1
# 2:   city  rural     1
# 3:   city   town     1
# 4:  rural   city     1
# 5:  rural  rural     0
# 6:  rural   town     0
# 7:   town   city     2
# 8:   town  rural     0
# 9:   town   town     1

137

answered Sep 27 '22 21:09

B. Christian Kamgang

Related questions
                            
                                how to plot the linear regression in R?
                            
                                Replace missing value with previous value [duplicate]
                            
                                reading a .tif file in R [closed]
                            
                                How to get Euler–Mascheroni's constant in R?
                            
                                Keyboard shortcut to empty workspace/environment in RStudio
                            
                                R draw heatmap with clusters, but hide dendrogram
                            
                                how to use %dopar% when only import foreach in DESCRIPTION of a package
                            
                                Plotting a raster with the color ramp diverging around zero
                            
                                Extract names of dataframes passed with dots
                            
                                Index unique values in data.table
                            
                                R 3d array to 2d matrix
                            
                                Replace values from another dataframe by IDs
                            
                                ggrepel remove line around labels
                            
                                How to add Latex code in ggplot2 legend labels?
                            
                                Use lubridate to edit year within dplyr chain
                            
                                Adding an image to Shiny action button
                            
                                How to apply a function to a subset of data.table using by and exposing all columns to the function?
                            
                                With min() in R return NA instead of Inf
                            
                                R Markdown PowerPoint Slide Customization
                            
                                Produce an inset in each facet of an R ggplot while preserving colours of the original facet content

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Aggregating sequential and grouped data in R

Tags:

dataframe

r

grouping

Joshua

People also ask

1 Answers

B. Christian Kamgang

Recent Activity

Donate For Us