I want to create a continually increasing counter for each group, where each group is a unique combination of person and day.
This is what the data looks like:
> df
  person      date
1      0    monday
2      0   tuesday
3      1    monday
4      1    monday
5      1   tuesday
6      2    monday
7      2    monday
8      2   tuesday
9      2 wednesday
Thus, I want to add a new variable that starts at 1 and increases by 1 for each new combination of person and day.
> df
  person      date counter
1      0    monday       1
2      0   tuesday       2
3      1    monday       3
4      1    monday       3
5      1   tuesday       4
6      2    monday       5
7      2    monday       5
8      2   tuesday       6
9      2 wednesday       7
I hope that the data is clear enough. The counter continues until it reaches the end of the data set.
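For reference, one way to reconstruct this example data in R (the column types are an assumption; person could just as well be a factor or character):
df <- data.frame(
  person = c(0, 0, 1, 1, 1, 2, 2, 2, 2),
  date = c("monday", "tuesday", "monday", "monday", "tuesday",
           "monday", "monday", "tuesday", "wednesday")
)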
You can use rleid from the devel version of data.table. Instructions to install the devel version are here.
library(data.table) # v1.9.5+
setDT(df)[, counter := rleid(date)][]
#    person      date counter
# 1:      0    monday       1
# 2:      0   tuesday       2
# 3:      1    monday       3
# 4:      1    monday       3
# 5:      1   tuesday       4
# 6:      2    monday       5
# 7:      2    monday       5
# 8:      2   tuesday       6
# 9:      2 wednesday       7
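Here rleid(date) alone is enough because the data is already ordered and no two consecutive persons share the same date value. If that could happen in your real data, a small variant of the same call ties the run-length id to the full person/day combination:
setDT(df)[, counter := rleid(person, date)][]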
Or
library(dplyr)
df %>%
  # default = "" never matches a real date, so the first row starts the count at 1
  mutate(counter = cumsum(date != lag(date, default = "")))
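On dplyr 1.1.0 or later (assuming that version is available to you), consecutive_id() expresses the same run-based counter directly and covers the person/day combination in one call:
library(dplyr)
df %>%
  mutate(counter = consecutive_id(person, date))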
Base package:
df1 <- data.frame(unique(df), counter = 1:nrow(unique(df)))
merge(df, df1)
Output:
  person      date counter
1      0    monday       1
2      0   tuesday       2
3      1    monday       3
4      1    monday       3
5      1   tuesday       4
6      2    monday       5
7      2    monday       5
8      2   tuesday       6
9      2 wednesday       7
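If the rows are already ordered so that repeated person/day combinations sit next to each other (as in the example), a base sketch that keeps the original row order, rather than the sorted order merge() produces, is:
# each first occurrence of a person/date pair is TRUE; cumsum turns that into a running counter
df$counter <- cumsum(!duplicated(df[c("person", "date")]))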