Increment by one to each duplicate value

Tags: r, duplicates, sequence

I am trying to find a proper way, in R, to find duplicated values within each id and make each subsequent duplicate one larger than the one before it (the first occurrence gets +0, the second +1, the third +2, and so on). For example:

library(data.table)
library(dplyr)  # provides the lag() used below

data = data.table(id = c('1','1','1','1','1','2','2','2'),
                  value = c(95,100,101,101,101,20,35,38))

data$new_value <- ifelse(data$value == lag(data$value, 1),
                         lag(data$value, 1) + 1, data$value)
data$desired_value <- c(95,100,101,102,103,20,35,38)

Produces:

   id value new_value desired_value
1:  1    95        NA            95
2:  1   100       100           100
3:  1   101       101           101 # first 101 in id 1: add 0
4:  1   101       102           102 # second 101 in id 1: add 1
5:  1   101       102           103 # third 101 in id 1: add 2
6:  2    20        20            20
7:  2    35        35            35
8:  2    38        38            38

I tried doing this with ifelse, but it doesn't work recursively: each row is compared with the original previous value, so only the first repeat gets incremented and any later repeats do not. Also, lag returns NA for the first row, so I lose the first value in value.
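To make the two symptoms above concrete, here is a minimal base R sketch of the same idea on the first id only. The names prev and attempt are just illustrative, and the hand-built lag is an assumption standing in for the lag() call in the question.

```r
value <- c(95, 100, 101, 101, 101)

# One-step lag built by hand (base R stand-in for the lag() call above)
prev <- c(NA, head(value, -1))

# Guarding against the leading NA keeps the first value instead of losing it
attempt <- ifelse(!is.na(prev) & value == prev, prev + 1, value)
attempt
# 95 100 101 102 102
```

The NA guard fixes the first row, but the third 101 still ends up as 102: ifelse is vectorised over the original column, so it never sees the already-incremented previous row.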

I've seen examples with character variables with make.names or make.unique, but haven't been able to find a solution for a duplicated numeric value.

Background: I am doing a survival analysis and I am finding that my data contains identical stop times, so I need to make them unique by adding 1 to each repeat (stop times are in seconds).

asked Apr 04 '17 01:04

Daren Eiri

2 Answers

Here's an attempt. You're essentially grouping by id and value and adding 0:(length(value)-1). So:

data[, onemore := value + (0:(.N-1)), by=.(id, value)]

#   id value new_value desired_value onemore
#1:  1    95        NA            95      95
#2:  1   100       100           100     100
#3:  1   101       101           101     101
#4:  1   101       102           102     102
#5:  1   101       102           103     103
#6:  2    20        20            20      20
#7:  2    35        35            35      35
#8:  2    38        38            38      38
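As a self-contained sketch of the same grouping idea, the within-group counter can also be written with data.table's rowid(), which numbers rows 1, 2, 3, ... inside each (id, value) combination. The table name dt is just an assumption so the example does not overwrite the data object above.

```r
library(data.table)

dt <- data.table(id = c('1','1','1','1','1','2','2','2'),
                 value = c(95, 100, 101, 101, 101, 20, 35, 38))

# rowid(id, value) is 1 for the first occurrence of a value within an id,
# 2 for the second, and so on; subtracting 1 gives the offset to add
dt[, onemore := value + rowid(id, value) - 1L]
dt$onemore
# 95 100 101 102 103  20  35  38
```

This should give the same column as the by=.(id, value) version; which one reads better is mostly a matter of taste.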

answered Sep 18 '22 23:09

thelatemail

With base R we can use ave, grouping by id and value: within each group we take the first value and add the zero-based position of each row in that group (row number minus 1).

data$value1 <- ave(data$value, data$id, data$value,
                   FUN = function(x) x[1] + seq_along(x) - 1)

#   id value new_value desired_value value1
#1:  1    95        NA            95     95
#2:  1   100       100           100    100
#3:  1   101       101           101    101
#4:  1   101       102           102    102
#5:  1   101       102           103    103
#6:  2    20        20            20     20
#7:  2    35        35            35     35
#8:  2    38        38            38     38

answered Sep 21 '22 23:09

Ronak Shah
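One caveat for the survival-analysis background: both answers offset duplicates within each (id, value) group, so a shifted value can collide with a later original value. For example, stop times 101, 101, 102 would become 101, 102, 102. The following is only a sketch of one possible workaround, not something taken from either answer. It assumes the values are already sorted in ascending order within each id, and bump_ties is a hypothetical helper name.

```r
# Hypothetical helper: walk through x in order and force every element to be
# at least 1 larger than the (already adjusted) element before it.
# Assumes x is sorted ascending.
bump_ties <- function(x) {
  Reduce(function(prev, cur) max(cur, prev + 1), x, accumulate = TRUE)
}

id        <- c('1', '1', '1', '1', '2', '2')
stop_time <- c(101, 101, 102, 110, 20, 20)

# Apply the helper separately within each id
unique_stop <- ave(stop_time, id, FUN = bump_ties)
unique_stop
# 101 102 103 110  20  21
```

Unlike the grouped offset, this carries the adjustment forward, so the result is strictly increasing within each id. The trade-off is that it may shift values that were not duplicates to begin with (the 102 above becomes 103).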
