Create duplicate rows based on conditions in R

Tags:

I have a data.table that looks like this

dt <- data.table(ID=c("A","A","B","B"),Amount1=c(100,200,300,400),
                 Amount2=c(1500,1500,2400,2400),Dupl=c(1,0,1,0))

   ID Amount1 Amount2 Dupl
1:  A     100    1500    1
2:  A     200    1500    0
3:  B     300    2400    1
4:  B     400    2400    0

I need to duplicate each row that has a 1 in the Dupl column and replace the Amount1 value with the Amount2 value in that duplicated row. Besides that I need to give that duplicated row the value 2 in Dupl. This means it should look like this:

   ID Amount1 Amount2 Dupl
1:  A     100    1500    1
2:  A    1500    1500    2
3:  A     200    1500    0
4:  B     300    2400    1
5:  B    2400    2400    2
6:  B     400    2400    0

Any help is much appreciated! Kind regards,

Tim

744

asked Mar 10 '15 10:03

Tim_Utrecht

2 Answers

You could try

rbind(dt,dt[Dupl==1][,c('Amount1', 'Dupl') := list(Amount2, 2)])

124

answered Oct 17 '22 03:10

akrun

Using dplyr

library("data.table")
library("dplyr")

#data
dt <- data.table(ID = c("A", "A", "B", "B"),
                 Amount1 = c(100, 200, 300, 400),
                 Amount2 = c(1500, 1500, 2400, 2400),
                 Dupl = c(1, 0, 1, 0))
#result
rbind(dt,
      dt %>% 
        filter(Dupl == 1) %>% 
        mutate(Dupl = 2,
               Amount1 = Amount2))

#    ID Amount1 Amount2 Dupl
# 1:  A     100    1500    1
# 2:  A     200    1500    0
# 3:  B     300    2400    1
# 4:  B     400    2400    0
# 5:  A    1500    1500    2
# 6:  B    2400    2400    2

answered Oct 17 '22 02:10

zx8754

Related questions
                            
                                Keyboard shortcut to produce code chunk brackets in markdown in R for RStudio
                            
                                Adding points from other dataset to ggplot2
                            
                                Unlist a data frame by rows, not columns
                            
                                How is xgboost cover calculated?
                            
                                Convert a list of lists to a character vector
                            
                                Mutate with a list column function in dplyr
                            
                                How can we pass pandoc_args to yaml header in rmarkdown?
                            
                                Fill missing dates by group
                            
                                Insert images using knitr::include_graphics in a for loop
                            
                                Is it impossible to install R 4.0 on Ubuntu 18.04.4 LTS because r-base-core requires a libc6 version >= 2.29?
                            
                                Generating multidimensional data
                            
                                returning different data frames in a function - R
                            
                                Returning anonymous functions from lapply - what is going wrong?
                            
                                Error bars on stacked bar ggplot2
                            
                                Split a character vector into individual characters? (opposite of paste or stringr::str_c)
                            
                                How to match by nearest date from two data frames?
                            
                                Overlapping matches in R
                            
                                Building a box plot from all columns of data frame with column names on x in ggplot2 [duplicate]
                            
                                One-class classification with SVM in R
                            
                                How to sort a list of lists in R?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Create duplicate rows based on conditions in R

Tags:

r

duplicates

data.table

conditional

Tim_Utrecht

People also ask

2 Answers

akrun

zx8754

Recent Activity

Donate For Us