Efficient way to find repeated runs of rows, remove, & count

Tags:

r

I have a data set with repeating rows. I want to collapse each run of consecutive repeated rows into a single row and count how long the run was, but only when the repeats are consecutive. I'm looking for an efficient way to do this and can't think of how to do it in dplyr or data.table.

MWE

    dat <- data.frame(
        x = c(6, 2, 3, 3, 3, 1, 1, 6, 5, 5, 6, 6, 5, 4),
        y = c(7, 5, 7, 7, 7, 5, 5, 7, 1, 2, 7, 7, 1, 7),
        z = c(rep(LETTERS[1:2], each=7))
    )

    ##        x     y     z
    ## 1      6     7     A
    ## 2      2     5     A
    ## 3      3     7     A
    ## 4      3     7     A
    ## 5      3     7     A
    ## 6      1     5     A
    ## 7      1     5     A
    ## 8      6     7     B
    ## 9      5     1     B
    ## 10     5     2     B
    ## 11     6     7     B
    ## 12     6     7     B
    ## 13     5     1     B
    ## 14     4     7     B

Desired output

           x     y     z   n
    1      6     7     A   1
    2      2     5     A   1
    3      3     7     A   3
    4      1     5     A   2
    5      6     7     B   1
    6      5     1     B   1
    7      5     2     B   1
    8      6     7     B   2
    9      5     1     B   1
    10     4     7     B   1

asked Apr 18 '16 01:04

Tyler Rinker

1 Answer

With data.table:

    library(data.table)
    setDT(dat)

    dat[, c(.SD[1L], .N), by=.(g = rleidv(dat))][, g := NULL]

         x y z N
     1: 6 7 A 1
     2: 2 5 A 1
     3: 3 7 A 3
     4: 1 5 A 2
     5: 6 7 B 1
     6: 5 1 B 1
     7: 5 2 B 1
     8: 6 7 B 2
     9: 5 1 B 1
    10: 4 7 B 1
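
To make the grouping step easier to follow, here is a minimal base R sketch of the same idea: give every run of identical consecutive rows its own id, keep the first row of each run, and count the rows per id. This is an illustration added for clarity, not the answer's method. The names `changed`, `is_start`, `run_id`, and `res` are made up for this example, and the sketch assumes the data frame has at least one row and contains no `NA` values.

```r
dat <- data.frame(
  x = c(6, 2, 3, 3, 3, 1, 1, 6, 5, 5, 6, 6, 5, 4),
  y = c(7, 5, 7, 7, 7, 5, 5, 7, 1, 2, 7, 7, 1, 7),
  z = rep(LETTERS[1:2], each = 7)
)

# For each column, compare every value with the one directly above it,
# then OR the columns together: TRUE means the row differs from its predecessor.
changed <- Reduce(`|`, lapply(dat, function(col) col[-1] != col[-length(col)]))

# The first row always starts a run; every changed row starts a new one.
is_start <- c(TRUE, changed)

# Running count of run starts gives each run a consecutive integer id.
run_id <- cumsum(is_start)

# Keep the first row of each run and attach the run length.
res <- dat[is_start, , drop = FALSE]
res$n <- tabulate(run_id)
rownames(res) <- NULL

print(res)
```

The `cumsum()` over run starts plays the role that the run-length id plays in the data.table call above; because only adjacent rows are compared, repeats that are separated by other rows (such as the two `6 7 B` runs) are counted separately.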

answered Oct 06 '22 06:10

Frank
