Exclude subsequent duplicated rows

Tags:

I would like to exclude all duplicated rows. However, it has to be true just when they are subsequent rows. Follows a representative example:

My input df:

    df <- "NAME   VALUE 
    Prb1  0.05
    Prb2  0.05
    Prb3  0.05
    Prb4  0.06
    Prb5  0.06
    Prb6  0.01
    Prb7  0.10
    Prb8  0.05"

df <- read.table(text=df, header=T)

My expected outdf:

outdf <- "NAME   VALUE 
Prb1  0.05
Prb4  0.06
Prb6  0.01
Prb7  0.10
Prb8  0.05"

outdf <- read.table(text=df, header=T)

766

asked May 15 '15 13:05

user2120870

1 Answers

rle() is a nice function that identifies runs of identical values, but it can be kind of a pain to wrestle it's output into a usable form. Here's a relatively painless incantation that works in your case.

df[sequence(rle(df$VALUE)$lengths) == 1, ]
#   NAME VALUE
# 1 Prb1  0.05
# 4 Prb4  0.06
# 6 Prb6  0.01
# 7 Prb7  0.10
# 8 Prb8  0.05

answered Oct 27 '22 09:10

Josh O'Brien

Related questions
                            
                                how to realize countifs function (excel) in R
                            
                                How to plot a contour line showing where 95% of values fall within, in R and in ggplot2
                            
                                How to remove groups of observation with dplyr::filter()
                            
                                Detect multiple strings with dplyr and stringr
                            
                                "Density" curve overlay on histogram where vertical axis is frequency (aka count) or relative frequency?
                            
                                R : Check if R object exists before creating it
                            
                                calculate median from data.table columns in R
                            
                                R - Autofit Excel column width
                            
                                can lapply not modify variables in a higher scope
                            
                                Existing function for seeing if a row exists in a data frame?
                            
                                How to get R plot window size?
                            
                                How to prevent regmatches drop non matches?
                            
                                Change font size of titles from facet_wrap
                            
                                How to rbind only the common columns of two data sets
                            
                                How to retrieve the most repeated value in a column present in a data frame
                            
                                Pretty axis labels for log scale in ggplot
                            
                                Keep column name when select one column from a data frame/matrix in R
                            
                                Plot the equivalent of correlation matrix for factors (categorical data)? And mixed types?
                            
                                cumulative plot using ggplot2
                            
                                Row-wise variance of a matrix in R

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Exclude subsequent duplicated rows

Tags:

r

conditional

duplicate-data

user2120870

People also ask

1 Answers

Josh O'Brien

Recent Activity

Donate For Us