filter duplicates from a data frame in r [duplicate]

Tags:

I have a dataframe with one observation per row and two observations per subject. I'd like to filter out just the rows with duplicate 'day' numbers.

ex <- data.frame('id'= rep(1:5,2), 'day'= c(1:5, 1:3,5:6))

The following code filters out just the second duplicated row, but not the first. Again, I'd like to filter out both of the duplicated rows.

ex %>% 
    group_by(id) %>% 
    filter(duplicated(day))

The following code works, but seems clunky. Does anyone have a more efficient solution?

ex %>% 
    group_by(id) %>% 
    filter(duplicated(day, fromLast = TRUE) | duplicated(day, fromLast = FALSE))

647

asked Nov 04 '16 19:11

afleishman

1 Answers

Single tidyverse pipe:

exSinglesOnly <- 
    ex %>% 
    group_by(id,day) %>% # the complete group of interest
    mutate(duplicate = n()) %>% # count number in each group
    filter(duplicate == 1) %>% # select only unique records
    select(-duplicate) # remove group count column

> exSinglesOnly
Source: local data frame [4 x 2]
Groups: id, day [4]

     id   day
  <int> <int>
1     4     4
2     5     5
3     4     5
4     5     6

164

answered Oct 19 '22 13:10

leerssej

Related questions
                            
                                R: Rolling window function with adjustable window and step-size for irregularly spaced observations
                            
                                Calculating mean of multiple matrices in R
                            
                                How to use corrplot with simple matrices
                            
                                R: data.table, set first and last value of a group to NA
                            
                                How to avoid "Error in stripchart.default(x1, ...) : invalid plotting method" error?
                            
                                How to add horizontal separator in R's heatmap.2
                            
                                microbenchmark as data frame or matrix
                            
                                Using ROracle dbWriteTable to write POSIXct back to Oracle DB
                            
                                How do I add a link to open a pdf file in a new window from my R shiny app?
                            
                                How to decode encoded polylines from OSRM and plotting route geometry?
                            
                                R: Combining Nested List Elements by Name
                            
                                Change ggplot legend title
                            
                                How can I perform a "setdiff" merge using data.table?
                            
                                Missing horizontal scroll bar in R Markdown HTML code chunks and output
                            
                                R Error: could not find function "select"
                            
                                Replace NA with 0, only in numeric columns in data.table
                            
                                Passing a column name to R tidyr spread
                            
                                Counting occurrences without modifying the original order
                            
                                stringr equivalent to grep
                            
                                Change size of hover text in Plotly

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

filter duplicates from a data frame in r [duplicate]

Tags:

r

duplicates

unique

dplyr

afleishman

People also ask

1 Answers

leerssej

Recent Activity

Donate For Us