R: show ALL rows with duplicated elements in a column [duplicate]

Tags:

r

dplyr

Does a function like this exist in any package?

isdup <- function (x) duplicated (x) | duplicated (x, fromLast = TRUE)

My intention is to use it with dplyr to display all rows with duplicated values in a given column. I need the first occurrence of the duplicated element to be shown as well.

In this data.frame for instance

dat <- as.data.frame (list (l = c ("A", "A", "B", "C"), n = 1:4))
dat

> dat
  l n
1 A 1
2 A 2
3 B 3
4 C 4

I would like to display the rows where column l is duplicated ie. those with an A value doing:

library (dplyr)
dat %>% filter (isdup (l))

returns

  l n
1 A 1
2 A 2

972

asked May 20 '16 17:05

dmontaner

1 Answers

dat %>% group_by(l) %>% filter(n() > 1)

I don't know if it exists in any package, but since you can implement it easily, I'd say just go ahead and implement it yourself.

104

answered Sep 20 '22 13:09

Nick Larsen

Related questions
                            
                                How to arrange column in heatmap.2() based on a predefined order
                            
                                Different results with formula and non-formula for caret training
                            
                                Split a column by group [duplicate]
                            
                                Count common words in two strings
                            
                                How to melt R data.frame and plot group by bar plot
                            
                                R: How to find non-sequential elements in an array
                            
                                convert dplyr join syntax into pure data.table syntax
                            
                                Using gsub adding new column in a data.table
                            
                                Creating new shape palettes in ggplot2 and other R graphics
                            
                                How to enforce stack ordering in ggplot geom_area
                            
                                Why does the ngrams() function give distinct bigrams?
                            
                                Need to plot a curve with standard error in R
                            
                                How can one list pairs of perfectly collinear numeric vectors in a data.frame?
                            
                                purrr map a t.test onto a split df
                            
                                summarize groups into intervals using dplyr
                            
                                How to order a data.frame based on row.names in another data frame?
                            
                                R - Compute Cross Product of Vectors (Physics)
                            
                                Getting a matrix ordered
                            
                                diff on data.table column
                            
                                Unable to append to SQL Server table using sqlSave in R

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With