After searching for a while, I know that this question has not been answered yet. Assume that I have the following vector <code>v <- c("a", "b", "b", "c","c","c", "d", "d", "d", "d")</code> How do I find those values having more than 1 duplicates (should be <code>"c","c","c", "d", "d", "d", "d")</code> and more than 2 duplicates (should be <code>"d", "d", "d", "d"</code>) Function <code>duplicated(v)</code> only returns values having duplicates.

You can generate a <code>table()</code> and then check which elements of <code>v</code> are part of the relevant subset of the table, e.g. <pre class="prettyprint"><code>R> v <- c("a", "b", "b", "c","c","c", "d", "d", "d", "d") R> tab <- table(v) R> tab v a b c d 1 2 3 4 R> v[v %in% names(tab[tab > 2])] [1] "c" "c" "c" "d" "d" "d" "d" R> v[v %in% names(tab[tab > 3])] [1] "d" "d" "d" "d" </code></pre>

Multiple duplicates (2 times, 3 times,...) in R

1 Answers

You can generate a table() and then check which elements of v are part of the relevant subset of the table, e.g.

R> v <- c("a", "b", "b", "c","c","c", "d", "d", "d", "d")
R> tab <- table(v)
R> tab
v
a b c d 
1 2 3 4 
R> v[v %in% names(tab[tab > 2])]
[1] "c" "c" "c" "d" "d" "d" "d"
R> v[v %in% names(tab[tab > 3])]
[1] "d" "d" "d" "d"

answered Sep 21 '22 17:09

Achim Zeileis

Related questions
                            
                                How to calculate the predicted probability of negative binomial regression model?
                            
                                Getting all combinations which sum up to 100 using R
                            
                                Match and replace many values in data.table
                            
                                Plotting one scatterplot with multiple dataframes with ggplot in python
                            
                                How do you get geom_map to show all parts of a map?
                            
                                get online data every hour in R
                            
                                roxygen2: Issue with exporting print method
                            
                                Sudden "unused argument" error
                            
                                Continuous colour of geom_line according to y value
                            
                                How to change plot title in R when the package already uses an existing title?
                            
                                How to perform lm.ridge summary?
                            
                                Get tick break positions in ggplot
                            
                                Splitting vector based on vector of chunk-lengths
                            
                                Reorganizing a unique (NYC MTA turnstile) dataset in R
                            
                                Error in R (mice package), too many weights
                            
                                How to source R code without overwriting current variables?
                            
                                How to speed up or vectorize a for loop?
                            
                                R: Convert list with different number of rows to data.frame
                            
                                How to convert vector of characters to corpus input for the DocumentTermMatrix function from tm package in R?
                            
                                ggplot2: More complex faceting

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Multiple duplicates (2 times, 3 times,...) in R

Tags:

r

duplicates

duplicate-data

Duy Bui

People also ask

1 Answers

Achim Zeileis

Recent Activity

Donate For Us