Using R: delete rows when a value is repeated fewer than 3 times

I have a data frame with 10 rows and 3 columns:

    a   b c
1   1 201 1
2   2 202 1
3   3 203 1
4   4 204 1
5   5 205 4
6   6 206 5
7   7 207 4
8   8 208 4
9   9 209 8
10 10 210 5

I want to delete all rows whose value in column "c" occurs fewer than 3 times in that column. In this example I want to remove rows 6, 9 and 10. (My real data.frame has 5000 rows and 25 columns.) I tried to do it with the function rle, but I keep getting the wrong result. Any help? Thanks!

asked Oct 12 '10 21:10

Claudia

2 Answers

Here is a solution using ave:

Data[ave(Data$c, Data$c, FUN = length) > 2, ]

or using ave with subset:

subset(Data, ave(c, c, FUN = length) > 2)
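As a quick check, the ave approach can be run against the example from the question. This is only a minimal sketch: the data frame is retyped from the question, and the names group_size and result are introduced here purely for illustration.

```r
# Rebuild the example data frame from the question
Data <- data.frame(
  a = 1:10,
  b = 201:210,
  c = c(1, 1, 1, 1, 4, 5, 4, 4, 8, 5)
)

# For each row, ave() returns the size of the group its c value belongs to
# (group_size is an illustrative name, not part of the original answer)
group_size <- ave(Data$c, Data$c, FUN = length)

# Keep only rows whose c value occurs at least 3 times
result <- Data[group_size > 2, ]
print(result)  # rows 6, 9 and 10 are dropped
```

Because ave groups by value rather than by position, the rows with c equal to 5 are counted together even though they are not adjacent. rle counts consecutive runs only, which may be why it gave the wrong result here.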

answered Oct 07 '22 01:10

G. Grothendieck

Building on Joshua's answer:

Data[Data$c %in% names(which(table(Data$c) > 2)), ]
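Broken into steps, the one-liner above might look like the following sketch. The data frame is retyped from the question, and the intermediate names counts, frequent and result are assumptions added for readability.

```r
# Rebuild the example data frame from the question
Data <- data.frame(
  a = 1:10,
  b = 201:210,
  c = c(1, 1, 1, 1, 4, 5, 4, 4, 8, 5)
)

# Count how often each distinct value of c occurs
counts <- table(Data$c)

# Names of the values that occur at least 3 times (a character vector)
frequent <- names(which(counts > 2))

# Keep rows whose c value is among the frequent ones;
# %in% compares the numeric column against the character names by coercion
result <- Data[Data$c %in% frequent, ]
print(result)  # rows 6, 9 and 10 are dropped
```

Note that the comparison goes through character strings, so this is probably best suited to integer-like or categorical values in c.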

answered Oct 06 '22 23:10

Erik Iverson
