Filter data frame rows based on values in vector

Tags: dataframe, r, subset

What is the best way to filter rows from a data frame when the values to be deleted are stored in a vector? In my case I have a column with dates and want to remove several dates.

I know how to delete rows corresponding to one day, using !=, e.g.:

m[m$date != "01/31/11", ]

To remove several dates, specified in a vector, I tried:

m[m$date != c("01/31/11", "01/30/11"), ]

However, this generates a warning message:

Warning message:
In `!=.default`(m$date, c("01/31/11", "01/30/11")) :
longer object length is not a multiple of shorter object length
Calls: [ ... [.data.frame -> Ops.dates -> NextMethod -> Ops.times -> NextMethod

What is the correct way to apply a filter based on multiple values?
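
For context, the warning comes from R's recycling rule: `!=` compares element by element and repeats the shorter vector, so it never asks whether a date appears anywhere in the list. The sketch below contrasts that with `%in%`. The data frame contents and the names `drop`, `elementwise`, and `membership` are made up for illustration, and the dates are plain character strings rather than the chron objects suggested by the traceback above.

```r
# Hypothetical data: dates stored as character strings for simplicity
m <- data.frame(date = c("01/30/11", "01/31/11", "01/29/11"),
                value = c(1, 2, 3),
                stringsAsFactors = FALSE)
drop <- c("01/31/11", "01/30/11")

# `!=` pairs elements positionally, recycling `drop`:
#   "01/30/11" vs drop[1], "01/31/11" vs drop[2], "01/29/11" vs drop[1]
# The length warning (3 is not a multiple of 2) is suppressed here
# only to keep the output clean.
elementwise <- suppressWarnings(m$date != drop)

# %in% tests each date for membership in the whole of `drop`
membership <- m$date %in% drop

print(elementwise)
print(membership)
```

In this made-up case the positional comparison flags no rows for removal even though two of the dates are in `drop`, which is why the warning should not be ignored.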

709

asked Sep 21 '11 05:09

matt_k

1 Answer

nzcoops is spot on with his suggestion. I posed this question in the R Chat a while back and Paul Teetor suggested defining a new function:

`%notin%` <- function(x, y) !(x %in% y)

Which can then be used as follows:

> foo <- letters[1:6]
> foo[foo %notin% c("a", "c", "e")]
[1] "b" "d" "f"

Needless to say, this little gem is now in my R profile and gets used quite often.
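
Applied to the original question, the same operator can be used inside the row index of a data frame. This is a minimal sketch, assuming the dates are stored as character strings. The data frame contents and the names `dates_to_remove` and `kept` are hypothetical. The sketch does not cover chron date objects like those in the question's traceback, where both sides would likely need to be the same type first.

```r
# Negated membership test, as defined in the answer above
`%notin%` <- function(x, y) !(x %in% y)

# Hypothetical data frame with a character date column
m <- data.frame(date = c("01/29/11", "01/30/11", "01/31/11", "02/01/11"),
                value = 1:4,
                stringsAsFactors = FALSE)

# Dates whose rows should be dropped
dates_to_remove <- c("01/31/11", "01/30/11")

# Keep only the rows whose date is not in the removal vector
kept <- m[m$date %notin% dates_to_remove, ]
print(kept)
```

Writing `m[!(m$date %in% dates_to_remove), ]` gives the same result without defining a new operator.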

176

answered Oct 07 '22 21:10

Chase
