Subset dataframe by multiple logical conditions of rows to remove

Tags: dataframe, r, subset

I would like to subset (filter) a dataframe by specifying which rows not (!) to keep in the new dataframe. Here is a simplified sample dataframe:

data
v1 v2 v3 v4
a  v  d  c
a  v  d  d
b  n  p  g
b  d  d  h    
c  k  d  c    
c  r  p  g
d  v  d  x
d  v  d  c
e  v  d  b
e  v  d  c

For example, if a row of column v1 has a "b", "d", or "e", I want to get rid of that row of observations, producing the following dataframe:

v1 v2 v3 v4
a  v  d  c
a  v  d  d
c  k  d  c    
c  r  p  g

I have been successful at subsetting based on one condition at a time. For example, here I remove rows where v1 contains a "b":

sub.data <- data[data[ , 1] != "b", ]

However, I have many, many such conditions, so doing it one at a time is not desirable. I have not been successful with the following:

sub.data <- data[data[ , 1] != c("b", "d", "e"), ]

sub.data <- subset(data, data[ , 1] != c("b", "d", "e"))

I've tried some other things as well, like !%in%, but that doesn't seem to exist. Any ideas?

asked Jun 05 '11 16:06

3 Answers

Try negating the whole %in% test:

subset(data, !(v1 %in% c("b","d","e")))
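
For anyone who wants to run this, here is a minimal sketch. The data.frame() call is my reconstruction of the sample data in the question, and to_drop is just a name I picked for the values to exclude. It shows the subset() call next to the equivalent bracket indexing, and also why the != c(...) attempt fails: R compares the two vectors position by position, recycling the shorter one, rather than testing membership.

```r
# Sample data reconstructed from the question (kept as character columns)
data <- data.frame(
  v1 = c("a", "a", "b", "b", "c", "c", "d", "d", "e", "e"),
  v2 = c("v", "v", "n", "d", "k", "r", "v", "v", "v", "v"),
  v3 = c("d", "d", "p", "d", "d", "p", "d", "d", "d", "d"),
  v4 = c("c", "d", "g", "h", "c", "g", "x", "c", "b", "c"),
  stringsAsFactors = FALSE
)

# Values of v1 whose rows should be removed (name chosen for illustration)
to_drop <- c("b", "d", "e")

# Base R has no "not in" operator, so negate the result of %in%
sub.data <- subset(data, !(v1 %in% to_drop))

# Equivalent bracket indexing
sub.data2 <- data[!(data$v1 %in% to_drop), ]

# The failed attempt: to_drop is recycled against v1 element by element.
# R warns here because 10 is not a multiple of 3; suppressed for the demo.
recycled <- suppressWarnings(data$v1 != to_drop)
wrong <- data[recycled, ]
```

Here sub.data and sub.data2 contain only the "a" and "c" rows, while wrong still contains some "b", "d" and "e" rows.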

answered Oct 12 '22 10:10

Andrie

You can also do this by writing one comparison per value and joining the comparisons with &.

subset(my.df, my.df$v1 != "b" & my.df$v1 != "d" & my.df$v1 != "e")

This is not elegant and takes more code but might be more readable to newer R users. As pointed out in a comment above, subset is a "convenience" function that is best used when working interactively.
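
If the list of values grows, the chained form can be built instead of typed out. This is only a sketch under my own assumptions: my.df here is a cut-down stand-in with just the v1 column from the question, and to_drop and keep are illustrative names. It uses bracket indexing rather than subset, which tends to be the safer choice inside scripts and functions.

```r
# Stand-in data frame holding only the v1 column from the question
my.df <- data.frame(
  v1 = c("a", "a", "b", "b", "c", "c", "d", "d", "e", "e"),
  stringsAsFactors = FALSE
)

# Values to exclude (illustrative name)
to_drop <- c("b", "d", "e")

# One v1 != value test per value, combined with & like the chained version
keep <- Reduce(`&`, lapply(to_drop, function(value) my.df$v1 != value))

# Bracket indexing; drop = FALSE keeps a data frame even with one column
sub.df <- my.df[keep, , drop = FALSE]
```

For this data, keep matches !(my.df$v1 %in% to_drop). One difference to be aware of: if v1 contains NA, the != tests return NA for those rows, whereas %in% returns FALSE.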

answered Oct 12 '22 10:10

N Brouwer
