Filtering a data frame on a vector [duplicate]

Tags: dataframe, r, subset

I have a data frame df with an ID column (e.g. A, B, etc.). I also have a vector containing certain IDs:

L <- c("A", "B", "E")

How can I filter the data frame to keep only the rows whose IDs are present in the vector? For a single ID, I would use

subset(df, ID == "A")

but how do I filter on a whole vector?

388

asked Feb 19 '12 14:02

adam.888

1 Answer

You can use the %in% operator:

    > df <- data.frame(id=c(LETTERS, LETTERS), x=1:52)
    > L <- c("A","B","E")
    > subset(df, id %in% L)
       id  x
    1   A  1
    2   B  2
    5   E  5
    27  A 27
    28  B 28
    31  E 31
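As a minimal sketch of the same idea without subset(), the logical vector produced by %in% can index the data frame directly, and negating it keeps the rows whose IDs are not in L. The names keep, inside, and outside are illustrative and not part of the original answer.

```r
# Same example data as above: each letter appears twice.
df <- data.frame(id = c(LETTERS, LETTERS), x = 1:52)
L <- c("A", "B", "E")

# %in% returns one TRUE/FALSE value per row of df.
keep <- df$id %in% L

# Rows whose id is in L (the same rows that subset(df, id %in% L) returns).
inside <- df[keep, ]

# Rows whose id is NOT in L.
outside <- df[!keep, ]

nrow(inside)
nrow(outside)
```

Indexing with df[keep, ] is often preferred inside functions, since subset() evaluates its condition non-standardly.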

If your IDs are unique, you can use match():

    > df <- data.frame(id=c(LETTERS), x=1:26)
    > df[match(L, df$id), ]
      id x
    1  A 1
    2  B 2
    5  E 5
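The uniqueness requirement matters because match() returns only the position of the first match for each element of L, and NA for an element it cannot find. A small sketch of both effects, using a made-up ID "ZZ" that is not in the data (the names first_only, all_rows, L2, idx, with_na, and clean are illustrative):

```r
# IDs are duplicated here, so match() sees only the first copy of each.
df <- data.frame(id = c(LETTERS, LETTERS), x = 1:52)
L <- c("A", "B", "E")

first_only <- df[match(L, df$id), ]   # 3 rows: first occurrence of each ID
all_rows   <- df[df$id %in% L, ]      # 6 rows: every occurrence

# An ID missing from df ("ZZ" is hypothetical) gives NA from match(),
# which becomes a row of NAs when used as a row index.
L2 <- c("A", "ZZ")
idx <- match(L2, df$id)               # position of "A", then NA
with_na <- df[idx, ]                  # second row is all NA

# Dropping the NA positions first avoids the empty row.
clean <- df[idx[!is.na(idx)], ]
```

Unlike %in%, match() returns rows in the order of L, which can be useful when that order matters.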

or make them the row names of your data frame and extract by row name:

    > rownames(df) <- df$id
    > df[L, ]
      id x
    A  A 1
    B  B 2
    E  E 5

Finally, for more advanced users, and if speed is a concern, I'd recommend looking into the data.table package.
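For reference, a minimal sketch of the same filter with data.table, assuming the package is installed; the names dt, by_in, and by_join are illustrative. The first form reuses %in%; the second looks up the values of L in the id column via the on= argument.

```r
library(data.table)  # assumes the data.table package is installed

dt <- data.table(id = c(LETTERS, LETTERS), x = 1:52)
L <- c("A", "B", "E")

# Filter with %in%; columns can be referenced by name inside [ ].
by_in <- dt[id %in% L]

# Join the values of L against the id column; all matching rows are returned.
by_join <- dt[.(L), on = "id"]

nrow(by_in)
nrow(by_join)
```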

146

answered Oct 02 '22 11:10

flodel
