R - identify columns that contain any value from a set

Tags:

r

tidyverse

I have a data frame like this:

df <- data.frame(col1 = c(letters[1:4], "a"), col2 = 1:5, col3 = letters[10:14])
df
  col1 col2 col3
1    a    1    j
2    b    2    k
3    c    3    l
4    d    4    m
5    a    5    n

I would like to identify the columns that contain any value from the following vector:

vals <- c("a", "b", "n", "w")

A tidy solution would be awesome!

asked Dec 29 '21 19:12

tzema

2 Answers

We may use select() with where(), which keeps the columns for which the predicate returns TRUE:

library(dplyr)
df %>% 
   select(where(~ any(. %in% vals, na.rm = TRUE)))

Output:

   col1 col3
1    a    j
2    b    k
3    c    l
4    d    m
5    a    n

A similar option in base R is Filter (the \(x) lambda shorthand needs R 4.1 or later; use function(x) on older versions):

Filter(\(x) any(x %in% vals, na.rm = TRUE), df)
  col1 col3
1    a    j
2    b    k
3    c    l
4    d    m
5    a    n
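
If only the names of the matching columns are needed, rather than the subset data frame, the same test can be applied column by column. This is a minimal base R sketch, assuming the df and vals from the question; the names hits and matching_cols are made up for illustration.

```r
# Data from the question
df <- data.frame(col1 = c(letters[1:4], "a"), col2 = 1:5, col3 = letters[10:14])
vals <- c("a", "b", "n", "w")

# One logical per column: TRUE if the column holds at least one element of vals
hits <- vapply(df, function(x) any(x %in% vals), logical(1))

# Names of the columns that matched
matching_cols <- names(df)[hits]
matching_cols
```

Note that %in% compares after coercing to a common type, so a numeric column such as col2 would match if vals contained a string like "1".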

answered Oct 07 '22 23:10

akrun

Another tidyverse option is to use keep() from purrr.

library(purrr)

df %>%
  keep(~ any(.x %in% vals))
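
To also see which of the values turned up in each column, something along these lines might help. It is a sketch in plain base R (so it runs without purrr), assuming the df and vals from the question; found is a hypothetical name.

```r
# Data from the question
df <- data.frame(col1 = c(letters[1:4], "a"), col2 = 1:5, col3 = letters[10:14])
vals <- c("a", "b", "n", "w")

# For every column, keep the elements of vals that occur in it
found <- lapply(df, function(x) vals[vals %in% x])

# Drop columns with no match (a length of 0 is treated as FALSE by Filter)
found <- Filter(length, found)
found
```

The names of found are the matching columns, and each element lists the values from vals present in that column.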

answered Oct 07 '22 22:10

Adam
