I want to filter a dataframe using a field which is defined in a variable, to select a value that is also in a variable. Say I have <pre class="prettyprint"><code>df <- data.frame(V=c(6, 1, 5, 3, 2), Unhappy=c("N", "Y", "Y", "Y", "N")) fld <- "Unhappy" sval <- "Y" </code></pre> The value I want would be <code>df[df$Unhappy == "Y", ]</code>. I've read the <code>nse</code> vignette to try use <code>filter_</code> but can't quite understand it. I tried <pre class="prettyprint"><code>df %>% filter_(.dots = ~ fld == sval) </code></pre> which returned nothing. I got what I wanted with <pre class="prettyprint"><code>df %>% filter_(.dots = ~ Unhappy == sval) </code></pre> but obviously that defeats the purpose of having a variable to store the field name. Any clues please? Eventually I want to use this where <code>fld</code> is a vector of field names and <code>sval</code> is a vector of filter values for each field in <code>fld</code>.

You can try with <code>interp</code> from <code>lazyeval</code> <pre class="prettyprint"><code> library(lazyeval) library(dplyr) df %>% filter_(interp(~v==sval, v=as.name(fld))) # V Unhappy #1 1 Y #2 5 Y #3 3 Y </code></pre> For multiple key/value pairs, I found this to be working but I think a better way should be there. <pre class="prettyprint"><code> df1 %>% filter_(interp(~v==sval1[1] & y ==sval1[2], .values=list(v=as.name(fld1[1]), y= as.name(fld1[2])))) # V Unhappy Col2 #1 1 Y B #2 5 Y B </code></pre> For these cases, I find the <code>base R</code> option to be easier. For example, if we are trying to <code>filter</code> the rows based on the 'key' variables in 'fld1' with corresponding values in 'sval1', one option is using <code>Map</code>. We subset the dataset (<code>df1[fld1]</code>) and apply the FUN (<code>==</code>) to each column of <code>df1[f1d1]</code> with corresponding value in 'sval1' and use the <code>&</code> with <code>Reduce</code> to get a logical vector that can be used to <code>filter</code> the rows of 'df1'. <pre class="prettyprint"><code> df1[Reduce(`&`, Map(`==`, df1[fld1],sval1)),] # V Unhappy Col2 # 2 1 Y B #3 5 Y B </code></pre> <h3>data</h3> <pre class="prettyprint"><code>df1 <- cbind(df, Col2= c("A", "B", "B", "C", "A")) fld1 <- c(fld, 'Col2') sval1 <- c(sval, 'B') </code></pre>

Now, with <code>rlang</code> 0.4.0, it introduces a new more intuitive way for this type of use case: <pre class="prettyprint"><code>packageVersion("rlang") # [1] ‘0.4.0’ df <- data.frame(V=c(6, 1, 5, 3, 2), Unhappy=c("N", "Y", "Y", "Y", "N")) fld <- "Unhappy" sval <- "Y" df %>% filter(.data[[fld]]==sval) #OR filter_col_val <- function(df, fld, sval) { df %>% filter({{fld}}==sval) } filter_col_val(df, Unhappy, "Y") </code></pre> More information can be found at https://www.tidyverse.org/articles/2019/06/rlang-0-4-0/ Previous Answer With dplyr 0.6.0 and later, this code works: <pre class="prettyprint"><code>packageVersion("dplyr") # [1] ‘0.7.1’ df <- data.frame(V=c(6, 1, 5, 3, 2), Unhappy=c("N", "Y", "Y", "Y", "N")) fld <- "Unhappy" sval <- "Y" df %>% filter(UQ(rlang::sym(fld))==sval) #OR df %>% filter((!!rlang::sym(fld))==sval) #OR fld <- quo(Unhappy) sval <- "Y" df %>% filter(UQ(fld)==sval) </code></pre> More about the <code>dplyr</code> syntax available at http://dplyr.tidyverse.org/articles/programming.html and the quosure usage in the <code>rlang</code> package https://cran.r-project.org/web/packages/rlang/index.html . If you find it challenging mastering non-standard evaluation in dplyr 0.6+, Alex Hayes has an excellent writing-up on the topic: https://www.alexpghayes.com/blog/gentle-tidy-eval-with-examples/ Original Answer With dplyr version 0.5.0 and later, it is possible to use a simpler syntax and gets closer to the syntax @Ricky originally wanted, which I also find more readable than using <code>lazyeval::interp</code> <pre class="prettyprint"><code>df %>% filter_(.dots = paste0(fld, "=='", sval, "'")) # V Unhappy #1 1 Y #2 5 Y #3 3 Y #OR df %>% filter_(.dots = glue::glue("{fld}=='{sval}'")) </code></pre>

Using filter_ in dplyr where both field and value are in variables

Tags:

r

dplyr

I want to filter a dataframe using a field which is defined in a variable, to select a value that is also in a variable. Say I have

df <- data.frame(V=c(6, 1, 5, 3, 2), Unhappy=c("N", "Y", "Y", "Y", "N"))
fld <- "Unhappy"
sval <- "Y"

The value I want would be df[df$Unhappy == "Y", ].

I've read the nse vignette to try use filter_ but can't quite understand it. I tried

df %>% filter_(.dots = ~ fld == sval)

which returned nothing. I got what I wanted with

df %>% filter_(.dots = ~ Unhappy == sval)

but obviously that defeats the purpose of having a variable to store the field name. Any clues please? Eventually I want to use this where fld is a vector of field names and sval is a vector of filter values for each field in fld.

725

asked Aug 01 '15 09:08

Ricky

3 Answers

You can try with interp from lazyeval

 library(lazyeval)
 library(dplyr)
 df %>%
     filter_(interp(~v==sval, v=as.name(fld)))
 #   V Unhappy
 #1 1       Y
 #2 5       Y
 #3 3       Y

For multiple key/value pairs, I found this to be working but I think a better way should be there.

  df1 %>% 
    filter_(interp(~v==sval1[1] & y ==sval1[2], 
           .values=list(v=as.name(fld1[1]), y= as.name(fld1[2]))))
 #  V Unhappy Col2
 #1 1       Y    B
 #2 5       Y    B

For these cases, I find the base R option to be easier. For example, if we are trying to filter the rows based on the 'key' variables in 'fld1' with corresponding values in 'sval1', one option is using Map. We subset the dataset (df1[fld1]) and apply the FUN (==) to each column of df1[f1d1] with corresponding value in 'sval1' and use the & with Reduce to get a logical vector that can be used to filter the rows of 'df1'.

 df1[Reduce(`&`, Map(`==`, df1[fld1],sval1)),]
 #   V Unhappy Col2
 # 2 1       Y    B
  #3 5       Y    B

data

df1 <- cbind(df, Col2= c("A", "B", "B", "C", "A"))
fld1 <- c(fld, 'Col2')
sval1 <- c(sval, 'B')

107

answered Oct 26 '22 00:10

akrun

Here's an alternative with base R, which is maybe not very elegant, but it might have the benefit of being rather easily understandable:

df[df[colnames(df)==fld]==sval,]
#  V Unhappy
#2 1       Y
#3 5       Y
#4 3       Y

answered Oct 26 '22 01:10

RHertel

Now, with rlang 0.4.0, it introduces a new more intuitive way for this type of use case:

packageVersion("rlang")
# [1] ‘0.4.0’

df <- data.frame(V=c(6, 1, 5, 3, 2), Unhappy=c("N", "Y", "Y", "Y", "N"))
fld <- "Unhappy"
sval <- "Y"

df %>% filter(.data[[fld]]==sval)

#OR
filter_col_val <- function(df, fld, sval) {
  df %>% filter({{fld}}==sval)
}

filter_col_val(df, Unhappy, "Y")

More information can be found at https://www.tidyverse.org/articles/2019/06/rlang-0-4-0/

Previous Answer

With dplyr 0.6.0 and later, this code works:

packageVersion("dplyr")
# [1] ‘0.7.1’

df <- data.frame(V=c(6, 1, 5, 3, 2), Unhappy=c("N", "Y", "Y", "Y", "N"))
fld <- "Unhappy"
sval <- "Y"

df %>% filter(UQ(rlang::sym(fld))==sval)

#OR
df %>% filter((!!rlang::sym(fld))==sval)

#OR
fld <- quo(Unhappy)
sval <- "Y"
df %>% filter(UQ(fld)==sval)

More about the dplyr syntax available at http://dplyr.tidyverse.org/articles/programming.html and the quosure usage in the rlang package https://cran.r-project.org/web/packages/rlang/index.html .

If you find it challenging mastering non-standard evaluation in dplyr 0.6+, Alex Hayes has an excellent writing-up on the topic: https://www.alexpghayes.com/blog/gentle-tidy-eval-with-examples/

Original Answer

With dplyr version 0.5.0 and later, it is possible to use a simpler syntax and gets closer to the syntax @Ricky originally wanted, which I also find more readable than using lazyeval::interp

df %>% filter_(.dots = paste0(fld, "=='", sval, "'"))

#  V Unhappy
#1 1       Y
#2 5       Y
#3 3       Y

#OR
df %>% filter_(.dots = glue::glue("{fld}=='{sval}'"))

answered Oct 26 '22 00:10

LmW.

Related questions
                            
                                How to retrieve overall accuracy value from confusionMatrix in R?
                            
                                Protect/encrypt R package code for distribution [closed]
                            
                                R Shiny input slider range values
                            
                                min max scaling/normalization in r for train and test data
                            
                                Use column index instead of name in group_by
                            
                                How do I limit the range of the viridis colour scale?
                            
                                Save output between pipes in dplyr [duplicate]
                            
                                R plot with an x time axis: how to force the ticks labels to be the days?
                            
                                write a gzip file from data frame
                            
                                Calling Custom functions from Python using rpy2
                            
                                Difference between neighbouring elements of a vector
                            
                                What's the difference between as.integer() and +0L used on booleans?
                            
                                Run a custom function on a data frame in R, by group
                            
                                R dplyr rowwise mean or min and other methods?
                            
                                Check to see if a value is within a range?
                            
                                How to parse JSON in a DataFrame column using R
                            
                                Map function to second level of nested list using purrr
                            
                                Suppress All Messages/Warnings with Readr read_csv function
                            
                                Multiple unions
                            
                                Generate correlated data in Python (3.3)

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With