I'm trying to write a function which takes as argument a dataframe and the name of the function. When I try to write the function with the standard R syntax, I can get the good result using <code>eval</code> and <code>substitute</code> as recommanded by @hadley in http://adv-r.had.co.nz/Computing-on-the-language.html <pre class="prettyprint"><code>> df <- data.frame(y = 1:10) > f <- function(data, x) { + out <- mean(eval(expr = substitute(x), envir = data)) + return(out) + } > f(data = df, x = y) [1] 5.5 </code></pre> Now, when I try to write the same function using the <code>%>%</code> operator, it doesn't work : <pre class="prettyprint"><code>> df <- data.frame(y = 1:10) > f <- function(data, x) { + data %>% + eval(expr = substitute(x), envir = .) %>% + mean() + } > f(data = df, x = y) Show Traceback Rerun with Debug Error in eval(expr, envir, enclos) : objet 'y' introuvable > </code></pre> How can I using the combine the piping operator with the use of <code>eval</code> and <code>substitute</code> ? It's seems really tricky for me.

A workaround would be <pre class="prettyprint"><code>f <- function(data, x) { v <- substitute(x) data %>% eval(expr = v, envir = .) %>% mean() } </code></pre> The problem is that the pipe functions (<code>%>%</code>) are creating another level of closure which interferes with the evaluation of <code>substitute(x)</code>. You can see the difference with this example <pre class="prettyprint"><code>df <- data.frame(y = 1:10) f1 <- function(data, x) { print(environment()) eval(expr = environment(), envir = data) } f2 <- function(data, x) { print(environment()) data %>% eval(expr = environment(), envir = .) } f1(data = df, x = y) # <environment: 0x0000000006388638> # <environment: 0x0000000006388638> f2(data = df, x = y) # <environment: 0x000000000638a4a8> # <environment: 0x0000000005f91ae0> </code></pre> Notice how the environments differ in the matrittr version. You want to take care of <code>substitute</code> stuff as soon as possible when mucking about with non-standard evaluation. I hope your use case is a bit more complex than your example, because it seems like <pre class="prettyprint"><code>mean(df$y) </code></pre> would be a much easier bit of code to read.

I've been trying to understand my problem. First, I've written what I want with the <code>summarise()</code> function : <pre class="prettyprint"><code>> library(dplyr) > df <- data.frame(y = 1:10) > summarise_(.data = df, mean = ~mean(y)) mean 1 5.5 </code></pre> Then I try to program my own function. I've found a solution which seems to work with the <code>lazyeval</code> package in this post. I use the <code>lazy()</code> and the <code>interp()</code> functions to write what I want. The first possibility is here : <pre class="prettyprint"><code>> library(lazyeval) > f <- function(data, col) { + col <- lazy(col) + inter <- interp(~mean(x), x = col) + summarise_(.data = data, mean = inter) + } > f(data = df, col = y) mean 1 5.5 </code></pre> I can also use pipes : <pre class="prettyprint"><code>> f <- function(data, col) { + col <- lazy(col) + inter <- interp(~mean(x), x = col) + data %>% + summarise_(.data = ., mean = inter) + } > > f(data = df, col = y) mean 1 5.5 </code></pre>

How can I use dplyr/magrittr's pipe inside functions in R?

Tags:

r

dplyr

magrittr

nse

I'm trying to write a function which takes as argument a dataframe and the name of the function. When I try to write the function with the standard R syntax, I can get the good result using eval and substitute as recommanded by @hadley in http://adv-r.had.co.nz/Computing-on-the-language.html

> df <- data.frame(y = 1:10)
> f <- function(data, x) {
+   out <- mean(eval(expr = substitute(x), envir = data))
+   return(out)
+ }
> f(data = df, x = y)
[1] 5.5

Now, when I try to write the same function using the %>% operator, it doesn't work :

> df <- data.frame(y = 1:10)
> f <- function(data, x) {
+   data %>% 
+     eval(expr = substitute(x), envir = .) %>% 
+     mean()
+ }
> f(data = df, x = y)
Show Traceback
Rerun with Debug
 Error in eval(expr, envir, enclos) : objet 'y' introuvable 
>

How can I using the combine the piping operator with the use of eval and substitute ? It's seems really tricky for me.

534

asked Feb 11 '16 17:02

PAC

2 Answers

A workaround would be

f <- function(data, x) {
  v <- substitute(x)
  data %>% 
    eval(expr = v, envir = .) %>%
    mean()
}

The problem is that the pipe functions (%>%) are creating another level of closure which interferes with the evaluation of substitute(x). You can see the difference with this example

df <- data.frame(y = 1:10)
f1 <- function(data, x) {
  print(environment())
  eval(expr = environment(), envir = data)
}

f2 <- function(data, x) {
  print(environment())
  data %>% 
    eval(expr = environment(), envir = .)
}
f1(data = df, x = y)
# <environment: 0x0000000006388638>
# <environment: 0x0000000006388638>
f2(data = df, x = y)
# <environment: 0x000000000638a4a8>
# <environment: 0x0000000005f91ae0>

Notice how the environments differ in the matrittr version. You want to take care of substitute stuff as soon as possible when mucking about with non-standard evaluation.

I hope your use case is a bit more complex than your example, because it seems like

mean(df$y)

would be a much easier bit of code to read.

143

answered Oct 07 '22 08:10

MrFlick

I've been trying to understand my problem.

First, I've written what I want with the summarise() function :

> library(dplyr)
> df <- data.frame(y = 1:10)
> summarise_(.data = df, mean = ~mean(y))
  mean
1  5.5

Then I try to program my own function. I've found a solution which seems to work with the lazyeval package in this post. I use the lazy() and the interp() functions to write what I want.

The first possibility is here :

> library(lazyeval)
> f <- function(data, col) {
+   col <- lazy(col)
+   inter <- interp(~mean(x), x = col)
+   summarise_(.data = data, mean = inter)    
+   }
> f(data = df, col = y)
  mean
1  5.5

I can also use pipes :

> f <- function(data, col) {
+   col <- lazy(col)
+   inter <- interp(~mean(x), x = col)
+   data %>% 
+     summarise_(.data = ., mean = inter)    
+ }
> 
> f(data = df, col = y)
  mean
1  5.5

answered Oct 07 '22 08:10

PAC

Related questions
                            
                                Create table with all pairs of values from one column in R, counting unique values [duplicate]
                            
                                R data table recommended way to deal with date time
                            
                                match() values with tolerance
                            
                                Combine legend in ggplot2
                            
                                Improve R code, getting numbers with regular expressions
                            
                                Dplyr conditional windowing
                            
                                Unable to convert Month-Year string to Date in R
                            
                                check if package name belongs to a CRAN archived package
                            
                                sort bar chart by sum of values in ggplot
                            
                                How to find correct executable with Sys.which on Windows
                            
                                Converting an SEXP from R into a vector of strings in C++
                            
                                Delete certain rows in a group of rows in R
                            
                                ggplot2 legend with only one category / with only the shape and no scale
                            
                                How to collapse branches in a phylogenetic tree by the label in their nodes or leaves?
                            
                                Error in strsplit(unitspec, " ") in code for Machine Learning for Hackers
                            
                                Background of grid.arrange
                            
                                RPivotTable being able to list more than allowable amount
                            
                                Multivariate GARCH(1,1) in R
                            
                                Is there a way to simplify functions in R that utilize loops?
                            
                                How to fill colors in some specific area in R?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With