During a recent investigation into setting random seeds within functions, I came across an odd situation. Consider functions <code>f</code> and <code>g</code>, each of which sets the random seed and then performs a simple randomized operation: <pre class="prettyprint"><code>g <- function(size) { set.seed(1) ; runif(size) } f <- function(x) { set.seed(2) ; x*runif(length(x)) } </code></pre> Because each function sets the random seed, I would expect each function to always have the same return value given the same input. This would mean <code>f(g(2))</code> should return the same thing as <code>x <- g(2) ; f(x)</code>. To my surprise this is not the case: <pre class="prettyprint"><code>f(g(2)) # [1] 0.1520975 0.3379658 x <- g(2) f(x) # [1] 0.04908784 0.26137017 </code></pre> What is going on here?

This is an example of the double-slit R experiment. When x is observed, it acts as a particle; when unobserved it acts as a wave. Behold <pre class="prettyprint"><code>g <- function(size) { set.seed(1) ; runif(size) } f <- function(x) {set.seed(2) ; x*runif(length(x)) } f2 <- function(x) {print(x); set.seed(2) ; x*runif(length(x)) } f(g(2)) # [1] 0.1520975 0.3379658 x <- g(2) f(x) # [1] 0.04908784 0.26137017 f2(g(2)) # [1] 0.2655087 0.3721239 # [1] 0.04908784 0.26137017 x <- g(2) f2(x) # [1] 0.2655087 0.3721239 # [1] 0.04908784 0.26137017 </code></pre> I'm just josilbering you. <code>print</code> is forcing <code>x</code>. You can do that explicitly <pre class="prettyprint"><code>f <- function(x) {force(x); set.seed(2) ; x*runif(length(x)) } x <- g(2) f(x) # [1] 0.04908784 0.26137017 </code></pre> But not this <pre class="prettyprint"><code>f(force(g(2))) # [1] 0.1520975 0.3379658 </code></pre>

The <code>x</code> argument of your <code>f()</code> function is only evaluated at the moment that it is actually used inside the function. This means that the <code>set.seed(2)</code> is evaluated before the execution of the <code>g()</code> function when you try to compute <code>f(g(2))</code>. <pre class="prettyprint"><code>> f(g(2)) [1] 0.1520975 0.3379658 </code></pre> is basically equivalent to: <pre class="prettyprint"><code>> set.seed(2) > set.seed(1) > result <- runif(2) > result*runif(length(result)) [1] 0.1520975 0.3379658 </code></pre>

Inconsistent results for f(g(x)) together or split up

Q: How do you know when to use the chain rule?

We use the chain rule when differentiating a 'function of a function', like f(g(x)) in general. We use the product rule when differentiating two functions multiplied together, like f(x)g(x) in general. Take an example, f(x) = sin(3x).

Q: What is the formula for calculating product rule?

The product rule is one of the derivative rules that we use to find the derivative of functions of the form P(x) = f(x)·g(x).

Q: What is the difference between chain rule and product rule?

The chain ruleis used to dierentiate a function that has a function within it. The product ruleis used to dierentiate a function that is the multiplication of two functions. The quotient ruleis used to dierentiate a function that is the division of two functions.

Tags:

random

r

random-seed

During a recent investigation into setting random seeds within functions, I came across an odd situation. Consider functions f and g, each of which sets the random seed and then performs a simple randomized operation:

g <- function(size) { set.seed(1) ; runif(size) }
f <- function(x) { set.seed(2) ; x*runif(length(x)) }

Because each function sets the random seed, I would expect each function to always have the same return value given the same input. This would mean f(g(2)) should return the same thing as x <- g(2) ; f(x). To my surprise this is not the case:

f(g(2))
# [1] 0.1520975 0.3379658

x <- g(2)
f(x)
# [1] 0.04908784 0.26137017

What is going on here?

525

asked Jun 27 '15 01:06

josliber

2 Answers

This is an example of the double-slit R experiment. When x is observed, it acts as a particle; when unobserved it acts as a wave. Behold

g <- function(size) { set.seed(1) ; runif(size) }
f <- function(x) {set.seed(2) ; x*runif(length(x)) }
f2 <- function(x) {print(x); set.seed(2) ; x*runif(length(x)) }

f(g(2))
# [1] 0.1520975 0.3379658

x <- g(2)
f(x)
# [1] 0.04908784 0.26137017


f2(g(2))
# [1] 0.2655087 0.3721239
# [1] 0.04908784 0.26137017

x <- g(2)
f2(x)
# [1] 0.2655087 0.3721239
# [1] 0.04908784 0.26137017

I'm just josilbering you. print is forcing x. You can do that explicitly

f <- function(x) {force(x); set.seed(2) ; x*runif(length(x)) }
x <- g(2)
f(x)
# [1] 0.04908784 0.26137017

But not this

f(force(g(2)))
# [1] 0.1520975 0.3379658

194

answered Oct 31 '22 03:10

rawr

The x argument of your f() function is only evaluated at the moment that it is actually used inside the function. This means that the set.seed(2) is evaluated before the execution of the g() function when you try to compute f(g(2)).

> f(g(2))
[1] 0.1520975 0.3379658

is basically equivalent to:

> set.seed(2)
> set.seed(1)
> result <- runif(2)
> result*runif(length(result))
[1] 0.1520975 0.3379658

answered Oct 31 '22 01:10

Jellen Vermeir

Related questions
                            
                                Pandas - how to convert r dataframe back to pandas?
                            
                                if - else if - else statement and brackets
                            
                                Installing non-public packages from Gitlab using devtools::install_git
                            
                                lapply() when function returns NULL
                            
                                How to plot the survival curve generated by survreg (package survival of R)?
                            
                                using colClasses in fread
                            
                                Generate a repeating sequence based on vector
                            
                                Count number of vector values in range with R
                            
                                How to step through an R script from the beginning?
                            
                                Control the height in fluidRow in R shiny
                            
                                R: Assign variable labels of data frame columns
                            
                                Filtering dates in dplyr
                            
                                How can I remove empty factors from ggplot2 facets?
                            
                                How to convert NUM to INT in R?
                            
                                Text file to list in R
                            
                                How to directly select the same column from all nested lists within a list?
                            
                                Using grep to help subset a data frame
                            
                                How to get parameters from config file in R script
                            
                                find all functions (including private) in a package
                            
                                R: `which` statement with multiple conditions

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With