Function accessing data from enclosing environment

Q: How do I access global environment in R?

The global environment can be referred to as .GlobalEnv in R code, or obtained with globalenv(). The ls() function shows which variables and functions are defined in the current environment, and environment() returns the current environment.

Q: What is the parent environment of a function R?

The parent (enclosing) environment of a function is the environment in which the function was created. For a function defined at top level, that is the global environment. The environment a function is called from can be a different one, and R does not consult it when looking up variables that are not local to the function.
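As a small illustration of the point above, here is a sketch using only base R. The names make_counter and counter are made up for this example. The inner function keeps seeing count because it was created inside make_counter(), regardless of where it is later called from.

```r
# A function's enclosing environment is where it was defined,
# not where it is called from.
make_counter <- function() {
  count <- 0                 # lives in this call's execution environment
  function() {
    count <<- count + 1      # updates `count` in the enclosing environment
    count
  }
}

counter <- make_counter()
first <- counter()
second <- counter()

# The closure is enclosed by make_counter()'s frame, not by the global environment.
enclosed_globally <- identical(environment(counter), globalenv())

# The state can be inspected through the function's environment.
stored_count <- get("count", envir = environment(counter))
```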

Q: What does ls() mean in R?

The ls() function returns a character vector naming the variables and functions defined in the current environment (or in the environment you pass to it). Names that begin with a dot are not returned by default; use all.names = TRUE to include them.
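A quick sketch of ls() in base R, using a throwaway environment (env is just a name chosen for this example) so that the result does not depend on what happens to be in your session:

```r
# Work in a fresh environment so the listing is predictable.
env <- new.env()
assign("x", 1, envir = env)
assign("f", function() NULL, envir = env)
assign(".hidden", TRUE, envir = env)

visible <- ls(env)                        # dot-prefixed names are skipped
everything <- ls(env, all.names = TRUE)   # dot-prefixed names are included

# Called inside a function with no arguments, ls() lists that call's local variables.
local_names <- (function() {
  a <- 1
  b <- 2
  ls()
})()
```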

Tags:

r

I am working with the segmented package and have encountered a problem when calling davies.test() from within a function.

Consider the following situation:

library(segmented)

data = data.frame(x = 1:21, y = c(10:1, 0:10))
fit = lm(y ~ x, data = data)
fit.seg = segmented(fit, seg.Z = ~ x)
davies.test(fit.seg, seg.Z = ~ x, alternative = "greater")

That works perfectly and indicates that the segmented regression has two statistically different slopes.

Now if I package all of that up into a function like this:

testit <- function() {
  data = data.frame(x = 1:21, y = c(10:1, 0:10))
  fit = lm(y ~ x, data)
  fit.seg = segmented(fit, seg.Z = ~ x)
  davies.test(fit.seg, seg.Z = ~ x, alternative = "greater")$p.value
}
testit()

Then it works fine...

But if I delete fit from the global environment then it fails.

> rm(fit)
> testit()
 Error in eval(expr, envir, enclos) : object 'fit' not found

The problem seems to be with the way that davies.test is trying to access the data encapsulated in fit: it doesn't seem to look for fit in the enclosing scope (which in this case is the testit function), but skips directly to the global scope.
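For what it's worth, here is a sketch in base R of one general mechanism that can produce this kind of error. The functions fit_toy and refit_toy are hypothetical stand-ins invented for illustration; they are not taken from segmented, and I have not verified that the package behaves exactly this way. The idea is that a fitted object stores the name of its argument in its call, and a later function re-evaluates that name in an environment where the local variable is not visible.

```r
# Toy stand-in for a fitting function: it records its own call, which
# stores the *name* of the argument rather than the object itself.
fit_toy <- function(obj) {
  list(call = match.call())
}

# Toy stand-in for a test function that re-evaluates the stored argument.
# Lookup starts from `envir`, so variables local to the caller are
# invisible unless the caller's environment is passed in.
refit_toy <- function(model, envir = globalenv()) {
  eval(model$call$obj, envir)
}

wrapper <- function() {
  local_fit <- "I only exist inside wrapper()"
  model <- fit_toy(local_fit)
  list(
    wrong_env = tryCatch(refit_toy(model), error = function(e) conditionMessage(e)),
    right_env = refit_toy(model, envir = environment())
  )
}

result <- wrapper()
```

With no variable called local_fit in the global environment, wrong_env holds an "object ... not found" message, while right_env holds the local value.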

I'm sure that the problem relates to some subtlety with R's scoping rules. If I can find a quick fix that would prevent me from troubling the package author with this edge case, that would be great.

Thanks, Andrew.

asked Jan 04 '16 12:01

datawookie

3 Answers

Try inserting the line marked ## below. The warning that appears when the modified testit is run shows there is still a difference this does not account for, but the resulting p-value is the same, so it may be sufficient for your needs. This is, of course, a bug in the package, and the best course would be to ask the maintainer to fix it.

library(segmented)
testit <- function() {
  data = data.frame(x = 1:21, y = c(10:1, 0:10))
  fit = lm(y ~ x, data)
  fit.seg = segmented(fit, seg.Z = ~ x)
  environment(davies.test) <- environment() ##
  davies.test(fit.seg, seg.Z = ~ x, alternative = "greater")$p.value
}
testit()

giving:

[1] 0.01858149
Warning message:
In summary.lm(object) : essentially perfect fit: summary may be unreliable
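To see what the environment() assignment does in isolation, here is a toy sketch in base R (describe and run_local are names invented for this example). The assignment creates a local copy of the function whose free variables are resolved starting from the current call frame; the original function is not modified.

```r
# A toy function that relies on a free variable, `fit_name`, being found
# through its enclosing environment.
describe <- function() paste("using", fit_name)
env_before <- environment(describe)

run_local <- function() {
  fit_name <- "local fit"
  # Creates a *local copy* of describe() enclosed by this call frame.
  environment(describe) <- environment()
  describe()
}

local_result <- run_local()

# The top-level describe() still has its original enclosing environment.
original_untouched <- identical(environment(describe), env_before)
```

One thing to keep in mind with this trick when it is applied to a package function: the copy no longer has the package namespace as its enclosure, so it can only see the package's exported functions (via the search path), not unexported internal helpers.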

answered Nov 05 '22 14:11

G. Grothendieck

No need to make it a global variable. The problem is actually in segmented, not davies.test. It's not finding fit.

You can use dynGet to look for fit in the frames of the calling functions:

testit <- function() {
  data = data.frame(x = 1:21, y = c(10:1, 0:10))
  fit = lm(y ~ x, data)
  fit.seg = segmented(dynGet("fit"), seg.Z = ~ x)
  davies.test(fit.seg, seg.Z = ~ x, alternative = "greater")$p.value
}
testit()

That should work as you intend.

If you have multiple variables named fit in different environments, use get (see ?get) and specify which environment to take it from. dynGet is the lazy version: it walks back through the frames of the calling functions and returns the first match.
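The difference between the two lookups can be sketched in base R as follows. The names helper, outer_fn, env_a and env_b are made up for this example.

```r
helper <- function() {
  # dynGet() searches the frames of the calling functions, nearest first.
  dynGet("fit_label", ifnotfound = "not found")
}

outer_fn <- function() {
  fit_label <- "from outer_fn"
  helper()
}

dyn_result <- outer_fn()

# get() with an explicit environment picks one specific binding.
env_a <- new.env()
env_b <- new.env()
assign("fit_label", "from env_a", envir = env_a)
assign("fit_label", "from env_b", envir = env_b)
get_result <- get("fit_label", envir = env_b)
```

Here helper() has no fit_label of its own and is not lexically nested in outer_fn(), yet dynGet() finds the value because outer_fn() is on the call stack.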

answered Nov 05 '22 14:11

Mekki MacAulay

I contacted the author of segmented and he promptly responded. Another solution he proposed to the original issue would be:

testit <- function() {
  data = data.frame(x = 1:21, y = c(10:1, 0:10))
  fit = lm(y ~ x, data)
  fit.seg = segmented(fit, seg.Z = ~ x)
  fit.seg$call$obj <- fit
  davies.test(fit.seg, seg.Z = ~ x, alternative = "greater")$p.value
}
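The effect of overwriting an element of a stored call can be shown with a toy example in base R. Here fit_toy is a hypothetical stand-in, not a function from segmented. Once the symbol in the call is replaced by the object itself, evaluating that element no longer needs a variable lookup, so it works from any environment.

```r
# Toy stand-in for a fitting function that records its call.
fit_toy <- function(obj) list(call = match.call())

wrapper <- function() {
  local_fit <- c(1, 2, 3)
  model <- fit_toy(local_fit)

  stored_as_name <- is.name(model$call$obj)   # the call holds the symbol `local_fit`

  # Replace the stored symbol with the object itself.
  model$call$obj <- local_fit

  list(
    stored_as_name = stored_as_name,
    # Evaluating a value (rather than a name) does not depend on the environment.
    value = eval(model$call$obj, globalenv())
  )
}

patched <- wrapper()
```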

However, he also pointed out that the lm object should actually be passed directly to davies.test() as follows:

testit <- function() {
  data = data.frame(x = 1:21, y = c(10:1, 0:10))
  fit = lm(y ~ x, data)
  davies.test(fit, seg.Z = ~ x, alternative = "greater")$p.value
}

For clarification though, it should be noted that these two bits of code do different things: the second fragment actually fulfills my original purpose (checking for a statistically significant break in the fit), while the first fragment checks to see whether there is a second break.

answered Nov 05 '22 13:11

datawookie
