Remove variable wrapped in function from model formula in R

Tags: regex, r

I have a model with transformed variables, e.g.:

data = data.frame(y = runif(100, 0, 10), x1 = runif(100, 0, 10), x2 = runif(100, 0, 10))
mod = lm(y ~ scale(x1) + scale(x2), data)

I would like to remove one entire variable from the formula, like so:

mod = lm(y ~ scale(x1), # x2 is gone! 
 data)

But I would like to do this using a user-supplied character string naming the variable to be removed (in other words, I'm wrapping this in a function, and it's not feasible to edit the formula by hand, as I have here).

If the variable were untransformed, this would be simple using gsub:

remove.var = "x2"
update(mod, formula. = as.formula(gsub(remove.var, "", format(formula(mod)))))

But with a transformed variable, it returns the wholly predictable error:

 > Error in as.matrix(x) : argument "x" is missing, with no default

because scale() is still in the formula!
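To make the failure concrete, here is a minimal sketch of the string that gsub leaves behind (the names f and stripped are only for illustration; no model or data is needed to see it):

```r
# Build the same formula as in the model above; terms are not evaluated here.
f <- y ~ scale(x1) + scale(x2)

# Deleting the bare variable name leaves an empty scale() call in the string.
stripped <- gsub("x2", "", format(f))
print(stripped)
```

The leftover scale() with no argument is what triggers the "argument "x" is missing" error once the model is refit.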

Is there a way to do this with regexpr, or some other way that is totally obvious and that I am not seeing? I would like it to scale to other types of transformations, e.g. log, log10, etc.

As another layer of complexity, suppose that the variable to be removed also appeared in an interaction:

 mod = lm(y ~ scale(x1) * scale(x2), data)

In this case, one would have to remove the interaction * as well (errant +s, I have found, are ok).

Any help is much appreciated. Thanks!


asked Nov 12 '14 22:11

jslefche

1 Answer

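A terms object is a formula with additional attributes, so the result of drop.terms() can be passed straight to update() as the new formula. Here the second term is dropped by position:

update(mod, formula. = drop.terms(mod$terms, 2, keep.response = TRUE))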

Call:
lm(formula = y ~ scale(x1), data = data)

Coefficients:
(Intercept)    scale(x1)  
     5.0121       0.1236

If you need to calculate that position from a string argument, then you can grep the term.labels attribute:

> grep( "x2", attr( mod$terms, "term.labels") )
[1] 2
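As a small sketch of what grep is searching, the term labels can be inspected directly from a formula without fitting anything (tl_add, tl_int and hits are illustrative names, and fixed = TRUE is my addition so the variable name is treated as a literal string rather than a regular expression):

```r
# Term labels for the additive and the interaction versions of the formula.
tl_add <- attr(terms(y ~ scale(x1) + scale(x2)), "term.labels")
tl_int <- attr(terms(y ~ scale(x1) * scale(x2)), "term.labels")

print(tl_add)
print(tl_int)

# Positions of every label that mentions "x2" in the interaction formula.
hits <- grep("x2", tl_int, fixed = TRUE)
print(hits)
```

Because `*` expands into both main effects plus the `:` interaction, the interaction label also contains "x2" and is picked up by the same search.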

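Notice that this also succeeds with the interaction formula, where grep matches both scale(x2) and scale(x1):scale(x2), so the main effect and the interaction are dropped together:

update(mod, formula. = drop.terms(mod$terms, grep("x2", attr(mod$terms, "term.labels")), keep.response = TRUE))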
#----------

Call:
lm(formula = y ~ scale(x1), data = data)

Coefficients:
(Intercept)    scale(x1)  
     5.0121       0.1236
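Since the question asks for a function driven by a user-supplied string, the pieces above could be wrapped up roughly as follows. This is only a sketch under some assumptions: drop_variable is a hypothetical name, matching is done with all.vars() instead of grep (my substitution, so that "x2" does not also match a variable such as x20), and the data frame must be visible from where the helper is defined, because update() re-evaluates the model call.

```r
# Hypothetical helper: refit `model` without any term that uses variable `var`.
drop_variable <- function(model, var) {
  tt <- terms(model)
  labels <- attr(tt, "term.labels")

  # all.vars() parses each label, so "x2" matches scale(x2) and
  # scale(x1):scale(x2) but not scale(x20).
  uses_var <- vapply(
    labels,
    function(lbl) var %in% all.vars(parse(text = lbl)),
    logical(1),
    USE.NAMES = FALSE
  )

  if (!any(uses_var)) return(model)  # nothing to drop

  # If every term would go, fall back to an intercept-only model
  # instead of asking drop.terms() for an empty right-hand side.
  if (all(uses_var)) return(update(model, . ~ 1))

  update(model, formula. = drop.terms(tt, which(uses_var), keep.response = TRUE))
}

# Usage on simulated data like the question's.
set.seed(1)
dat <- data.frame(y = runif(100, 0, 10), x1 = runif(100, 0, 10),
                  x2 = runif(100, 0, 10), x20 = runif(100, 0, 10))
mod_add <- lm(y ~ scale(x1) + scale(x2), data = dat)
mod_int <- lm(y ~ scale(x1) * scale(x2), data = dat)
mod_x20 <- lm(y ~ log(x2) + scale(x20), data = dat)

formula(drop_variable(mod_add, "x2"))
formula(drop_variable(mod_int, "x2"))
formula(drop_variable(mod_x20, "x2"))
```

The same helper covers other transformations such as log or log10, because it looks at the variables inside each term, not at the text of the wrapper.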

answered Oct 12 '22 01:10

IRTFM
