R formula: wrap all variables in a transformation

Tags:

r

I have a formula with an arbitrary number of variables on the left and right-hand sides:

a + b * c ~ d + e

This formula can include various operators like + or *. I would like to wrap each variable of the formula in a transformation. For example, if my transformation is called Factor, then the formula above becomes:

Factor(a) + Factor(b) * Factor(c) ~ Factor(d) + Factor(e)

Notice that it preserved the same signs.

444

asked May 07 '21 11:05

Vincent

2 Answers

1) rrapply We can use rrapply to recursively walk the formula and surround every node that is a syntactic name with Factor(...). Alternately we could use is.word <- function(x) grepl("^\\w+$", x) to check for names that only contain word characters.

library(rrapply)
fo <- a + b * c ~ d + e

is.word <- function(x) make.names(x) == x
insert.Factor <- function(x) substitute(Factor(x), list(x = x))

rrapply(fo, is.word, insert.Factor)
## Factor(a) + Factor(b) * Factor(c) ~ Factor(d) + Factor(e)

If we can have formulas such as

fo2 <- a + b * c ~ I(d) + e

and we want I(Factor(d)) rather than Factor(I)(Factor(d)) then use this for is.word:

is.word <- function(x) make.names(x) == x && format(x) %in% all.vars(fo2)

2) gsub Convert to character string, perform the substitution and convert back. The input, fo, is defined above.

formula(gsub("(\\w+)", "Factor(\\1)", format(fo)), environment(fo))
## Factor(a) + Factor(b) * Factor(c) ~ Factor(d) + Factor(e)

3) Transform data frame If these variables will be obtained from a data frame DF then we could transform its columns and leave the formula as is.

DF[] <- lapply(DF, Factor)

answered Nov 11 '22 01:11

G. Grothendieck

Here is a way to update a formula with a recursive function:

update_formula <- function(x){
    if(length(x) == 3){
        x[[2]] <- update_formula(x[[2]])
        x[[3]] <- update_formula(x[[3]])
        return(x)
    }else{
        return(substitute(Factor(var), list(var = x)))
    }
}

f <- a + b * c ~ d + e
update_formula(f)
# Factor(a) + Factor(b) * Factor(c) ~ Factor(d) + Factor(e)

The main idea is that each binary operator corresponds to a list of length 3. For example:

> as.list(f)
[[1]]
`~`

[[2]]
a + b * c

[[3]]
d + e

> as.list(f[[2]])
[[1]]
`+`

[[2]]
a

[[3]]
b * c

> as.list(f[[3]])
[[1]]
`+`

[[2]]
d

[[3]]
e

So we update the second and third component each time we encounter a binary operator.

To apply arbitrary transformation:

update_formula2 <- function(x, trans){
    if(length(x) == 3){
        x[[2]] <- update_formula2(x[[2]], trans)
        x[[3]] <- update_formula2(x[[3]], trans)
        return(x)
    }else{
        return(substitute(fun(var), list(fun = trans, var = x)))
    }
}

f <- a + b * c ~ d + e
update_formula2(f, quote(Factor))
# Factor(a) + Factor(b) * Factor(c) ~ Factor(d) + Factor(e)
update_formula2(f, quote(log))
# log(a) + log(b) * log(c) ~ log(d) + log(e)

answered Nov 11 '22 02:11

mt1022

Related questions
                            
                                How to delete rows for leading and trailing NAs by group in R
                            
                                How to order a list by a custom function, discarding duplicates?
                            
                                Count the number of values between value and value - x by variable
                            
                                Error: The animation object does not specify a save_animation method
                            
                                Filter all rows with word next to a specified word in R
                            
                                Using table() function from base with dplyr pipe-syntax?
                            
                                tidyverse not loaded, it says "namespace ‘vctrs’ 0.2.0 is already loaded, but >= 0.2.1 is required"
                            
                                filter data in shiny app but keeping values in selectInput when updating table
                            
                                In writing an R package, using the flowCore::transform function, can I both use a variable name as text and get the actual value?
                            
                                Passing `lm` result to `stepAIC` works in script, fails inside function
                            
                                dplyr alternative for plyr::mapvalues (recode using dictionary)
                            
                                Combine multiple facet strips across columns in ggplot2 facet_wrap
                            
                                converting function calls to characters in R [duplicate]
                            
                                curly curly tidy evaluation programming with multiple inputs and custom function across columns
                            
                                Invalid value at 'start_index' (TYPE_UINT64), "1e+05" [invalid] issue while downloading data to R from BigQuery
                            
                                Print Shiny App Screen not working Error: shinyjs: extendShinyjs: `functions` argument must be provided
                            
                                Selecting a default value in an R plotly plot using a selectize box via crosstalk in R, using static html not shiny
                            
                                Setting Up Visual Studio code to work with R - "win32 can't use R"
                            
                                Pie chart and Bar chart aligned on same plot
                            
                                Warning in every model of glmmTMB 'giveCsparse'

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With