I often like to fit and examine multiple models that relate two variables in an R dataframe. I can do that using syntax like this: <pre class="prettyprint"><code>require(tidyverse) require(broom) models <- list(hp ~ exp(cyl), hp ~ cyl) map_df(models, ~tidy(lm(data=mtcars, formula=.x))) </code></pre> But I'm used to the pipe syntax and was hoping to be able to something like this: <pre class="prettyprint"><code>mtcars %>% map_df(models, ~tidy(lm(data=., formula=.x))) </code></pre> That makes it clear that I'm "starting" with <code>mtcars</code> and then doing stuff to it to generate my output. But that syntax doesn't work, giving an error <code>Error: Index 1 must have length 1</code>. Is there a way to write my <code>purrr:map()</code> function in a way that I can pipe <code>mtcars</code> into it to get the same output as the working code above? I.e. <pre class="prettyprint"><code>mtcars %>% <<<something>>> </code></pre>

tl/dr: <code>mtcars %>% {map_df(models, function(.x) tidy(lm(data=., formula=.x)))}</code> Or <code>mtcars %>% map_df(models, ~tidy(lm(..1,..2)), ..2 = .)</code> <hr> There are 2 problems with the solution you've tried. The first is that you need to use curly braces if you want to place the dot in an unusual place. <pre class="prettyprint"><code>library(magrittr) 1 %>% divide_by(2) # 0.5 -> this works 1 %>% divide_by(2,.) # 2 -> this works as well 1 %>% divide_by(2,mean(.,3)) # this doesn't 1 %>% divide_by(.,2,mean(.,3)) # as it's equivalent to this one 1 %>% {divide_by(2,mean(.,3))} # but this one works as it forces all dots to be explicit. </code></pre> The second is that you can't use the dot with the <code>~</code> formulation in the way you intended, try <code>map(c(1,2), ~ 3+.)</code> and <code>map(c(1,2), ~ 3+.x)</code> (or even <code>map(c(1,2), ~ 3+..1)</code>) and you'll see you get the same result. By the time you use the dot in a <code>~</code> formula it's not linked to the pipe function anymore. To make sure the dot is interpreted as <code>mtcars</code> you need to use the good old <code>function(x) ...</code> definition. This works: <pre class="prettyprint"><code>mtcars %>% {map_df(models, function(.x) tidy(lm(data=., formula=.x)))} </code></pre> Finally, as a bonus, here's what I came up with, trying to find a solution without curly braces : <pre class="prettyprint"><code>mtcars %>% map(models,lm,.) %>% map_df(tidy) mtcars %>% map_df(models, ~tidy(lm(..1,..2)), ..2 = .) </code></pre>

working with lists of models using the pipe syntax

Tags:

dataframe

r

purrr

tidyverse

broom

I often like to fit and examine multiple models that relate two variables in an R dataframe.

I can do that using syntax like this:

require(tidyverse)
require(broom)
models <- list(hp ~ exp(cyl), hp ~ cyl)
map_df(models, ~tidy(lm(data=mtcars, formula=.x)))

But I'm used to the pipe syntax and was hoping to be able to something like this:

mtcars %>% map_df(models, ~tidy(lm(data=., formula=.x)))

That makes it clear that I'm "starting" with mtcars and then doing stuff to it to generate my output. But that syntax doesn't work, giving an error Error: Index 1 must have length 1.

Is there a way to write my purrr:map() function in a way that I can pipe mtcars into it to get the same output as the working code above? I.e.

mtcars %>% <<<something>>>

864

asked Feb 02 '18 17:02

Curt F.

1 Answers

tl/dr: mtcars %>% {map_df(models, function(.x) tidy(lm(data=., formula=.x)))}

Or mtcars %>% map_df(models, ~tidy(lm(..1,..2)), ..2 = .)

There are 2 problems with the solution you've tried.

The first is that you need to use curly braces if you want to place the dot in an unusual place.

library(magrittr)
1 %>% divide_by(2)   # 0.5     -> this works
1 %>% divide_by(2,.) # 2       -> this works as well
1 %>% divide_by(2,mean(.,3))   #  this doesn't    
1 %>% divide_by(.,2,mean(.,3)) #  as it's equivalent to this one
1 %>% {divide_by(2,mean(.,3))} #  but this one works as it forces all dots to be explicit.

The second is that you can't use the dot with the ~ formulation in the way you intended, try map(c(1,2), ~ 3+.) and map(c(1,2), ~ 3+.x) (or even map(c(1,2), ~ 3+..1)) and you'll see you get the same result. By the time you use the dot in a ~ formula it's not linked to the pipe function anymore.

To make sure the dot is interpreted as mtcars you need to use the good old function(x) ... definition.

This works:

mtcars %>% {map_df(models, function(.x) tidy(lm(data=., formula=.x)))}

Finally, as a bonus, here's what I came up with, trying to find a solution without curly braces :

mtcars %>% map(models,lm,.) %>% map_df(tidy)
mtcars %>% map_df(models, ~tidy(lm(..1,..2)), ..2 = .)

128

answered Nov 02 '22 04:11

Moody_Mudskipper

Related questions
                            
                                R How to install package 'graph'?
                            
                                Recode a string column into integer using dplyr
                            
                                R: Extreme bunching of random values from runif with Mersenne-Twister seed
                            
                                suppress line/index numbers in R output
                            
                                ggplot2 vertical lines from data points in grouped scatter plot
                            
                                add_trace in Plotly in a loop [duplicate]
                            
                                How to repeat sequence when condition is met
                            
                                How do I manually fit a viewport with a fixed aspect ratio into its parent such that no space is wasted like ggplot can do?
                            
                                Plot emojis/emoticons in R with ggplot
                            
                                Use strsplit with multiple delimiters [duplicate]
                            
                                Finding subsets within a dataframe and writing the result
                            
                                Memory leaks in a simple Rcpp function
                            
                                R Shiny - how to display choice label in selectInput
                            
                                In delayed expression evaluation, R Shiny uses changed values of variables
                            
                                Create unary operator in R
                            
                                specify position of some nodes in a graph
                            
                                R - Extract multiple rows from column 1 if certain value appears in column 2
                            
                                Using dplyr::group_by() to find min dates with NAs [duplicate]
                            
                                Remove an argument / element from ellipsis
                            
                                Italicize labels of only one legend in ggplot

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With