Order of execution of nested functions in dplyr pipe

When I use a nested function in a pipe step, the order of execution seems unintuitive.

df <- data.frame(a = c(1, NA, 2), b = c(NA, NA, 1))
df %>% is.na %>% colSums # Produces the correct count of missing values per column
df %>% colSums(is.na(.)) # Produces NA

Can anyone explain why the nested function in the third line does not produce the correct result?

230

asked Jan 15 '16 19:01

Heisenberg

1 Answer

It's because the left-hand side is still inserted as the first argument to the following function whenever the . appears only inside a nested call. magrittr skips that insertion only when . is itself a direct argument of the function. So in your second attempt at colSums, you assume that you're passing is.na(.) as the first argument to colSums, but you're actually passing it as the second, which is the na.rm parameter. What your code actually runs is df %>% colSums(x = ., na.rm = is.na(.)).

You can prevent the . from being inserted as the first argument by wrapping the call in {}: df %>% {colSums(is.na(.))}
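To see the difference side by side, here is a minimal sketch using the data frame from the question. It assumes magrittr is installed (dplyr re-exports the same %>% operator); the variable names piped, braced, and direct are only illustrative.

```r
library(magrittr)  # provides %>%; dplyr re-exports the same operator

df <- data.frame(a = c(1, NA, 2), b = c(NA, NA, 1))

# Step-by-step pipe: is.na() runs first, then colSums() on the logical matrix.
piped <- df %>% is.na %>% colSums

# Braces stop magrittr from inserting df as the first argument,
# so colSums() receives only is.na(df).
braced <- df %>% {colSums(is.na(.))}

# The same computation written without a pipe, for comparison.
direct <- colSums(is.na(df))

print(braced)
```

All three variables should hold the same per-column counts of missing values (1 for a, 2 for b), which is what the first pipe in the question already returned.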

answered Oct 25 '22 15:10

tblznbits


Tags: r, dplyr, magrittr
