Pass a variable to dplyr::count in a loop

Tags:

I’m trying to run dplyr::count() on an arbitrary set of variables in one dataset. If I manually run count() once for each variable, I get the expected results. But when I try to put count() in a for loop to run it automatically for each variable in a set of variables, I got an error. It seems like the problem is in how I am passing the variable to count() within the for loop. I know that count() takes its variables unquoted, and for whatever reason R cannot tell that what I am passing it is a variable.

I’ve tried a number of things to fix this, including passing the variables as data$var1, quo(var1), enquo(var1), var1, “var1”, quo(data$var1), and enquo(data$var1) as well as unquoting the iterator with !!. I also tried specifying the arguments to count() like count(x=data, var=i), but this caused count() to return the total number of rows in data as the count for each iteration. If you have any ideas about what is causing the error or how I can fix it, I would very much appreciate hearing them!

Here is a minimal reproducible example that relies on the lakers dataset included with lubridate.

Click to copy

# This code requires some of the packages in tidyverse. 
library(dplyr)
library(lubridate)


# results = empty data frame for filling with info from the count() command
results <- data.frame()

# mydata = the source data
myData <- lakers

# myCols = list of the names of columns I want to count()
myCols <- c("opponent", "game_type", "player", "period")


# Loop to count() every column in myCols automatically and store the results in 
# one giant tibble of vars (var) and counts (n)

for(i in myCols){
results <- bind_rows(results, count(x=myData, i))
}

754

asked Sep 15 '17 16:09

jozimck

2 Answers

This works:

Click to copy

myData[myCols] %>% tidyr::gather(var, value) %>% count(var, value)

# A tibble: 407 x 3
         var value     n
       <chr> <chr> <int>
 1 game_type  away 17153
 2 game_type  home 17471
 3  opponent   ATL   904
 4  opponent   BOS   886
 5  opponent   CHA   412
 6  opponent   CHI   964
 7  opponent   CLE   822
 8  opponent   DAL  1333
 9  opponent   DEN  1855
10  opponent   DET   845
# ... with 397 more rows

If you want to pass myCols in a tibbledish manner, you'll have to look up the rlang package.

answered Oct 14 '22 22:10

Frank

From :https://github.com/tidyverse/dplyr/blob/master/vignettes/programming.Rmd

If you have a character vector of variable names, and want to operate on them with a for loop, index into the special .data pronoun:

Click to copy

for (var in names(mtcars)) {
  mtcars %>% count(.data[[var]]) %>% print()
}

answered Oct 14 '22 22:10

sigia

Related questions
                            
                                How to write loops "for" loops in R using dplyr syntax
                            
                                Ridge regression with `glmnet` gives different coefficients than what I compute by "textbook definition"?
                            
                                Can I subsample different sizes per group with dplyr?
                            
                                How can I gather_ on all columns but one?
                            
                                Replace NULL in a dataframe
                            
                                how to reduce vertical spacing between facet labels when using facet_wrap?
                            
                                Recoding values with dpylr using a lookup table
                            
                                Multivariate regression splines in R
                            
                                Requiring OpenMP availability for use in an Rcpp package
                            
                                How to add axis text in this negative and positive bars differently using ggplot2?
                            
                                par(mfrow) in R for ggplot [duplicate]
                            
                                How to understand the metrics of H2OModelMetrics Object through h2o.performance
                            
                                cbind (R function) equivalent in numpy
                            
                                show str(...) as table in R Markdown
                            
                                Converting GeoJSON into a Simple Feature in R
                            
                                Use dplyr::case_when with arguments programmatically
                            
                                Remove timezone during POSIXlt Conversion in R
                            
                                R: geom_point - how to show statistics on top of figure
                            
                                Use an image as area fill in an R plot
                            
                                How do I calculate a grouped z score in R using dplyr?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Pass a variable to dplyr::count in a loop

Tags:

r

dplyr

jozimck

People also ask

2 Answers

Frank

sigia

Recent Activity

Donate For Us