I would like to calculate Wald confidence intervals of the coefficients of a glm on a somewhat large data set, and use <code>broom</code> for a tidy output. <pre class="prettyprint"><code>mydata <- data.frame(y = rbinom(1e5,1,0.8), x1 = rnorm(1e5), x2 = rnorm(1e5)) glm.1 <- glm(y ~ x1 + x2, data = mydata, family = "binomial") </code></pre> Using <code>broom::tidy</code> takes a lot of time on large data, since it uses <code>confint.glm</code>, which calculates the confidence intervals based on the profiled log-likelihood function. <pre class="prettyprint"><code>tidy(glm.1, conf.int = TRUE) # can take literally hours </code></pre>

<code>confint</code> and <code>confint.glm</code> respectively do not take an argument for the method used to calculate the confidence intervals. If you want to use another method, you need to use a different function, e.g. <code>confint.default</code> for Wald. <code>broom::tidy</code> in turn does not have an argument for the function used (or did I miss something?), it always calls <code>confint.glm</code> for glm. To calculate confidence intervals with a different function, <code>broom</code> has <code>confint_tidy</code>, where you can specify the function you want to use: <pre class="prettyprint"><code>confint_tidy(glm.1, func = stats::confint.default) </code></pre> Put this together with the estimates: <pre class="prettyprint"><code>cbind(tidy(glm.1), confint_tidy(glm.1, func = stats::confint.default)) </code></pre>

Fast Wald confidence intervals for a glm with broom in R

Tags:

r

confidence-interval

glm

broom

I would like to calculate Wald confidence intervals of the coefficients of a glm on a somewhat large data set, and use broom for a tidy output.

mydata <- data.frame(y = rbinom(1e5,1,0.8), 
                 x1 = rnorm(1e5), 
                 x2 = rnorm(1e5))
glm.1 <- glm(y ~ x1 + x2, data = mydata, family = "binomial")

Using broom::tidy takes a lot of time on large data, since it uses confint.glm, which calculates the confidence intervals based on the profiled log-likelihood function.

tidy(glm.1, conf.int = TRUE) # can take literally hours

445

asked Oct 31 '18 20:10

bebru

1 Answers

confint and confint.glm respectively do not take an argument for the method used to calculate the confidence intervals. If you want to use another method, you need to use a different function, e.g. confint.default for Wald.

broom::tidy in turn does not have an argument for the function used (or did I miss something?), it always calls confint.glm for glm.

To calculate confidence intervals with a different function, broom has confint_tidy, where you can specify the function you want to use:

confint_tidy(glm.1, func = stats::confint.default)

Put this together with the estimates:

cbind(tidy(glm.1), confint_tidy(glm.1, func = stats::confint.default))

answered Oct 08 '22 00:10

bebru

Related questions
                            
                                Show letters as key glyphs for geom_text legend instead of default 'a'
                            
                                R: table function suprisingly slow
                            
                                How do I split a data frame among columns, say at every nth column?
                            
                                Can't figure out how to use conda environment after reticulate::use_condaenv(path)
                            
                                Implementing custom stopping metrics to optimize during training in H2O model directly from R
                            
                                How to make scatterplot points open a hyperlink using ggplotly - R
                            
                                Add column with percentage of matching words in two different columns (by row) in R
                            
                                How to output the columns with the maximum value
                            
                                Populating a "count matrix" with permutations of R data.table rows
                            
                                R: From GeoJson to DataFrame?
                            
                                How to Apply String Vector to Logical Vector
                            
                                data.table modifies parent environment / weird behavior with setDT
                            
                                R. plotly - padding or margin for graph inside Shinyapp?
                            
                                show multiple plots from ggplot on one page in r
                            
                                Fill down every other row with level above in tidyverse
                            
                                Combine rows based on ranges in a column
                            
                                Dot-and-whisker plots of filtered estimates for multiple regression models
                            
                                Conditional running count (cumulative sum) with reset in R (dplyr)
                            
                                p-value from fisher.test() does not match phyper()
                            
                                Foreach .combine Function to combine lists in R

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Fast Wald confidence intervals for a glm with broom in R

Tags:

r

confidence-interval

glm

broom

bebru

People also ask

1 Answers

bebru

Recent Activity

Donate For Us