adjusted bootstrap confidence intervals (BCa) with parametric bootstrap in boot package

Q: What is a BCa confidence interval?

To compute a BCa confidence interval, you estimate z0 and a and use them to adjust the endpoints of the percentile confidence interval (CI). If the bootstrap distribution is positively skewed, the CI is adjusted to the right. If the bootstrap distribution is negatively skewed, the CI is adjusted to the left.

Q: What is parametric bootstrapping?

Parametric bootstrapping assumes that the data comes from a known distribution with unknown parameters. (For example the data may come from a Poisson, negative binomial for counts, or normal for continuous distribution.)

Q: What percentile would you use for a bootstrap 95% confidence interval?

Percentile Bootstrap Method The percentile bootstrap interval is just the interval between the 100×(α2) and 100×(1-α2) percentiles of the distribution of θ estimates obtained from resampling, where θ represents a parameter of interest and α is the level of significance (e.g., α = 0.05 for 95% CIs) (Efron, 1982).

Tags:

r

statistics-bootstrap

I am attempting to use boot.ci from R's boot package to calculate bias- and skew-corrected bootstrap confidence intervals from a parametric bootstrap. From my reading of the man pages and experimentation, I've concluded that I have to compute the jackknife estimates myself and feed them into boot.ci, but this isn't stated explicitly anywhere. I haven't been able to find other documentation, although to be fair I haven't looked at the original Davison and Hinkley book on which the code is based ...

If I naively run b1 <- boot(...,sim="parametric") and then boot.ci(b1), I get the error influence values cannot be found from a parametric bootstrap. This error occurs if and only if I specify type="all" or type="bca"; boot.ci(b1,type="bca") gives the same error. So does empinf(b1). The only way I can get things to work is to explicitly compute jackknife estimates (using empinf() with the data argument) and feed these into boot.ci.

Construct data:

Click to copy

set.seed(101)
d <- data.frame(x=1:20,y=runif(20))
m1 <- lm(y~x,data=d)

Bootstrap:

Click to copy

b1 <- boot(d$y,
           statistic=function(yb,...) {
             coef(update(m1,data=transform(d,y=yb)))
           },
           R=1000,
           ran.gen=function(d,m) {
             unlist(simulate(m))
           },
           mle=m1,
           sim="parametric")

Fine so far.

Click to copy

boot.ci(b1)
boot.ci(b1,type="bca")
empinf(b1)

all give the error described above.

This works:

Click to copy

L <- empinf(data=d$y,type="jack",
            stype="i",
            statistic=function(y,f) {
              coef(update(m1,data=d[f,]))
            })

boot.ci(b1,type="bca",L=L)

Does anyone know if this is the way I'm supposed to be doing it?

update: The original author of the boot package responded to an e-mail:

... you are correct that the issue is that you are doing a parametric bootstrap. The bca intervals implemented in boot are non-parametric intervals and this should have been stated explicitely somewhere. The formulae for parametric bca intervals are not the same and depend on derivatives of the least favourable family likelihood when there are nuisance parameters as in your case. (See pp 206-207 in Davison & Hinkley) empinf assumes that the statistic is in one of forms used for non-parametric bootstrapping (which you did in your example call to empinf) but your original call to boot (correctly) had the statistic in a different form appropriate for parametric resampling.

You can certainly do what you're doing but I am not sure of the theoretical properties of mixing parametric resampling with non-parametric interval estimation.

927

asked Sep 28 '11 19:09

Ben Bolker

1 Answers

After looking at the boot.ci page I decided to use a boot-object constructed along the lines of an example in Ch 6 of Davison and Hinkley and see whether it generated the errors you observed. I do get a warning but no errors.:

Click to copy

require(boot) 
lmcoef <- function(data, i){
      d <- data[i, ]
      d.reg <- lm(y~x, d)
      c(coef(d.reg)) }
lmboot <- boot(d, lmcoef, R=999)
m1
boot.ci(lmboot, index=2)   # I am presuming that the interest is in the x-coefficient
#----------------------------------
BOOTSTRAP CONFIDENCE INTERVAL CALCULATIONS
Based on 999 bootstrap replicates

CALL : 
boot.ci(boot.out = lmboot, index = 2)

Intervals : 
Level      Normal              Basic         
95%   (-0.0210,  0.0261 )   (-0.0236,  0.0245 )  

Level     Percentile            BCa          
95%   (-0.0171,  0.0309 )   (-0.0189,  0.0278 )  
Calculations and Intervals on Original Scale
Warning message:
In boot.ci(lmboot, index = 2) :
  bootstrap variances needed for studentized intervals

answered Nov 15 '22 17:11

IRTFM

Related questions
                            
                                How to change the first row to be the header in R?
                            
                                Creating a Unique Sequence of Dates
                            
                                How to use Outlier Tests in R Code
                            
                                tm_map has parallel::mclapply error in R 3.0.1 on Mac
                            
                                Get the right hand side variables of an R formula
                            
                                Find the indices of last occurrence of the unique elements in a vector
                            
                                Faster version of combn
                            
                                Idiom for ifelse-style recoding for multiple categories
                            
                                regex - return all before the second occurrence
                            
                                dplyr - summary table for multiple variables
                            
                                Access to MySQL with R using a pre 4.1.1 authentication protocol
                            
                                Error in : `data` must be a data frame, or other object coercible by `fortify()`
                            
                                Rtools is required to build R packages but is not currently installed
                            
                                Apply a function over all combinations of arguments
                            
                                Linked table of contents (toc) in md using rmarkdown
                            
                                adding a CSS Stylesheet in R shiny
                            
                                Why is := faster than `:=`()?
                            
                                Convert character to html in R
                            
                                Rmd/Kntir: Markdown citations in LaTeX environments
                            
                                How can you create a box around an axis tick label in ggplot2?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

adjusted bootstrap confidence intervals (BCa) with parametric bootstrap in boot package

Tags:

r

statistics-bootstrap

Ben Bolker

People also ask

1 Answers

IRTFM

Recent Activity

Donate For Us