Bootstrapping
Definition
The bootstrap is a method for estimating standard errors and computing confidence intervals.
Let
Intuition
If we only use the observed sample directly, we can certainly compute the statistic itself, such as the sample mean, median, or correlation. But that gives us only one realized value of the statistic. It does not tell us how much that statistic would change if we drew a different sample from the population.
That is the real goal of the bootstrap: not to recompute the statistic for its own sake, but to approximate the sampling distribution of the statistic. Once we have an approximation to that sampling distribution, we can estimate standard errors, variances, and confidence intervals.
Since the true population distribution
The bootstrap has two steps:
- Estimate
with . - Approximate
using simulation
For
Simulation
Suppose we draw an IID sample
as
More generally, if
as
Hence, we can use the sample variance of the simulated values to approximate
Limitations
The bootstrap is powerful, but it is not automatic or universally reliable.
- It only uses the observed sample. If the sample is biased or unrepresentative, the bootstrap inherits that problem.
- With small samples, the empirical distribution
may be a poor approximation to the true distribution , so the bootstrap approximation can be inaccurate. - It cannot recover behavior that is missing from the data, such as rare events or poorly observed tail behavior.
- The ordinary bootstrap assumes the data are IID. For dependent data, such as time series or clustered observations, the usual bootstrap should not be used without modification.
- Some statistics are not well approximated by the bootstrap, especially irregular or non-smooth ones such as maxima, minima, or post-model-selection estimators.
- Bootstrap confidence intervals can have poor coverage in small samples or highly skewed settings unless more careful variants are used.
- It can be computationally expensive when the statistic is costly to recompute many times.
So the bootstrap is most useful when the sample is reasonably representative, the IID assumption is plausible, and the statistic has a complicated sampling distribution but behaves regularly.
Parametric Bootstrap
For parametric models, standard errors and confidence intervals may also be estimated using the bootstrap. There is only one change. In the nonparametric bootstrap, we sample
Sources
- Wasserman, L. (2010). All of Statistics: A concise Course in Statistical Inference. Chapter 8.