The Wald Test
Let
Hypothesis testing says that a test is built by choosing a test statistic
Definition (The Wald Test)
Consider testing
Assume that
is asymptotically Normal: The size
Wald Test is: reject when where
Theorem
Asymptotically, the Wald test has a size
, that is, as
.
Intuition
The Wald statistic
measures how many estimated standard errors the observed estimate
If
That means values of
Intuition
Pretend the null value
is the truth. Then ask: would an estimate this far from , relative to its usual noise level, be surprising? If yes, reject
. If no, keep .
This is exactly the general hypothesis testing template with
So the rejection region is
Why Standardize?
The raw difference
- If
is small, even a modest difference from is strong evidence against . - If
is large, the same difference may just be sampling noise.
Dividing by
Connection to Confidence Intervals
The Wald test is the testing version of a normal-based confidence interval. The usual approximate
The null hypothesis
is equivalent to saying that
This is often the cleanest way to remember the Wald test:
Note
A Wald test rejects a null value precisely when that null value is not plausible according to the corresponding normal-based confidence interval.
Example
Suppose
Let
we estimate the standard error by
The Wald statistic is then
If
Informally: if the observed sample proportion is several estimated standard errors away from the hypothesized proportion
Caveat
The Wald test is simple and widely used, but it can be inaccurate in small samples, for skewed estimators, or when the parameter is near the boundary of its parameter space. In those settings, score tests, likelihood-ratio tests, or bootstrap-based methods can behave better.
Sources
- Wasserman, L. (2010). All of Statistics: A concise Course in Statistical Inference. Chapter 10.1.