The Empirical Distribution Function
Definition (Empirical Distribution Function)
The empirical distribution function is the CDF that puts mass at each data point . Formally,
where is the indicator function.
Theorem
For any fixed value of ,
The DKW Inequality
The following inequality bounds the probability that the random function differs from by more than given constant anywhere on the real line.
Theorem (The Dvoretzky-Kiefer-Wolfowitz (DKW) Inequality)
Let . Then, for any ,
Building CDF Bands
From the DKW inequality, we can construct a confidence bound as follows:
Definition (A Nonparametric Confidence Band for )
Define,
It follows that for any ,