Inference for CDFs
CDF estimation/inference is one of the most fundamental tasks in statistics, as the CDF completely describes the distribution of a random variable. Not so surprisingly, the empirical CDF serves as a natural and effective estimator. Suppose we have iid data . The empirical CDF is defined as the step function:
One can verify that is a valid CDF as it’s monotonic, right continuous, and has limits and .
For any distribution with CDF , the Glivenko–Cantelli theorem states the asymptotic almost sure convergence of , Donsker’s theorem states its asymptotic convergence in distribution, and Dvoretzky–Kiefer–Wolfowitz theorem gives the non-asymptotic convergence rate.
Glivenko–Cantelli
converges uniformly to the true CDF almost surely:
Proof
We use the weak Law of Large Numbers to prove Convergence in Probability. The Strong Convergence can be proved similarly using the strong law of large numbers.
We use a grid approach. Fix for the grid granularity . Construct grid points such that . By LLN, for all . Therefore, for any , there exists such that for all , . Let . Then by the union bound,
The above bound also applies when we let and as and .
For any , let be such that . Then, by the monotonicity,
Letting gives the result.
Dvoretzky–Kiefer–Wolfowitz
converges uniformly to the true CDF with a subGaussian tail bound:
Proof
Let . One important observation is that “stretching the x-axis” does not change . Therefore, we can arbitrarily stretch along the x-axis to make it arbitrarily close to .
Formally, we notice that
is the average of iid Bernoulli r.v.s with parameter . Let
where . We can see that is also a Bernoulli r.v. with parameter . Therefore,
This gives
where the equality holds iff iff is continuous.
Note that is just the empirical CDF of the uniform distribution. Therefore, has the same distribution for any continuous . We call such a distribution-invariant statistic a pivotal statistic.
Apply some concentration analysis to the supreme of uniform empirical CDF gives the desired result.
Donsker
If further is continuous, then
where is a Brownian bridge on .
To see the emergence of the Brownian bridge, we can fix a specific , and the CLT gives us
This is because is the average of indicator r.v.s. Then, the stochastic process has an asymptotic distribution similar to a Brownian motion , except that . Therefore, the stochastic process actually converges to a Brownian bridge: .