Inference for CDFs

CDF estimation/inference is one of the most fundamental tasks in statistics, as the CDF completely describes the distribution of a random variable. Not so surprisingly, the empirical CDF serves as a natural and effective estimator. Suppose we have iid data . The empirical CDF is defined as the step function:

One can verify that is a valid CDF as it’s monotonic, right continuous, and has limits and .

For any distribution with CDF , the Glivenko–Cantelli theorem states the asymptotic almost sure convergence of , Donsker’s theorem states its asymptotic convergence in distribution, and Dvoretzky–Kiefer–Wolfowitz theorem gives the non-asymptotic convergence rate.

Glivenko–Cantelli

converges uniformly to the true CDF almost surely:

Proof

We use the weak Law of Large Numbers to prove Convergence in Probability. The Strong Convergence can be proved similarly using the strong law of large numbers.

We use a grid approach. Fix for the grid granularity . Construct grid points such that . By LLN, for all . Therefore, for any , there exists such that for all , . Let . Then by the union bound,

The above bound also applies when we let and as and .

For any , let be such that . Then, by the monotonicity,

Letting gives the result.

Dvoretzky–Kiefer–Wolfowitz

converges uniformly to the true CDF with a subGaussian tail bound:

Proof

Let . One important observation is that “stretching the x-axis” does not change . Therefore, we can arbitrarily stretch along the x-axis to make it arbitrarily close to .

Formally, we notice that

is the average of iid Bernoulli r.v.s with parameter . Let

where . We can see that is also a Bernoulli r.v. with parameter . Therefore,

This gives

where the equality holds iff iff is continuous.

Note that is just the empirical CDF of the uniform distribution. Therefore, has the same distribution for any continuous . We call such a distribution-invariant statistic a pivotal statistic.

Apply some concentration analysis to the supreme of uniform empirical CDF gives the desired result.

Donsker

If further is continuous, then

where is a Brownian bridge on .

To see the emergence of the Brownian bridge, we can fix a specific , and the CLT gives us

This is because is the average of indicator r.v.s. Then, the stochastic process has an asymptotic distribution similar to a Brownian motion , except that . Therefore, the stochastic process actually converges to a Brownian bridge: .