Central Limit Theorem
Let be a set of i.i.d. Random Variables with Mean and Variance . Denote . Then
where means convergence in probability.
- 💡 Also holds for multi-variate distributions: .
CLT and LLN
Central limit theorem (CLT) implies the Weak Law of Large Numbers. To see this, we can rewrite the CLT as . That is, converges in distribution to a point mass at , which is equivalent to convergence in probability to .
A non-asymptotic version of CLT, the Berry-Essèen Theorem, together with an additional bounded third moment assumption, implies the Strong Law of Large Numbers.
However, neither weak or strong LLN requires assumptions beyond finite mean. Therefore, LLN holds under weaker conditions, but offers less information about the asymptotic dynamics of the sample mean. See also ^519975.
Proof
If has MGF, we can use MGF to prove the theorem. To be more general, we use Characteristic Function. WLOG, we can assume . Let be the characteristic function of . Then we have
Therefore,
And the characteristic function of is
Then we have
Therefore,
which is the CF of standard Normal Distribution. By the inverse property and Convergence of Characteristic Functions,
Sup-Norm Approximation Error: Berry-Essèen Theorem
The Berry-Essèen theorem gives a non-asymptotic bound on the difference between the CDF of the sample mean and standard normal, capturing the convergence rate in the CLT. Suppose . Let . Then
We also have a non-uniform Berry-Essèen bound, which is tighter for larger :1
where is some constant2. Under this additional bounded third moment assumption, this bound implies the Strong Law of Large Numbers. To see this, we have
which implies
where we use Mill’s Ratio. Thus, by the Borel-Cantelli Lemma, we have and thus .