Tail Bounds and Concentration Inequalities
Both tail bounds and concentration inequalities describe the probability that a random variable deviates from some certain value, such as its mean. In this notebook, we will explore how tail bounds and concentration inequalities predict the behavior of random variables.
Gaussian Tail Bound
The tail bound of a Gaussian random variable is described by Mill’s ratio:
We plot the theoretical tail bound:
We now demonstrate the tail bound of a Gaussian random variable using simulation.
t-Distribution Tail Bound
Gaussian distribution is known to have a light tail bound, i.e., an exponential decay in the tail probability. t Distribution, on the other hand, is known to have a heavy tail bound. For example, , or Cauchy, distribution, has a tail bound of
We plot the theoretical and empirical Cauchy tail bound against the theoretical Gaussian tail bound.
Chebyshev Inequality
Different from the tail bound of a specific random variable, concentration inequalities provide bounds for a class of random variables.
For example, for any random variable with finite mean and variance, Chebyshev’s inequality gives
Because of the generality, it’s often too loose for certain distributions.
Hoeffding’s Inequality
For bounded iid random variables , Hoeffding’s Inequality states that
For a single standard Gaussian random variable, Hoeffding’s inequality reduces to the Chernoff bound:
which is close to the exact Gaussian tail bound, with a slight difference caused by constants and the scaling of .
We now explore different concentration inequalities for non-Gaussian distributions. We use iid uniform random variables as an example.
Takeaways
- Tail bounds describe the probability of a random variable deviating from a specific value, such as its mean.
- Different distributions have different tail behaviors, with Gaussian having a light tail and Cauchy having a heavy tail. Tight-tailed distributions are easier to control and predict.
- Different from exact tail bounds, concentration inequalities provide general bounds for a class of random variables, such as Chebyshev’s inequality and Hoeffding’s inequality.
- Compared to exact tail bounds, concentration inequalities are often too conservative and thus provide looser bounds.