Tail Bounds and Concentration Inequalities

Both tail bounds and concentration inequalities describe the probability that a random variable deviates from some certain value, such as its mean. In this notebook, we will explore how tail bounds and concentration inequalities predict the behavior of random variables.

Gaussian Tail Bound

The tail bound of a Gaussian random variable is described by Mill’s ratio:

We plot the theoretical tail bound:

We now demonstrate the tail bound of a Gaussian random variable using simulation.

t-Distribution Tail Bound

Gaussian distribution is known to have a light tail bound, i.e., an exponential decay in the tail probability. t Distribution, on the other hand, is known to have a heavy tail bound. For example, , or Cauchy, distribution, has a tail bound of

We plot the theoretical and empirical Cauchy tail bound against the theoretical Gaussian tail bound.

Chebyshev Inequality

Different from the tail bound of a specific random variable, concentration inequalities provide bounds for a class of random variables.

For example, for any random variable with finite mean and variance, Chebyshev’s inequality gives

Because of the generality, it’s often too loose for certain distributions.

Hoeffding’s Inequality

For bounded iid random variables , Hoeffding’s Inequality states that

For a single standard Gaussian random variable, Hoeffding’s inequality reduces to the Chernoff bound:

which is close to the exact Gaussian tail bound, with a slight difference caused by constants and the scaling of .

We now explore different concentration inequalities for non-Gaussian distributions. We use iid uniform random variables as an example.

Takeaways

  • Tail bounds describe the probability of a random variable deviating from a specific value, such as its mean.
  • Different distributions have different tail behaviors, with Gaussian having a light tail and Cauchy having a heavy tail. Tight-tailed distributions are easier to control and predict.
  • Different from exact tail bounds, concentration inequalities provide general bounds for a class of random variables, such as Chebyshev’s inequality and Hoeffding’s inequality.
  • Compared to exact tail bounds, concentration inequalities are often too conservative and thus provide looser bounds.