High-Dimensional Probability
High-dimensional probability theory studies a high-dim random vector $X = (X_1, \dots, X_n) \in \mathbb{R}^n$ and its transformations (high-dim functions $f(X)$). Without any additional structure, there is nothing special about this random vector. In high-dim probability, we always assume $X$ has independent (or weakly dependent) coordinates, i.e., $X = (X_1, \dots, X_n)$ where $X_1, \dots, X_n$ are independent random variables. Suppose $X_1, \dots, X_n$ are i.i.d.; the very first result we have concerns the variance of their sum:
$$\operatorname{Var}\left(\sum_{i=1}^{n} X_i\right) = \sum_{i=1}^{n} \operatorname{Var}(X_i) = n \operatorname{Var}(X_1).$$
Such a factorization into the dimension and a one-dimensional property is called tensorization, a key technique in high-dimensional probability.
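As a quick sanity check, here is a minimal simulation (Python/NumPy, an added sketch not in the original notes) verifying the factorization for i.i.d. uniform coordinates, where $\operatorname{Var}(X_1) = 1/12$:

```python
import numpy as np

rng = np.random.default_rng(0)
n, trials = 100, 100_000

# i.i.d. coordinates X_1, ..., X_n ~ Uniform(0, 1), so Var(X_1) = 1/12
X = rng.uniform(0.0, 1.0, size=(trials, n))
sums = X.sum(axis=1)

print(sums.var())  # empirical Var(X_1 + ... + X_n)
print(n / 12)      # tensorized prediction: n * Var(X_1) ≈ 8.33
```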
For the transformation of such high-dim random vectors, high-dimensional probability theory studies high-dim functions of the form $f(X) = f(X_1, \dots, X_n)$.
Provided that $f$ is “smooth” enough, we expect Concentration of Measure, i.e., $f(X) \approx \mathbb{E}\, f(X)$ with high probability. Provided that $f$ is not too “complex”, we can express it as a supremum of a stochastic process and bound its expectation by its “complexity”.
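As a concrete illustration (an added sketch, not part of the original notes): the Euclidean norm $f(X) = \|X\|_2$ is $1$-Lipschitz, and for a standard Gaussian vector $X \in \mathbb{R}^n$ it concentrates around $\sqrt{n}$ with fluctuations of constant order, regardless of $n$:

```python
import numpy as np

rng = np.random.default_rng(0)
for n in (100, 1_000, 10_000):
    X = rng.standard_normal(size=(1_000, n))
    norms = np.linalg.norm(X, axis=1)
    # the mean grows like sqrt(n), while the std stays O(1)
    print(n, norms.mean().round(2), norms.std().round(2), np.sqrt(n).round(2))
```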
Applications
- Statistical learning
- Compressed sensing
- Random matrices
- Covariance matrix estimation
- Random graphs
- Sampling
- Optimal transport
- Gaussian approximation
Concentration of Measure
Consider the simplest form: $f(X) = \frac{1}{n}\sum_{i=1}^{n} X_i$. If $X_1, \dots, X_n$ are i.i.d. with a finite mean, then by the strong Law of Large Numbers, $\frac{1}{n}\sum_{i=1}^{n} X_i \to \mathbb{E}[X_1]$ almost surely. We ask:
Qn
- How about general functions?
- How fast is this convergence? (non-asymptotic)
Informal Principle
If $X_1, \dots, X_n$ are i.i.d., then $f(X_1, \dots, X_n) \approx \mathbb{E}\, f(X_1, \dots, X_n)$, provided that $f$ is “smooth” enough, i.e., $f$ does not depend too heavily on any of its coordinates. The classical tools below make this quantitative; a small simulation follows the list.
- Chebyshev's Inequality
- Hoeffding's Inequality
- Poincaré Inequality
- Chernoff Bound
- Log-Sobolev Inequality
- Transportation Inequality
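For a taste of the non-asymptotic flavor, here is a minimal sketch (Python/NumPy, not part of the original notes) comparing the empirical tail of the sample mean of i.i.d. $[0,1]$-valued coordinates with Hoeffding's bound $\mathbb{P}\left(\left|\frac{1}{n}\sum_{i} X_i - \mathbb{E} X_1\right| \ge t\right) \le 2e^{-2nt^2}$:

```python
import numpy as np

rng = np.random.default_rng(0)
n, trials, t = 200, 50_000, 0.1

# i.i.d. coordinates bounded in [0, 1]; the mean of Uniform(0, 1) is 0.5
means = rng.uniform(0.0, 1.0, size=(trials, n)).mean(axis=1)
empirical_tail = np.mean(np.abs(means - 0.5) >= t)

# Hoeffding: P(|mean - E X_1| >= t) <= 2 exp(-2 n t^2)
hoeffding_bound = 2 * np.exp(-2 * n * t**2)
print(empirical_tail, hoeffding_bound)  # empirical tail is far below the bound
```

The bound holds for every finite $n$, which is exactly the non-asymptotic guarantee asked for above; it is far from tight here, since Hoeffding's inequality uses only the boundedness of the coordinates.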
Suprema of Stochastic Processes
Qn
How large is $\mathbb{E}\, \sup_{t \in T} X_t$ for a stochastic process $(X_t)_{t \in T}$?
Ex
- $L^2$ error: $\|\hat{\theta} - \theta\|_2 = \sup_{\|u\|_2 \le 1} \langle u, \hat{\theta} - \theta \rangle$ for an estimator $\hat{\theta}$.
- Convex conjugate: $f^*(y) = \sup_{x} \left\{ \langle x, y \rangle - f(x) \right\}$.
The prevalence of suprema is related to the variational principle, which transforms the original problem into an Optimization problem, with the original solution corresponding to a supremum.
Informal Principle
If the process $(X_t)_{t \in T}$ is “smooth”, then $\mathbb{E}\, \sup_{t \in T} X_t$ can be controlled by the complexity of the index set $T$.
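As the simplest instance (an added sketch, not in the original notes), take a finite index set $T$ with independent standard Gaussian values $X_t$: then $\mathbb{E}\, \sup_{t \in T} X_t \le \sqrt{2 \log |T|}$, with the ratio tending to $1$ as $|T|$ grows, so the “complexity” here is the log-cardinality of $T$:

```python
import numpy as np

rng = np.random.default_rng(0)
for size in (10, 1_000, 100_000):
    # process values X_t: independent standard Gaussians indexed by a finite T
    sups = rng.standard_normal(size=(200, size)).max(axis=1)
    # E sup_t X_t is bounded by sqrt(2 log |T|) and approaches it as |T| grows
    print(size, sups.mean().round(2), np.sqrt(2 * np.log(size)).round(2))
```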