Exponential Family

A family of univariate PDF/PMF is said to be exponential if it can be expressed as:

or equivalently,

where all values are scalars, , and is the normalizing constant.

We can extend this to cover multivariate random variables and multi-dimensional parameters. Specifically, consider real vectors , and where . A joint PDF/PMF is said to be exponential if

where the inner product can be matrix inner product. Common distributions have , and such distribution is said to be in a ==-parameter exponential family==. When , we say it is in a curved exponential family. An exponential family has the following components:

  • is the normalizing constant;
  • is the base measure;
  • is the natural parameter; it can be thought of as a reparameterization of , and thus we require the dimension of to be no less than that of ;
  • is a ==Sufficient Statistic== w.r.t. the natural parameter space:
  • is the exponential tilt that up(down)-weights the base measure.

If (perhaps after some reparameterization ), we say the exponential family is in canonical form. Further, if the sufficient statistic is the r.v. itself, i.e., , we say the exponential family is in natural form. In between, we have the dispersion form:

where is called the dispersion parameter. We can see that if is known, then is the only canonical parameter, but the sufficient statistic is the dispersed : . If is unknown, the model may corresponds to a multi-parameter exponential family.

  • 📗 For example, a Normal Distribution with known variance has a dispersion form with , , , and .

Examples

To verify if some family of distributions is of exponential type, we must be able to identify the functions (or ), , and .

Distribution \ ComponentPDF/PMF
Univariate Normal Distribution
Multivariate Normal Distribution
Exponential Distribution
Bernoulli Distribution
Binomial Distribution with known
Poisson Distribution
Chi-Square Distribution
Gamma Distribution
Beta Distribution
Categorical Distribution with known with

MLE for Exponential Family

Due to the exponential family’s special form of the likelihood, the MLE estimator of coincides with the moment estimator w.r.t. the exponent statistic . WLOG, suppose . We first note that the inverse normalizing constant is infinitely differentiable:

where and . Therefore, the derivative of the log-likelihood is

Then, the MLE estimator of , which is the zero of the derivative, satisfies , indicating that MLE is a General MM Estimator w.r.t . As a result, the asymptotic normality property of both MLE (M-estimator) and MM (Z-estimator) applies.

Moments of Dispersion Exponential Family

The above calculation also gives a convenient way to compute the first (mean) and second (variance) central moments of when follows a dispersion exponential family. Specifically, notice that and . Therefore, we have

Under For Maximum Likelihood Estimation, we have , and thus

Similarly, under the same conditions, we have

which gives

  • 📗 For example, for a Poisson Distribution, , , and . Thus, .