Confidence Interval

A (asymptotic) level $1-\alpha$ confidence set $C_n$ of a parameter $\theta$ is such that $\mathbb{P}(\theta \in C_n) \ge 1 - \alpha$ (for the asymptotic version, $\liminf_{n \to \infty} \mathbb{P}(\theta \in C_n) \ge 1 - \alpha$).

  • ❗️ The confidence set is random. The confidence means that $1-\alpha$ of all the set instances will cover the unknown value of $\theta$. The wrong statement is that $\theta$ lies in the confidence set with probability at least $1-\alpha$. Note that $\theta$ is not a Random Variable.
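
To see this frequency interpretation numerically, here is a minimal simulation sketch (a Gaussian model with known variance; the constants and names are my own choices):

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(0)
mu, sigma, n, alpha = 1.0, 2.0, 50, 0.05
z = stats.norm.ppf(1 - alpha / 2)  # critical value z_{1-alpha/2}

covered, trials = 0, 10_000
for _ in range(trials):
    x = rng.normal(mu, sigma, size=n)
    half = z * sigma / np.sqrt(n)              # half-width with known sigma
    lo, hi = x.mean() - half, x.mean() + half  # the CI is random ...
    covered += (lo <= mu <= hi)                # ... while mu is fixed

print(covered / trials)  # ~0.95: fraction of the random CIs covering mu
```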

Set relations are useful in constructing different CIs:

If $[a, b]$ is a CI for $\theta$ and $f$ is a monotonically increasing function, then $[f(a), f(b)]$ is a CI for $f(\theta)$.
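
For instance, $\exp$ is monotonically increasing, so exponentiating the endpoints of a CI for $\mu$ gives a CI for $e^\mu$. A minimal sketch (synthetic data, my setup):

```python
import numpy as np
from scipy import stats

x = np.random.default_rng(1).normal(loc=0.5, scale=1.0, size=100)

z = stats.norm.ppf(0.975)
half = z * x.std(ddof=1) / np.sqrt(len(x))
lo, hi = x.mean() - half, x.mean() + half  # 95% CI for mu

# exp is monotone increasing, so this is a 95% CI for exp(mu)
print(np.exp(lo), np.exp(hi))
```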

Test Statistic and Critical Values

Recall that a Statistic is a function of the observed data, e.g., mean and variance. If a test involves some parameters, a test statistic is often a function $T(X_{1:n}, \theta)$ of both the sample and the parameter, such that the distribution of $T$ is known and does not depend on the unknown parameter $\theta$.

Such a test statistic is also called a pivot (quantity).

Then, we can first construct a confidence interval for the test statistic $T(X_{1:n}, \theta)$. Using the knowledge of its distribution (or quantiles), the confidence interval can be given by:

$$\mathbb{P}\left(q_{\alpha/2} \le T(X_{1:n}, \theta) \le q_{1-\alpha/2}\right) = 1 - \alpha,$$

where $q_\beta$ is the $\beta$-th quantile of the distribution of $T$, and $q_{\alpha/2}$ and $q_{1-\alpha/2}$ are called the critical values.

First Example–Gaussian Mean

Let’s first consider a test statistic with a known distribution. Suppose a statistical model $X_1, \dots, X_n \overset{iid}{\sim} \mathcal{N}(\mu, \sigma^2)$ with known variance $\sigma^2$. Then we know that $T = \frac{\sqrt{n}(\bar{X}_n - \mu)}{\sigma} \sim \mathcal{N}(0, 1)$, whose distribution is known and independent of $\mu$. Using the quantile function (inverse CDF) of the Normal Distribution,

$$\mathbb{P}\left(-z_{1-\alpha/2} \le \frac{\sqrt{n}(\bar{X}_n - \mu)}{\sigma} \le z_{1-\alpha/2}\right) = 1 - \alpha,$$

where $z_{1-\alpha/2} = \Phi^{-1}(1-\alpha/2)$, which gives the CI

$$\left[\bar{X}_n - z_{1-\alpha/2}\frac{\sigma}{\sqrt{n}},\ \bar{X}_n + z_{1-\alpha/2}\frac{\sigma}{\sqrt{n}}\right].$$
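
A concrete sketch with synthetic data (the true mean and $\sigma$ here are my assumptions):

```python
import numpy as np
from scipy import stats

sigma, alpha = 3.0, 0.05
x = np.random.default_rng(2).normal(10.0, sigma, size=40)

# equivalent to mean +- z_{1-alpha/2} * sigma / sqrt(n)
lo, hi = stats.norm.interval(1 - alpha, loc=x.mean(), scale=sigma / np.sqrt(len(x)))
print(lo, hi)  # finite-sample valid 95% CI for the mean
```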

Standard Error Based CI

Usually the test statistic is of the form $T = \frac{\hat{\theta} - \theta}{\widehat{se}}$, where $\hat{\theta}$ is the estimator of $\theta$, and $\widehat{se}$ is the standard error, i.e., the (estimated) standard deviation of the estimator $\hat{\theta}$. Then, if we know the quantile function of $T$, i.e., critical values, the confidence interval is of the form:

$$\left[\hat{\theta} - q_{1-\alpha/2} \cdot \widehat{se},\ \hat{\theta} - q_{\alpha/2} \cdot \widehat{se}\right],$$

which is $\hat{\theta} \pm q_{1-\alpha/2} \cdot \widehat{se}$ when the distribution of $T$ is symmetric.

Confidence Interval Width

The width of the confidence interval, that is, its precision, depends on (see the sketch after this list):

  • The sample size $n$: the larger the sample size, the narrower the CI.
  • The confidence level: the higher the confidence, the wider the CI will be!
  • The standard deviation of the population or SE: the larger the SE, the wider the CI will be.
  • The method used to construct the CI.
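
A quick numerical sketch of the first two effects (a normal-quantile CI is assumed):

```python
import numpy as np
from scipy import stats

sigma = 1.0
for n in (25, 100, 400):
    for conf in (0.90, 0.95, 0.99):
        z = stats.norm.ppf(0.5 + conf / 2)   # z_{1-alpha/2} with alpha = 1-conf
        width = 2 * z * sigma / np.sqrt(n)   # width = 2 * z * SE
        print(f"n={n:4d} conf={conf:.2f} width={width:.3f}")
# width shrinks like 1/sqrt(n) and grows with the confidence level
```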

CLT CI

The CI in First Example–Gaussian Mean is finite-sample valid since we use exact quantiles. When estimating the mean of an unknown distribution with unknown variance, we can leverage the asymptotic normality given by the CLT:

$$\frac{\sqrt{n}(\bar{X}_n - \mu)}{\hat{\sigma}_n} \xrightarrow{d} \mathcal{N}(0, 1), \quad \text{giving the CI} \quad \bar{X}_n \pm z_{1-\alpha/2}\frac{\hat{\sigma}_n}{\sqrt{n}},$$

where $\hat{\sigma}_n^2 = \frac{1}{n-1}\sum_{i=1}^n (X_i - \bar{X}_n)^2$ is the sample variance; then $\hat{\sigma}_n/\sqrt{n}$ is an estimated SE.
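
A minimal sketch of the CLT CI on non-Gaussian data (exponential is my choice here; its true mean is 2):

```python
import numpy as np
from scipy import stats

x = np.random.default_rng(3).exponential(scale=2.0, size=500)

z = stats.norm.ppf(0.975)
se_hat = x.std(ddof=1) / np.sqrt(len(x))  # estimated SE via sample std
print(x.mean() - z * se_hat, x.mean() + z * se_hat)  # asymptotic 95% CI
```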

Asymptotic Validity

We prove that the CLT CI is asymptotically valid. Denote $z = z_{1-\alpha/2}$. We have

$$\mathbb{P}\left(\mu \in \bar{X}_n \pm z\frac{\hat{\sigma}_n}{\sqrt{n}}\right) = \mathbb{P}\left(-z \le \frac{\sqrt{n}(\bar{X}_n - \mu)}{\hat{\sigma}_n} \le z\right).$$

By CLT, $\sqrt{n}(\bar{X}_n - \mu) \xrightarrow{d} \mathcal{N}(0, \sigma^2)$; by LLN, $\hat{\sigma}_n \xrightarrow{P} \sigma$. Then, by Slutsky’s Theorem,

$$\frac{\sqrt{n}(\bar{X}_n - \mu)}{\hat{\sigma}_n} \xrightarrow{d} \mathcal{N}(0, 1),$$

so the probability above converges to $\Phi(z) - \Phi(-z) = 1 - \alpha$.

Hoeffding CI

For bounded r.v.s $X_i \in [a, b]$, Hoeffding’s Inequality gives

$$\mathbb{P}\left(\left|\bar{X}_n - \mu\right| \ge t\right) \le 2\exp\left(-\frac{2nt^2}{(b-a)^2}\right).$$

Letting the RHS be $\alpha$ gives $t = (b-a)\sqrt{\frac{\log(2/\alpha)}{2n}}$, which further gives a level $1-\alpha$ CI:

$$\bar{X}_n \pm (b-a)\sqrt{\frac{\log(2/\alpha)}{2n}}.$$

  • ❗️ Note that the Hoeffding CI is finite-sample valid, in contrast to the CLT CI, which is only asymptotically valid. However, Hoeffding is very conservative and not typically used in practice (see the comparison sketch below).
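
A sketch comparing the two half-widths on Bernoulli data, so $[a, b] = [0, 1]$ (data and constants are my choices):

```python
import numpy as np
from scipy import stats

x = np.random.default_rng(4).binomial(1, 0.3, size=200)  # X_i in [0, 1]
n, alpha = len(x), 0.05

t_hoeff = np.sqrt(np.log(2 / alpha) / (2 * n))  # (b - a) = 1
t_clt = stats.norm.ppf(1 - alpha / 2) * x.std(ddof=1) / np.sqrt(n)

print(f"Hoeffding half-width: {t_hoeff:.4f}")  # valid for every n, but wide
print(f"CLT half-width:       {t_clt:.4f}")    # narrower, asymptotic only
```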

Wald CI

When the SE of the estimator depends on the true parameter $\theta$, we can simply plug the estimator $\hat{\theta}$ into the SE to get an estimated $\widehat{se} = se(\hat{\theta})$. This gives the Wald statistic:

$$T = \frac{\hat{\theta} - \theta}{se(\hat{\theta})}.$$

Under some conditions, $T \xrightarrow{d} \mathcal{N}(0, 1)$. Thus, the Wald CI is $\hat{\theta} \pm z_{1-\alpha/2} \, se(\hat{\theta})$. The Wald CI is also called the plug-in CI.

  • 👎 Unlike the CLT CI, where we use the sample variance to approximate the SE, the Wald CI requires knowledge of how the SE depends on the parameter.

  • 👍 However, the Wald CI is usually easier to compute, and needs only one statistic.

    • 📗 For example, when estimating the mean, the Wald CI only needs $\bar{X}_n$, while computing the sample variance additionally requires a second statistic (e.g., a running sum of squares) or the entire dataset (see the sketch below).
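
A sketch for the Bernoulli mean, where $se(p) = \sqrt{p(1-p)/n}$ depends on the parameter, so only $\hat{p} = \bar{X}_n$ is needed (function name is mine):

```python
import numpy as np
from scipy import stats

def wald_ci(p_hat: float, n: int, alpha: float = 0.05) -> tuple[float, float]:
    """Plug-in (Wald) CI for a Bernoulli mean: needs only p_hat and n."""
    z = stats.norm.ppf(1 - alpha / 2)
    se_hat = np.sqrt(p_hat * (1 - p_hat) / n)  # SE with p_hat plugged in
    return p_hat - z * se_hat, p_hat + z * se_hat

print(wald_ci(p_hat=0.3, n=200))
```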

Wilson Score CI

Instead of constructing the CI using the Standard Error Based CI with an estimated SE, we can also use the exact SE and solve the inequality with the unknown parameter on both sides. However, only in a few cases does this lead to a closed form.

For example, for the Binomial Distribution, using the exact SE $se = \sqrt{p(1-p)/n}$, we have

$$\mathbb{P}\left(-z_{1-\alpha/2} \le \frac{\hat{p} - p}{\sqrt{p(1-p)/n}} \le z_{1-\alpha/2}\right) \approx 1 - \alpha.$$

The involved inequalities are quadratic in $p$. Solving them gives the Wilson score CI:

$$\frac{\hat{p} + \frac{z^2}{2n}}{1 + \frac{z^2}{n}} \pm \frac{z}{1 + \frac{z^2}{n}}\sqrt{\frac{\hat{p}(1-\hat{p})}{n} + \frac{z^2}{4n^2}}, \quad z = z_{1-\alpha/2}.$$
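
A sketch of the Wilson CI next to the Wald CI; with zero successes the Wald interval degenerates while Wilson stays sensible (function name is mine):

```python
import numpy as np
from scipy import stats

def wilson_ci(p_hat: float, n: int, alpha: float = 0.05) -> tuple[float, float]:
    """Wilson score CI: solve the quadratic inequality in p exactly."""
    z = stats.norm.ppf(1 - alpha / 2)
    center = (p_hat + z**2 / (2 * n)) / (1 + z**2 / n)
    half = (z / (1 + z**2 / n)) * np.sqrt(p_hat * (1 - p_hat) / n + z**2 / (4 * n**2))
    return center - half, center + half

# extreme case: zero successes out of 20 trials
print(wilson_ci(0.0, 20))  # still a non-degenerate interval, ~[0, 0.16]
# the Wald CI here would collapse to [0, 0] since se(0) = 0
```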