Testing for Regression
Regression studies the effect of the explanatory variable on the response variable . After solving the regression task, we want to know if specific features of have significant effects on . We use Hypothesis Testing for this purpose.
For example, in a linear regression model:
we want to test
Other regression models and hypotheses can be considered. We can also test a group of features using Multiple Hypothesis Testing.
Gaussian Linear Model
We focus the above example with a Gaussian Linear Model, i.e., . We have two useful lemmas:1
- Let , where is feature dimension (). Then .
- .
Combining the two lemmas gives the test statistic:
where is the -th diagonal element of the matrix .
Footnotes
-
Both lemmas can be proved using the derivation in Minimax Estimator or ^ls-pred. Specifically, for the first item, and thus . For the second item is sufficient for and perpendicular to , and thus . ↩