Generalized Cross-Validation as a Method for Choosing a Good Ridge Parameter

Gene H. Golub,Michael Heath,Grace Wahba
DOI: https://doi.org/10.1080/00401706.1979.10489751
1979-05-01
Technometrics
Abstract:Consider the ridge estimate (λ) for β in the model unknown, (λ) = (X T X + nλI)−1 X T y. We study the method of generalized cross-validation (GCV) for choosing a good value for λ from the data. The estimate is the minimizer of V(λ) given by where A(λ) = X(X T X + nλI)−1 X T . This estimate is a rotation-invariant version of Allen's PRESS, or ordinary cross-validation. This estimate behaves like a risk improvement estimator, but does not require an estimate of σ2, so can be used when n − p is small, or even if p ≥ 2 n in certain cases. The GCV method can also be used in subset selection and singular value truncation methods for regression, and even to choose from among mixtures of these methods.
statistics & probability
What problem does this paper attempt to address?