The coefficient of determination in the ridge regression

Ainara Rodríguez Sánchez,Román Salmerón Gómez,Catalina García
DOI: https://doi.org/10.1080/03610918.2019.1649421
2019-10-08
Abstract:In a linear regression, the coefficient of determination, <i>R</i><sup>2</sup>, is a relevant measure that represents the percentage of variation in the dependent variable that is explained by a set of independent variables. Thus, it measures the predictive ability of the estimated model. For an ordinary least squares (OLS) estimator, this coefficient is calculated from the decomposition of the sum of squares. However, when the model presents collinearity problems (a strong linear relation between the independent variables), the OLS estimation is unstable, and other estimation methodologies are proposed, with the ridge estimation being the most widely applied. This paper shows that the decomposition of the sum of squares is not verified in the ridge regression and proposes how the coefficient of determination should be calculated in this case.
statistics & probability
What problem does this paper attempt to address?