How much should we trust R2 and adjusted R2: evidence from regressions in top economics journals and Monte Carlo simulations

Qiang Chen,Ji Qi
DOI: https://doi.org/10.1080/15140326.2023.2207326
2023-05-04
Journal of Applied Economics
Abstract:R 2 and adjusted R 2 may exaggerate a model's true ability to predict the dependent variable in the presence of overfitting, whereas leave-one-out R 2 (LOOR 2 ) is robust to overfitting. We demonstrate this by replicating 279 regressions from 100 papers in top economics journals, where the median increases of R 2 and adjusted R 2 over LOOR 2 reach 40.2% and 21.4% respectively. The inflation of test errors over training errors increases with the severity of overfitting as measured by the number of regressors and nonlinear terms, and the presence of outliers, but decreases with the sample size. These results are further validated by Monte Carlo simulations.
economics
What problem does this paper attempt to address?