Forward Regression for Ultra-High Dimensional Variable Screening

Hansheng Wang
DOI: https://doi.org/10.1198/jasa.2008.tm08516
IF: 4.369
2009-01-01
Journal of the American Statistical Association
Abstract:Motivated by the seminal theory of Sure Independence Screening (Fan and Lv 2008, SIS), we investigate here another popular and classical variable screening method, namely, forward regression (FR). Our theoretical analysis reveals that FR can identify all relevant predictors consistently, even if the predictor dimension is substantially larger than the sample size. In particular, if the dimension of the true model is finite, FR can discover all relevant predictors within a finite number of steps. To practically select the "best" candidate from the models generated by FR, the recently proposed BIC criterion of Chen and Chen (2008) can be used. The resulting model can then serve as an excellent starting point, from where many existing variable selection methods (e.g., SCAD and Adaptive LASSO) can be applied directly. FR's outstanding finite sample performances are confirmed by extensive numerical studies.
What problem does this paper attempt to address?