A note on marginal correlation based screening

Run Wang,Somak Dutta,Vivekananda Roy
DOI: https://doi.org/10.1002/sam.11491
2020-12-10
Abstract:<p>Independence screening methods such as the two‐sample <span><i>t</i></span>‐test and the marginal correlation based ranking are among the most widely used techniques for variable selection in ultrahigh‐dimensional data sets. In this short note, simple examples are used to demonstrate potential problems with the independence screening methods in the presence of correlated predictors. Also, an example is considered where all important variables are independent among themselves and all but one important variables are independent with the unimportant variables. Furthermore, a real data example from a genome‐wide association study is used to illustrate inferior performance of marginal correlation screening compared to another screening method.</p>
computer science, artificial intelligence, interdisciplinary applications,statistics & probability
What problem does this paper attempt to address?