Partial profile score feature selection in high-dimensional generalized linear interaction models

Zengchao Xu,Shan Luo,Zehua Chen
DOI: https://doi.org/10.4310/21-sii706
2022-01-01
Statistics and Its Interface
Abstract:Sequential method is promising for feature selection in high-dimensional models. In this paper, we propose a sequential approach based on partial profile score dubbed as PPSFS to feature selection for a broad class of high dimensional models, including high-dimensional generalized linear interaction models. The PPSFS approach has a prominent performance in feature selection while it keeps highly scalable for ultra-high-dimensional models. The selection consistency of the PPSFS approach is established under mild conditions. Comprehensive numerical studies demonstrating the performance of PPSFS are reported. A real data analysis for gene expression cancer RNA-Seq data is also presented.
What problem does this paper attempt to address?