Rank-Based Sequential Feature Selection for High-Dimensional Accelerated Failure Time Models with Main and Interaction Effects

Ke Yu,Shan Luo
DOI: https://doi.org/10.1016/j.csda.2024.107978
IF: 2.035
2024-05-15
Computational Statistics & Data Analysis
Abstract:High-dimensional accelerated failure time (AFT) models are commonly used regression models in survival analysis. Feature selection problem in high-dimensional AFT models is addressed, considering scenarios involving solely main effects or encompassing both main and interaction effects. A rank-based sequential feature selection (RankSFS) method is proposed, the selection consistency is established and illustrated by comparing it with existing methods through extensive numerical simulations. The results show that RankSFS achieves a higher Positive Discovery Rate (PDR) and lower False Discovery Rate (FDR). Additionally, RankSFS is applied to analyze the data on Breast Cancer Relapse. With a remarkable short computational time, RankSFS successfully identifies two crucial genes.
statistics & probability,computer science, interdisciplinary applications
What problem does this paper attempt to address?