Variable Selection for Longitudinal Data with High-Dimensional Covariates and Dropouts

Xueying Zheng,Bo Fu,Jiajia Zhang,Guoyou Qin
DOI: https://doi.org/10.1080/00949655.2017.1404603
IF: 1.225
2017-01-01
Journal of Statistical Computation and Simulation
Abstract:A new variable selection approach utilizing penalized estimating equations is developed for high-dimensional longitudinal data with dropouts under a missing at random (MAR) mechanism. The proposed method is based on the best linear approximation of efficient scores from the full dataset and does not need to specify a separate model for the missing or imputation process. The coordinate descent algorithm is adopted to implement the proposed method and is computational feasible and stable. The oracle property is established and extensive simulation studies show that the performance of the proposed variable selection method is much better than that of penalized estimating equations dealing with complete data which do not account for the MAR mechanism. In the end, the proposed method is applied to a Lifestyle Education for Activity and Nutrition study and the interaction effect between intervention and time is identified, which is consistent with previous findings.
What problem does this paper attempt to address?