Model-X Knockoffs for high-dimensional controlled variable selection under the proportional hazards model with heterogeneity parameter

Ran Hu, Di Xia, Haoyu Wang, Caixu Xu, Yingli Pan
DOI: https://doi.org/10.1007/s00184-024-00966-0
IF: 0.96
2024-05-09
Metrika
Abstract: A major challenge in data integration is data heterogeneity in study population, study design, or study coordination. Ignoring such heterogeneity in data analysis can lead to biased estimation. This paper studies regression analysis of the proportional hazards model with a heterogeneity parameter. We combine the Model-X Knockoffs procedure with the fused LASSO approach to control the false discovery rate in variable selection and to perform integrative data analysis of partially heterogeneous subgroups when the outcome of interest is time to event. A regularized working partial likelihood function is established, and a reparameterization trick is developed for numerical computation of the proposed estimator. Simulation studies are conducted to assess the finite-sample performance of the proposed method. A data example from a clinical trial on primary biliary cirrhosis is analyzed to demonstrate the application of the proposed method.
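The abstract does not spell out how false discovery rate control is enforced once the knockoff statistics are available. As a minimal sketch, assuming the standard knockoff+ selection rule from the general knockoff filter (not confirmed here as the authors' exact procedure), the threshold is the smallest t > 0 for which (1 + #{j: W_j <= -t}) / max(1, #{j: W_j >= t}) <= q. The function names and the input array W are hypothetical; in the paper's setting the W_j would presumably be derived from the fused-LASSO-penalized working partial likelihood, whereas here they are simply supplied as numbers.

```python
import numpy as np

def knockoff_plus_threshold(W, q):
    """Return the knockoff+ threshold for statistics W at target FDR level q.

    Assumption: W[j] > 0 is evidence for variable j, W[j] < 0 is evidence
    for its knockoff copy. Returns np.inf when no threshold qualifies.
    """
    W = np.asarray(W, dtype=float)
    # Candidate thresholds are the distinct nonzero magnitudes of W.
    candidates = np.sort(np.unique(np.abs(W[W != 0])))
    for t in candidates:
        # Estimated false discovery proportion at threshold t.
        fdp_hat = (1 + np.sum(W <= -t)) / max(1, np.sum(W >= t))
        if fdp_hat <= q:
            return t
    return np.inf

def knockoff_select(W, q):
    """Indices of variables whose statistic meets the knockoff+ threshold."""
    tau = knockoff_plus_threshold(W, q)
    return np.flatnonzero(np.asarray(W, dtype=float) >= tau)

# Illustrative, made-up statistics (not data from the paper).
W_demo = [5, 4, 3, 2.5, -1, 0.5, 2, -0.2, 6, 3.5]
selected = knockoff_select(W_demo, q=0.2)
```

With these made-up statistics the threshold settles at the first magnitude where the estimated false discovery proportion drops to q or below; the "+1" in the numerator is what distinguishes knockoff+ from the less conservative plain knockoff threshold.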
statistics & probability