Variable selection for misclassified current status data under the proportional hazards model

Wenshan Wang, Lijun Fang, Shuwei Li, Jianguo Sun
DOI: https://doi.org/10.1080/03610918.2022.2050391
2022-03-11
Abstract: Misclassified current status data arise when the failure time of interest is not observed exactly but is known only to be smaller or larger than an observation time, and the failure status is determined by a diagnostic test subject to testing error. Such data commonly occur in various scientific fields, including clinical trials, demographic studies and epidemiological surveys. This paper discusses regression analysis of such data with a focus on variable selection, that is, identifying the important covariates associated with the failure time of interest. For this problem, a penalized maximum likelihood approach is proposed under the Cox proportional hazards model with the smoothly clipped absolute deviation (SCAD) penalty. More specifically, we develop a penalized EM algorithm to relieve the computational burden of maximizing the resulting complex penalized likelihood function. A simulation study is conducted to examine the empirical performance of the proposed approach in finite samples, and an application to a set of real data on chlamydia is also provided.
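For readers unfamiliar with the smoothly clipped absolute deviation penalty named in the abstract, the following is a minimal sketch of the standard SCAD penalty of Fan and Li (2001) for a single coefficient. It is not taken from the paper: the function name `scad_penalty` is hypothetical, and the default `a = 3.7` is the value conventionally recommended in the SCAD literature, assumed here rather than confirmed by the abstract.

```python
def scad_penalty(theta, lam, a=3.7):
    """SCAD penalty for one regression coefficient (illustrative sketch).

    theta : coefficient value
    lam   : tuning parameter lambda > 0
    a     : shape parameter, must exceed 2 (3.7 is an assumed conventional default)
    """
    if lam <= 0 or a <= 2:
        raise ValueError("require lam > 0 and a > 2")
    t = abs(theta)
    if t <= lam:
        # Small coefficients: behaves like the lasso (L1) penalty.
        return lam * t
    if t <= a * lam:
        # Intermediate coefficients: quadratic piece that tapers the penalty.
        return (2 * a * lam * t - t * t - lam * lam) / (2 * (a - 1))
    # Large coefficients: constant penalty, so they are not shrunk further.
    return lam * lam * (a + 1) / 2


def total_scad_penalty(coefs, lam, a=3.7):
    """Sum of SCAD penalties over a coefficient vector."""
    return sum(scad_penalty(c, lam, a) for c in coefs)
```

In a penalized likelihood approach of the kind the abstract describes, a term of this form (scaled by the sample size) would be subtracted from the log-likelihood, so that small coefficients are shrunk to zero while large ones are left nearly unpenalized.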