Model Averaging for Accelerated Failure Time Models with Missing Censoring Indicators

Longbiao Liao,Jinghao Liu
DOI: https://doi.org/10.3390/math12050641
IF: 2.4
2024-02-22
Mathematics
Abstract:Model averaging has become a crucial statistical methodology, especially in situations where numerous models vie to elucidate a phenomenon. Over the past two decades, there has been substantial advancement in the theory of model averaging. However, a gap remains in the field regarding model averaging in the presence of missing censoring indicators. Therefore, in this paper, we present a new model-averaging method for accelerated failure time models with right censored data when censoring indicators are missing. The model-averaging weights are determined by minimizing the Mallows criterion. Under mild conditions, the calculated weights exhibit asymptotic optimality, leading to the model-averaging estimator achieving the lowest squared error asymptotically. Monte Carlo simulations demonstrate that the method proposed in this paper has lower mean squared errors compared to other model-selection and model-averaging methods. Finally, we conducted an empirical analysis using the real-world Acute Myeloid Leukemia (AML) dataset. The results of the empirical analysis demonstrate that the method proposed in this paper outperforms existing approaches in terms of predictive performance.
mathematics
What problem does this paper attempt to address?
This paper attempts to solve the model averaging problem in the accelerated failure time model in the presence of missing censoring indicators. Specifically, the paper proposes a new model averaging method for dealing with right - censored data with missing censoring indicators. The model averaging weights are determined by minimizing the Mallows criterion, and it is proved that under certain conditions, the calculated weights are asymptotically optimal, so that the model average estimator asymptotically achieves the minimum squared error. In addition, Monte Carlo simulation results show that the method proposed in this paper has a lower mean squared error than other model selection and model averaging methods. Finally, the paper conducts an empirical analysis using a real - data set of acute myeloid leukemia (AML), and the results show that this method is superior to existing methods in terms of prediction performance.