Semiparametric Prognosis Models in Genomic Studies.

Shuangge Ma, Jian Huang, Mingyu Shi, Yang Li, Ben-Chang Shia
DOI: https://doi.org/10.1093/bib/bbp070
IF: 9.5
2010-01-01
Briefings in Bioinformatics
Abstract: The development of high-throughput technologies makes it possible to survey the whole genome. Genomic studies have been conducted extensively, searching for markers with predictive power for the prognosis of complex diseases such as cancer, diabetes and obesity. Most existing statistical analyses focus on developing marker selection techniques, while little attention is paid to the underlying prognosis models. In this article, we review three commonly used prognosis models, namely the Cox, additive risk and accelerated failure time models. We conduct simulations and show that gene identification can be unsatisfactory under model misspecification. We analyze three cancer prognosis studies under the three models, and show that the gene identification results, the prediction performance of all identified genes combined, and the reproducibility of each identified gene are model-dependent. We suggest that in practical data analysis, more attention should be paid to the model assumptions, and multiple models may need to be considered.
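For orientation, the three models named in the abstract are commonly written in the following textbook forms. This is a sketch only: the notation (survival time T, covariate vector Z such as gene expressions, coefficient vector \beta, unspecified baseline hazard \lambda_0, error term \varepsilon with unspecified distribution) is assumed here for illustration and is not taken from the paper itself.

```latex
% Standard forms of the three semiparametric prognosis models.
% Symbols are illustrative assumptions, not the paper's own notation.
\begin{align*}
\text{Cox:} \quad
  & \lambda(t \mid Z) = \lambda_0(t)\,\exp(\beta^{\top} Z) \\
\text{Additive risk:} \quad
  & \lambda(t \mid Z) = \lambda_0(t) + \beta^{\top} Z \\
\text{Accelerated failure time:} \quad
  & \log T = \alpha + \beta^{\top} Z + \varepsilon
\end{align*}
```

Under these forms, covariates act multiplicatively on the hazard in the Cox model, additively on the hazard in the additive risk model, and directly on the log survival time in the accelerated failure time model, which is one way to see why a marker selected under one model need not be selected under another.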