Stage II Colon Cancer Prognosis Prediction by Tumor Gene Expression Profiling

Alain Barrier,Pierre-Yves Boelle,François Roser,Jennifer Gregg,Chantal Tse,Didier Brault,François Lacaine,Sidney Houry,Michel Huguier,Brigitte Franc,Antoine Flahault,Antoinette Lemoine,Sandrine Dudoit
DOI: https://doi.org/10.1200/jco.2005.05.0229
IF: 45.3
2006-10-10
Journal of Clinical Oncology
Abstract:Purpose This study mainly aimed to identify and assess the performance of a microarray-based prognosis predictor (PP) for stage II colon cancer. A previously suggested 23-gene prognosis signature (PS) was also evaluated. Patients and Methods Tumor mRNA samples from 50 patients were profiled using oligonucleotide microarrays. PPs were built and assessed by random divisions of patients into training and validation sets (TSs and VSs, respectively). For each TS/VS split, a 30-gene PP, identified on the TS by selecting the 30 most differentially expressed genes and applying diagonal linear discriminant analysis, was used to predict the prognoses of VS patients. Two schemes were considered: single-split validation, based on a single random split of patients into two groups of equal size (group 1 and group 2), and Monte Carlo cross validation (MCCV), whereby patients were repeatedly and randomly divided into TS and VS of various sizes. Results The 30-gene PP, identified from group 1 patients, yielded an 80% prognosis prediction accuracy on group 2 patients. MCCV yielded the following average prognosis prediction performance measures: 76.3% accuracy, 85.1% sensitivity, and 67.5% specificity. Improvements in prognosis prediction were observed with increasing TS size. The 30-gene PS were found to be highly-variable across TS/VS splits. Assessed on the same random splits of patients, the previously suggested 23-gene PS yielded a 67.7% mean prognosis prediction accuracy. Conclusion Microarray gene expression profiling is able to predict the prognosis of stage II colon cancer patients. The present study also illustrates the usefulness of resampling techniques for honest performance assessment of microarray-based PPs.
oncology
What problem does this paper attempt to address?