Pooling Information Across Different Studies and Oligonucleotide Chip Types to Identify Prognostic Genes for Lung Cancer

Jeffrey S. Morris, Guosheng Yin, Keith Baggerly, Chunlei Wu, Li Zhang
DOI: https://doi.org/10.1007/0-387-23077-7_5
2005-01-01
Abstract: Our goal in this work was to pool information across microarray studies conducted at different institutions using two different versions of Affymetrix chips, in order to identify genes whose expression levels offer information on lung cancer patients' survival above and beyond the information provided by readily available clinical covariates. We combined information across chip types by identifying "matching probes" present on both chips, and then assembling them into new probesets based on UniGene clusters. This method yielded comparable expression level quantifications across chips without sacrificing much precision or significantly altering the relative ordering of the samples. We fit a series of multivariable Cox models containing clinical covariates and genes, and identified 26 genes that provided information on survival after adjusting for the clinical covariates, while controlling the false discovery rate at 0.20 using the Beta-Uniform mixture method. Many of these genes appeared to be biologically interesting and worthy of future investigation. Only one gene in our list has been mentioned in previously published analyses of these data. It appears that the increased statistical power provided by the pooling was key in finding these new genes, since only nine of the 26 genes were detected when we applied these methods to the two data sets separately, i.e., without pooling.
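The "matching probes" step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes each chip is represented as a mapping from probe sequence to a UniGene cluster ID, and that a probe is kept only when both chips assign it to the same cluster. The function name `build_matched_probesets`, the toy sequences, and the cluster IDs are all hypothetical.

```python
from collections import defaultdict


def build_matched_probesets(chip_a, chip_b):
    """Group probes shared by two chips into probesets keyed by cluster ID.

    chip_a, chip_b: dict mapping probe sequence -> cluster ID
    (a hypothetical representation chosen for this illustration).
    """
    # Probes whose sequence appears on both chip versions.
    shared = set(chip_a) & set(chip_b)

    probesets = defaultdict(list)
    for seq in sorted(shared):
        # Assumption: keep a probe only if both chips annotate it
        # to the same cluster.
        if chip_a[seq] == chip_b[seq]:
            probesets[chip_a[seq]].append(seq)
    return dict(probesets)


# Toy data: sequences and cluster IDs are made up for illustration.
chip_a = {
    "ACGTACGTAC": "Hs.1",
    "TTGACCGTAA": "Hs.1",
    "GGCATCGATC": "Hs.2",
    "CCATGGTACA": "Hs.3",  # present only on chip A
}
chip_b = {
    "ACGTACGTAC": "Hs.1",
    "TTGACCGTAA": "Hs.1",
    "GGCATCGATC": "Hs.2",
    "AAGTCCGTAT": "Hs.3",  # present only on chip B
}

matched = build_matched_probesets(chip_a, chip_b)
for cluster, probes in sorted(matched.items()):
    print(cluster, probes)
```

In this toy case, cluster Hs.3 drops out because neither of its probes is shared between the two chips. Expression for the remaining probesets would then be quantified from the shared probes only, which is what makes the values comparable across chip types.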