A Novel Feature Ensemble Technology To Improve Prediction Performance Of Multiple Heterogeneous Phenotypes Based On Microarray Data

Hy Wang,Qp Zhang,Yd Wang,X Li,Sq Rao,Zq Ding
DOI: https://doi.org/10.1007/11540007_109
2005-01-01
Abstract:Gene expression microarray technology provides the global information on transcriptional activities of essentially all genes simultaneously, and it thus promotes the new application of traditional feature selection methods in the fields of molecular biology and life sciences. The basic strategy for the traditional feature selection methods is to seek for a single gene subset that leads to the best prediction of biological types, for example tumor versus normal tissues. Because of complexities and genetic heterogeneities of biological phenotypes (e.g. complex diseases), robust computational approaches are desirable to achieve high generalization performance with multiple classifiers and perturbations of the data structures. The purpose of this study is to develop an ensemble decision approach to analysis of multiple heterogeneous phenotypes. The results from an application to a lymphoma data of five subtypes indicate that the proposed analysis strategy is feasible and powerful to perform biological subtype.
What problem does this paper attempt to address?