An Automatic Gene Selection Algorithm for Genome-Wide Association Analysis

LIU Qiao,WANG Juan,CHEN Wei,QIN Zhi-guang
2010-01-01
Abstract:Objective To study automatic gene selection algorithm adaptable for genome-wide association analysis. Methods The proposed L20 algorithm was defined as an optimization of the ridge regression problem which was based on the restraint of stochastic complexity of the gene models. A simple and effective derived solution to the optimized problem was also provided in this paper. Results Five binary diseases classification problems derived from a clinical diabetes data set and four publicly available microarray data sets were examined to verify the performance of the proposed algorithm. The proposed algorithm was not restricted by the size of the features and the samples, and its output was stable and accurate. Besides, it was computationally efficient since the model parameters could be decided through the feature selection process.Conclusion Numerical results also show that the L20 method outperforms many other conventional methods, which makes it a promising solution for tumor biomarker identification.
What problem does this paper attempt to address?