Genetic Algorithm-Partial Least Squares Algorithm As the Feature Selection Method for Proteomics Data of Ovarian Cancer

PAN Yi,ZHENG Bo,XIANG Jie,WEN Zhi-ning,DIAO Yuan-bo,LI Meng-long
DOI: https://doi.org/10.3969/j.issn.0490-6756.2007.04.030
2007-01-01
Abstract:Statistics method of two-side t-test combined with a new feature selection method,genetic algorithm-partial least squares algorithm,are used in this paper for the feature extraction for SELDI-TOF MS ovarian cancer data.4 m/z values are obtained from the original 15154 m/z values and the support vector machines(SVM) classifier works well based on these 4 m/z values.Both 3-fold cross validation and leave-one-out cross validation are used for checking the pattern's stability.The result of leave-one-out cross validation is 95.26%.The results indicated that genetic algorithm-partial least squares algorithm is an efficient feature extraction method for proteomics data and potential ovarian cancer biomarkers may exist in the 4 m/z values selected in this paper.
What problem does this paper attempt to address?