Gene Selection Based On Leave-One-Out Cross Validation And Least Squares Support Vector Machine

Jingqing Jiang,Yanchun Liang,Chunguo Wu
2007-01-01
Abstract:DNA microarrays could provide useful information for cancer classification. croarray is always several thothousands while the number of training samples is always less than one hundred. In such case over-fitting is a serious shortcoming of the machine learning models. It is necessary to select the most informative genes to construct the input vectors of machine learning models. A gene selection algorithm based on, leave-one-out cross validation and least squares support vector machines is presented in this paper. The proposed algorithm adopts sequential forward selection search scheme and can be used to determine adaptively the number of selected genes. Results of numerical experiments show that the proposed ooalgorithms effective and comparable performance is achieved on four well-known benchmark problems.
What problem does this paper attempt to address?