Adaptive and iterative gene selection based on least squares support vector regression

Jingqing Jiang, Chunguo Wu, Chuyi Song, Yanchun Liang
2006-01-01
Journal of Information and Computational Science
Abstract: Cancer classification and identification are major areas of medical research. DNA microarrays can provide useful information for cancer classification at the gene expression level. A microarray typically contains several thousand genes, while only several dozen training samples are available. In such cases most machine learning models suffer from overfitting, so it is necessary to select a handful of the most informative genes. This paper proposes an adaptive and iterative gene selection algorithm based on least squares support vector machines. The algorithm adopts a sequential forward selection search scheme, and the number of selected genes is determined adaptively. The total number of genes processed by the proposed algorithm is smaller than that processed by other algorithms using support vector machines. Numerical experiments show that the proposed algorithm trains quickly and achieves comparable performance on two well-known benchmark problems.
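The abstract names the ingredients (a least squares support vector model, sequential forward selection, and an adaptive stopping point) but not the details. The following is a minimal sketch of how those pieces can fit together, not the authors' algorithm. Everything specific in it is an assumption made for illustration: the linear kernel, the leave-one-out mean squared error used as the selection criterion, the stopping tolerance `tol`, the regularization value `gamma`, the function names, and the synthetic data in which gene 3 carries the class signal.

```python
import numpy as np

def lssvm_fit(X, y, gamma=10.0):
    """Fit a linear-kernel LS-SVM by solving its single linear system."""
    n = X.shape[0]
    K = X @ X.T                          # linear kernel matrix (assumption)
    A = np.zeros((n + 1, n + 1))
    A[0, 1:] = 1.0
    A[1:, 0] = 1.0
    A[1:, 1:] = K + np.eye(n) / gamma    # ridge term from the LS-SVM objective
    rhs = np.concatenate(([0.0], y))
    sol = np.linalg.solve(A, rhs)
    return sol[0], sol[1:]               # bias b, dual coefficients alpha

def lssvm_predict(X_train, alpha, b, X_new):
    """Evaluate f(x) = sum_i alpha_i <x, x_i> + b."""
    return X_new @ X_train.T @ alpha + b

def loo_mse(X, y, gamma=10.0):
    """Leave-one-out mean squared error, computed by explicit retraining."""
    n = X.shape[0]
    errors = []
    for i in range(n):
        keep = np.arange(n) != i
        b, alpha = lssvm_fit(X[keep], y[keep], gamma)
        pred = lssvm_predict(X[keep], alpha, b, X[i:i + 1])[0]
        errors.append((y[i] - pred) ** 2)
    return float(np.mean(errors))

def forward_select(X, y, max_genes=5, tol=0.01, gamma=10.0):
    """Add one gene per step; stop once the error improves by less than tol."""
    selected, best_err = [], np.inf
    remaining = list(range(X.shape[1]))
    while remaining and len(selected) < max_genes:
        scores = [(loo_mse(X[:, selected + [g]], y, gamma), g) for g in remaining]
        err, gene = min(scores)
        if best_err - err < tol:         # adaptive stop: no meaningful gain
            break
        selected.append(gene)
        remaining.remove(gene)
        best_err = err
    return selected, best_err

# Synthetic stand-in for microarray data: 30 samples, 20 genes,
# with only gene 3 related to the +1/-1 class label.
rng = np.random.default_rng(0)
n_samples, n_genes = 30, 20
y = np.where(np.arange(n_samples) % 2 == 0, 1.0, -1.0)
X = rng.normal(size=(n_samples, n_genes))
X[:, 3] = 2.0 * y + 0.1 * rng.normal(size=n_samples)

selected, err = forward_select(X, y)
print("selected genes:", selected, "LOO MSE:", err)
```

In this sketch the number of selected genes is not fixed in advance: the loop ends as soon as the best remaining candidate no longer lowers the error by at least `tol`. Training reduces to one linear solve per model, which is the usual reason least squares support vector models are cheap to refit inside a selection loop.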