A Hybrid Ensemble Algorithm Combining AdaBoost and Genetic Algorithm for Cancer Classification with Gene Expression Data.
Huijuan Lu,Huiyun Gao,Minchao Ye,Xiuhui Wang
DOI: https://doi.org/10.1109/tcbb.2019.2952102
2019-01-01
IEEE/ACM Transactions on Computational Biology and Bioinformatics
Abstract:The diversity of base classifiers and integration of multiple classifiers are two key issues in the field of ensemble learning. This paper puts forward a hybrid ensemble algorithm combining AdaBoost and genetic algorithm(GA) for cancer classification with gene expression data. The decision group is designed to increase the diversity of base classifier pool, and the GA is used to assign weight to each base classifier, thus to improve the classification performance by avoiding local extrema. The decision groups composed by using base classifiers, including K-nearest neighbor (KNN), Naïve Bayes (NB), and Decision Tree (C4.5). Experimental results show that the proposed algorithm is superior to those existing ensemble learning methods, such as Bagging, Random Forest (RF), Rotation Forest (RoF), AdaBoost, AdaBoost-BPNN, AdaBoost-SVM, and AdaBoost-RF, especially it has better performance on small samples and unbalanced gene expression data processing.