A Ga-Based Feature Selection and Ensemble Learning for High-Dimensional Datasets

Pei-Yong Xia,Xiang-Qian Ding,Bai-Ning Jiang
DOI: https://doi.org/10.1109/icmlc.2009.5212542
2009-01-01
Abstract:When dealing with high-dimensional datasets with fewer samples, feature selection and ensemble learning are two effective strategies. In this paper, we focus our attention on genetic based feature selection for ensemble learning. We use an improved GA algorithm (IGA) to reduce the dimensionality of the feature space, and then evaluate using Bagging and Ada-Boost constructed by the reduced features. Experimental results on several UCI datasets demonstrate that the improved GA-based feature selection algorithm (IGAFS) is often able to obtain a better feature subset when compared with the standard GA-based feature selection algorithm (SGAFS). Our experiments also indicate that ensemble learning using IGAFS is more accuracy than employing SGAFS and the whole feature space in general conditions.
What problem does this paper attempt to address?