A Cluster-Based Intelligence Ensemble Learning Method for Classification Problems.

Shaoze Cui,Yanzhang Wang,Yunqiang Yin,T. C. E. Cheng,Dujuan Wang,Mingyu Zhai
DOI: https://doi.org/10.1016/j.ins.2021.01.061
IF: 8.1
2021-01-01
Information Sciences
Abstract:Classification is a vital task in machine learning. By learning patterns of samples of known categories, the model can develop the ability to distinguish the categories of samples of unknown categories. Noticing the advantages of the clustering method in cluster structure analysis, we combine the clustering and classification methods to develop the novel cluster-based intelligence ensemble learning (CIEL) method. We use the clustering method to analyze the inherent distribution of the data and divide all the samples into clusters according to the characteristics of the dataset. Then, for each specific cluster, we use differ-ent classification algorithms to establish the corresponding classification model. Finally, we integrate the prediction results of each base classifier to form the final prediction result. In view of the problem of parameter sensitivity, we use a swarm intelligence algorithm to optimize the key parameters involved in the clustering, classification, and ensemble stages in order to boost the classification performance. To assess the effectiveness of CIEL, we per -form tenfold cross-validation experiments on the 24 benchmark datasets provided by UCI and KEEL. Designed to improve the performance of the classifiers, CIEL outperforms other popular machine learning methods such as naive Bayes, k-nearest neighbors, random for -est, and support vector machine. (c) 2021 Elsevier Inc. All rights reserved.
What problem does this paper attempt to address?