Construction of Structural Diversity of Ensemble Learning Based on Classification Coding

Yang Suting, Zhang Ning
DOI: https://doi.org/10.1109/itaic49862.2020.9338807
2020-01-01
Abstract: The diversity of base learners is an important factor affecting the generalization accuracy of ensemble learning, and constructing structural diversity is one of the main ways to obtain it. The classification coding diversity (CCD) method is proposed to address two shortcomings of traditional structural diversity methods: inaccurate diversity measurement and incomplete use of heterogeneous base learners. The method constructs a three-valued classification code for each base learner according to its performance on each data block partition; the absolute difference between codes measures pairwise diversity, and a global diversity measure is used to find the optimal solution. A greedy algorithm then completes the selective ensemble of base learners. Compared with the tree structure matching (TMD) algorithm, CCD offers better applicability, interpretability, and computational performance; compared with the ordinary Bagging algorithm, it achieves higher generalization accuracy, especially on binary classification problems.
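The pipeline described in the abstract (three-valued coding per data block, absolute-difference pairwise diversity, a global measure, greedy selection) can be illustrated with a minimal sketch. The abstract does not give the coding rule, the thresholds, or the exact global measure, so everything concrete here is an assumption made for illustration: the codes -1/0/1, the accuracy thresholds `low` and `high`, the use of mean pairwise diversity as the global measure, and the function names `encode`, `pairwise_diversity`, `global_diversity`, and `greedy_select`. It should not be read as the authors' implementation.

```python
from itertools import combinations


def encode(block_accuracies, low=0.5, high=0.7):
    """Map a learner's per-block accuracies to three-valued codes.

    Assumed rule (not from the paper): 1 if accuracy >= high,
    -1 if accuracy < low, otherwise 0.
    """
    return [1 if a >= high else (-1 if a < low else 0) for a in block_accuracies]


def pairwise_diversity(code_a, code_b):
    """Sum of absolute differences between two learners' codes."""
    return sum(abs(x - y) for x, y in zip(code_a, code_b))


def global_diversity(codes, members):
    """Assumed global measure: mean pairwise diversity over the subset."""
    pairs = list(combinations(members, 2))
    if not pairs:
        return 0.0
    total = sum(pairwise_diversity(codes[i], codes[j]) for i, j in pairs)
    return total / len(pairs)


def greedy_select(codes, k):
    """Greedily pick k learners (k >= 2) that keep global diversity high.

    Seeds with the most diverse pair, then repeatedly adds the learner
    that maximizes the global diversity of the enlarged subset.
    """
    n = len(codes)
    seed = max(
        combinations(range(n), 2),
        key=lambda p: pairwise_diversity(codes[p[0]], codes[p[1]]),
    )
    selected = list(seed)
    while len(selected) < min(k, n):
        remaining = [i for i in range(n) if i not in selected]
        best = max(remaining, key=lambda i: global_diversity(codes, selected + [i]))
        selected.append(best)
    return selected


# Hypothetical per-block accuracies for four base learners on three data blocks.
accuracies = [
    [0.80, 0.75, 0.60],
    [0.90, 0.55, 0.65],
    [0.40, 0.45, 0.60],
    [0.72, 0.30, 0.45],
]
codes = [encode(a) for a in accuracies]
print(codes)                    # [[1, 1, 0], [1, 0, 0], [-1, -1, 0], [1, -1, -1]]
print(greedy_select(codes, 3))  # [0, 2, 3]
```

In this toy data, learners 0 and 2 disagree most (diversity 4), so they seed the ensemble; learner 3 is then preferred over learner 1 because it raises the mean pairwise diversity to 10/3 rather than 8/3. Because the codes depend only on per-block performance, the same comparison applies to heterogeneous base learners, which is consistent with the applicability advantage over tree structure matching claimed in the abstract.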