Efficient Learning Ensemble SuperParent-one-dependence Estimator by Maximizing Conditional Log Likelihood

Xiaolin Zheng, Zhen Lin, Huan Xu, Chaochao Chen, Ting Ye
DOI: https://doi.org/10.1016/j.eswa.2015.05.051
IF: 8.5
2015-01-01
Expert Systems with Applications
Abstract: The ensemble of SuperParent one-dependence estimators (SPODEs) is one of the most effective improved algorithms. It achieves high classification accuracy while decreasing variance. However, most existing approaches focus only on improving the performance of individual SPODEs in the selection and weighting procedures and overlook the importance of the entire ensemble model. Based on the assumption that optimizing the performance of the entire ensemble classifier yields a better weight distribution than applying a greedy strategy inside each SPODE, we propose an ensemble SPODE algorithm that maximizes conditional log likelihood (EODE-CLL). First, we choose maximum conditional probability as the global optimization objective, which helps avoid the over-fitting problem compared with least squares error. Second, the algorithm assigns hierarchical weights to the SPODEs and to the attributes inside each SPODE. The second weight layer helps fully optimize each local SPODE model. Finally, stochastic gradient descent is used to search for the best parameters. It scales well and has given rise to batch and distributed versions. Compared with existing ensemble SPODEs, our proposed model achieves more accurate and robust classification results while showing better time complexity. We conduct experiments on a public benchmark containing 36 datasets. The results show that EODE-CLL significantly outperforms state-of-the-art ensemble SPODE methods in terms of accuracy, F-measure, bias, and variance. © 2015 Elsevier Ltd. All rights reserved.
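The three steps named in the abstract (a conditional-likelihood objective, two layers of weights, and stochastic gradient updates) can be illustrated with a minimal sketch. The abstract does not give the model's formula, so everything here is an assumption rather than the authors' method: the sketch combines SPODEs log-linearly, with one weight `w[i]` per SPODE and one weight `v[i][j]` per child attribute inside SPODE `i`, and takes one gradient-ascent step on log P(y | x) per training example. All function names are hypothetical, attributes are assumed discrete with the same number of values (`n_values`), and probabilities use Laplace smoothing.

```python
import math
import random
from collections import Counter


def fit_counts(X, y):
    """Count the co-occurrences that each SPODE's probability tables need."""
    joint = Counter()  # (i, x_i, c)         -> count, for P(c, x_i)
    pair = Counter()   # (i, x_i, j, x_j, c) -> count, for P(x_j | c, x_i)
    for row, c in zip(X, y):
        for i, xi in enumerate(row):
            joint[(i, xi, c)] += 1
            for j, xj in enumerate(row):
                if j != i:
                    pair[(i, xi, j, xj, c)] += 1
    return joint, pair


def log_terms(row, c, joint, pair, n, n_classes, n_values):
    """Laplace-smoothed log P(c, x_i) and log P(x_j | c, x_i) per super-parent i."""
    d = len(row)
    base, cond = [], []
    for i, xi in enumerate(row):
        n_ic = joint[(i, xi, c)]
        base.append(math.log((n_ic + 1.0) / (n + n_classes * n_values)))
        cond.append([
            0.0 if j == i
            else math.log((pair[(i, xi, j, row[j], c)] + 1.0) / (n_ic + n_values))
            for j in range(d)
        ])
    return base, cond


def posterior(row, classes, w, v, stats):
    """P(c | x) as a softmax over the weighted sum of SPODE log-scores (assumed form)."""
    d = len(row)
    terms = {c: log_terms(row, c, *stats) for c in classes}
    scores = {
        c: sum(w[i] * (terms[c][0][i]
                       + sum(v[i][j] * terms[c][1][i][j] for j in range(d)))
               for i in range(d))
        for c in classes
    }
    m = max(scores.values())  # subtract the max for numerical stability
    z = sum(math.exp(s - m) for s in scores.values())
    return {c: math.exp(scores[c] - m) / z for c in classes}, terms


def cll(X, y, classes, w, v, stats):
    """Conditional log likelihood: sum of log P(y_n | x_n) over the data."""
    return sum(math.log(posterior(r, classes, w, v, stats)[0][c])
               for r, c in zip(X, y))


def train(X, y, n_values, epochs=20, lr=0.05, seed=0):
    """Stochastic gradient ascent on the conditional log likelihood."""
    classes = sorted(set(y))
    d = len(X[0])
    joint, pair = fit_counts(X, y)
    stats = (joint, pair, len(X), len(classes), n_values)
    w = [1.0 / d] * d                  # first layer: one weight per SPODE
    v = [[1.0] * d for _ in range(d)]  # second layer: one weight per attribute in a SPODE
    rng = random.Random(seed)
    order = list(range(len(X)))
    for _ in range(epochs):
        rng.shuffle(order)
        for n in order:
            probs, terms = posterior(X[n], classes, w, v, stats)
            # d log P(y|x) / d score_c = 1[c == y] - P(c | x)
            coef = {c: (1.0 if c == y[n] else 0.0) - probs[c] for c in classes}
            for i in range(d):
                gv = [sum(coef[c] * terms[c][1][i][j] for c in classes)
                      for j in range(d)]
                gw = sum(coef[c] * (terms[c][0][i]
                                    + sum(v[i][j] * terms[c][1][i][j] for j in range(d)))
                         for c in classes)
                for j in range(d):
                    v[i][j] += lr * w[i] * gv[j]
                w[i] += lr * gw
    return classes, w, v, stats


def predict(row, classes, w, v, stats):
    """Return the class with the highest posterior probability."""
    probs, _ = posterior(row, classes, w, v, stats)
    return max(classes, key=lambda c: probs[c])
```

Calling `train(X, y, n_values=2)` on a list of binary attribute rows `X` with labels `y` returns the class list, both weight layers, and the count statistics, which `predict` and `cll` then take as arguments. Because each update touches only one example, the same gradient could in principle be averaged over mini-batches, which is one plausible reading of the batch variant mentioned in the abstract.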