To Select or to Weigh: A Comparative Study of Linear Combination Schemes for SuperParent-One-Dependence Estimators

Ying Yang, Geoffrey I. Webb, Jesus Cerquides, Kevin B. Korb, Janice Boughton, Kai Ming Ting
DOI: https://doi.org/10.1109/tkde.2007.190650
IF: 9.235
2007-01-01
IEEE Transactions on Knowledge and Data Engineering
Abstract: We conduct a large-scale comparative study on linearly combining superparent-one-dependence estimators (SPODEs), a popular family of seminaive Bayesian classifiers. Altogether, 16 model selection and weighing schemes, 58 benchmark data sets, and various statistical tests are employed. This paper's main contributions are threefold. First, it formally presents each scheme's definition, rationale, and time complexity and hence can serve as a comprehensive reference for researchers interested in ensemble learning. Second, it offers bias-variance analysis for each scheme's classification error performance. Third, it identifies effective schemes that meet various needs in practice. This leads to accurate and fast classification algorithms which have an immediate and significant impact on real-world applications. Another important feature of our study is using a variety of statistical tests to evaluate multiple learning methods across multiple data sets.