A Parallelizable Bayesian Ensemble Online Learning Algorithm

Ping GU,Qing-Sheng ZHU
DOI: https://doi.org/10.3969/j.issn.1002-137X.2006.04.045
2006-01-01
Computer Science
Abstract:In situation where data is being generated continuously,storing large amounts of data is impractical for Boosting and Bagging algorithms.In this paper,we present a parallelizable Bayesian ensemble online learning algorithm (BEPOL),which follow the method of reweighting in determining the training sets,but only require one pass through the entire sets.In order to improve both the correlation and time efficiency,parallel training based on negative correla- tion is used in BEPOL for training each individuals in the ensemble.With experiments conducted on several UCI data- sets,BEPOL are proven to perform comparably to the Adaboost in terms of classification performance and have the ad- vantage of lower running time for large datasets.This make BEPOL particularly suitable for some applications where large or continuous datasets are available.
What problem does this paper attempt to address?