Study on an Ensemble Classification Algorithm for Data Streams with Cloud Computing

钱琳,秦亮曦
DOI: https://doi.org/10.19304/j.cnki.issn1000-7180.2012.02.022
2012-01-01
Abstract:According to comprehensive analysis on data streams classification algorithms and the basic theory of cloud computing,it is proposed an ensemble classification algorithm for data streams running on Hadoop framework,and it takes MapReduce parallel programming model to improve traditional dynamic weight-based ensemble,finally speed up classification efficiency.Results show that the algorithm for high speed massive data stream has much better running efficiency than traditional ensemble algorithm.
What problem does this paper attempt to address?