Web Page Topic Recognition Algorithm based on Ensemble Learning

Ge Dongmou, Zhang Gang, Li Qian
DOI: https://doi.org/10.3969/j.issn.1000-8519.2013.19.023
2013-01-01
Abstract: Automatic topic recognition across large numbers of web pages is an important research subfield of web-based information analysis and mining, with both theoretical and practical significance. This paper proposes a web page topic recognition framework based on ensemble learning. It constructs individual base learners on heterogeneous feature sets in the maximum-margin sense, and applies ensemble learning to combine the results of the individual learners. The proposed algorithm is evaluated on a benchmark dataset, and the results illustrate its effectiveness.
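The abstract does not specify the base learner, the feature sets, or the combination rule, so the following is only a minimal sketch of the general idea under stated assumptions: one linear maximum-margin classifier (trained by subgradient descent on the regularized hinge loss) per feature view, combined by unweighted majority vote. The view names ("text", "links", "structure"), the toy data, the binary on-topic/off-topic labels, and all function names are hypothetical and are not taken from the paper.

```python
from collections import Counter


def train_linear_svm(X, y, epochs=200, lr=0.1, lam=0.01):
    """Train a binary linear classifier on the L2-regularized hinge loss.

    X is a list of feature vectors, y a list of labels in {-1, +1}.
    Plain subgradient descent; a stand-in for any maximum-margin learner.
    """
    w = [0.0] * len(X[0])
    b = 0.0
    for _ in range(epochs):
        for xi, yi in zip(X, y):
            margin = yi * (sum(wj * xj for wj, xj in zip(w, xi)) + b)
            # Regularization step: shrink the weights slightly.
            w = [wj * (1.0 - lr * lam) for wj in w]
            # Hinge-loss step: update only when the margin is violated.
            if margin < 1.0:
                w = [wj + lr * yi * xj for wj, xj in zip(w, xi)]
                b += lr * yi
    return w, b


def predict(model, x):
    """Return +1 (on-topic) or -1 (off-topic) for one feature vector."""
    w, b = model
    score = sum(wj * xj for wj, xj in zip(w, x)) + b
    return 1 if score >= 0 else -1


def ensemble_predict(models, page):
    """Combine the per-view predictions by unweighted majority vote."""
    votes = [predict(models[view], page[view]) for view in models]
    return Counter(votes).most_common(1)[0][0]


# Hypothetical toy data: the same four pages described by three
# heterogeneous feature views (assumed names, not from the paper).
labels = [1, 1, -1, -1]
training_views = {
    "text": [[2, 0], [3, 1], [0, 2], [1, 3]],
    "links": [[1, 0], [2, 0], [0, 1], [0, 2]],
    "structure": [[2, 1], [3, 0], [1, 3], [0, 2]],
}

# One base learner per feature view.
models = {view: train_linear_svm(X, labels) for view, X in training_views.items()}

# A new page whose link features look off-topic while the other views agree.
page = {"text": [4, 0], "links": [0, 3], "structure": [4, 0]}
print(ensemble_predict(models, page))
```

In this toy setup the link-based learner is outvoted by the other two views, which is the kind of error correction an ensemble over heterogeneous views is meant to provide. A weighted vote or a stacked meta-learner would be equally plausible readings of the abstract.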