Probability Based Voting Extreme Learning Machine for Multiclass XML Documents Classification

Xiangguo Zhao, Xin Bi, Baiyou Qiao
DOI: https://doi.org/10.1007/s11280-013-0230-8
2013-01-01
World Wide Web
Abstract: This paper presents a novel solution based on Extreme Learning Machine (ELM) for multiclass XML document classification. ELM is a generalized Single-hidden Layer Feedforward Network (SLFN) with extremely fast learning capacity. An improved vector model, DSVM (Distribution based Structured Vector Model), is proposed to represent XML documents with more structural information and more precise semantic information. The XML document classifiers are built on PV-ELM (Probability based Voting ELM), with a postprocessing method, ε-RCC (ε-Revoting of Confusing Classes), that refines the voting results. To evaluate the overall performance of this solution, a series of experiments is conducted on two real datasets of online news feeds. The experimental results show that DSVM represents the XML documents more effectively and that PV-ELM with ε-RCC achieves higher accuracy than the original ELM algorithm for multiclass classification.
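For readers unfamiliar with ELM, the following is a minimal sketch of a basic ELM classifier and of a generic probability-averaging vote over several ELMs. It is not the paper's PV-ELM, DSVM, or ε-RCC, whose details the abstract does not give. The function names, the softmax conversion of ELM outputs to probabilities, the hidden-layer size, and the toy data are all assumptions made for illustration.

```python
import numpy as np

def _hidden(X, W, b):
    # Sigmoid activations of the randomly weighted hidden layer.
    return 1.0 / (1.0 + np.exp(-(X @ W + b)))

def train_elm(X, y, n_classes, n_hidden=20, seed=0):
    """Basic ELM: random, fixed hidden weights; output weights by least squares."""
    rng = np.random.default_rng(seed)
    W = rng.normal(size=(X.shape[1], n_hidden))  # input-to-hidden weights, never trained
    b = rng.normal(size=n_hidden)                # hidden biases, never trained
    H = _hidden(X, W, b)
    T = np.eye(n_classes)[y]                     # one-hot target matrix
    beta = np.linalg.pinv(H) @ T                 # Moore-Penrose pseudoinverse solution
    return W, b, beta

def elm_scores(model, X):
    W, b, beta = model
    return _hidden(X, W, b) @ beta

def softmax(Z):
    # Assumption: softmax is used here only to turn raw ELM outputs into
    # probability-like values; the paper may define probabilities differently.
    Z = Z - Z.max(axis=1, keepdims=True)
    E = np.exp(Z)
    return E / E.sum(axis=1, keepdims=True)

def vote(models, X):
    """Average per-class probabilities over several ELMs, then pick the largest."""
    probs = sum(softmax(elm_scores(m, X)) for m in models) / len(models)
    return probs.argmax(axis=1), probs

def make_toy_data(n_per_class=30, seed=42):
    # Hypothetical stand-in for document vectors: three well-separated 2-D blobs.
    rng = np.random.default_rng(seed)
    centers = np.array([[3.0, 0.0], [-3.0, 0.0], [0.0, 3.0]])
    X = np.vstack([c + 0.5 * rng.normal(size=(n_per_class, 2)) for c in centers])
    y = np.repeat(np.arange(3), n_per_class)
    return X, y
```

A typical use would train several ELMs that differ only in their random seed, for example `models = [train_elm(X, y, 3, seed=s) for s in range(5)]`, and then call `vote(models, X)`. Because the hidden weights are random and only the output weights are solved for, each model trains with a single pseudoinverse computation, which is the source of the fast learning the abstract mentions.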