A study based on distributed supervised machine learning system for text classification

Jingyi Xu,Duo Li,Shiwen Yu,Xue Bai
DOI: https://doi.org/10.1007/978-3-642-25781-0_111
2012-01-01
Abstract:Complex data confronts the centralized supervised machine learning system (CSMLS) with some embarrassments in performance, self-adaptability and scalability for text classification. In this paper, aiming to resolve these embarrassments, a novel distributed supervised machine learning system (DSMLS) model is proposed. Based on data distribution consistency, classifier performance, and evidence belief we fuse predicted information from these diverse classification agents in DSMLS. It is experimentally shown that DSMLS provides better performance than CSMLS. Compared with CSMLS, maximally DSMLS reduces 21.5% in training time and improves 8.4% in F1. © 2012 Springer-Verlag GmbH.
What problem does this paper attempt to address?