An Approach to Improve Text Classification Efficiency.

Shuigeng Zhou, Jihong Guan
DOI: https://doi.org/10.1007/3-540-45710-0_6
2002-01-01
Abstract: Text classification is becoming increasingly important with the rapid growth of available on-line information. In this paper, we propose an approach to speed up text classification by pruning the training corpus. An effective algorithm for text corpus pruning is designed. Experiments over a real-world text corpus are carried out, and the results validate the effectiveness and efficiency of the proposed approach. Our approach is especially suitable for on-line text classification applications.