Preprocessing and Feature Preparation in Chinese Web Page Classification

Weitong Huang,Luxiong Xu,Yanmin Liu
DOI: https://doi.org/10.1109/iccet.2009.72
2009-01-01
Abstract:A detailed design and implementation of a Chinese web-page classification system is described in this paper, and some methods on Chinese web-page preprocessing and feature preparation are proposed. Experimental results on a Chinese web-page dataset show that methoss we designed can improve the performance from 75.82% to 81.88%.
What problem does this paper attempt to address?