Automatic Classification of Tibetan Web Pages

Guixian Xu,Chuncheng Xiang,Xu Gao,Xiaobing Zhao,Guosheng Yang
DOI: https://doi.org/10.1109/iccsee.2012.177
2012-01-01
Abstract:A classification approach for Tibetan web pages is introduced in this paper. It takes advantage of the class feature dictionary and Rocchio classification algorithm to classify the Tibetan web pages into the predefined classes rapidly and accurately. The experimental results present that the approach has better classification accuracy for Tibetan web pages classification. It is useful and helpful for the construction of the statistical and rule-based classification of Tibetan texts as well as construction of high-quality Tibetan corpus.
What problem does this paper attempt to address?