RESEARCH ON APPLICATION OF SVM IN WEBSITE CLASSIFICATION

Xia Ye, Qian Songrong
DOI: https://doi.org/10.3969/j.issn.1000-386x.2012.11.057
2012-01-01
Abstract: This article proposes an algorithm for generating a website vector model, based on a study of applying SVM to the classification of Chinese websites into top-level categories. Starting from the source code of a website's homepage, the algorithm builds the vector model through code cleaning, content extraction, and word segmentation, feeds it to the SVM as input, and uses multi-class SVM classifiers to perform the classification. We implement the algorithm and test it on real website data. Analysis of the experimental results shows that the algorithm achieves fairly high classification accuracy and indicates that such a website classification system is feasible under requirements of both low traffic and low latency. The work also provides an experimental basis for further application of website classification in browsers and client software.
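The pipeline named in the abstract (code cleaning, content extraction, word segmentation, vector model, multi-class SVM) can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the abstract does not specify the segmenter, the feature weighting, or the multi-class scheme, so the regex tokenizer (a crude stand-in for a real Chinese word segmenter), the term-frequency weighting, the one-vs-rest scheme, the bias-free subgradient trainer, and all function names and toy pages below are hypothetical choices made for the example.

```python
import re
from collections import Counter
from html.parser import HTMLParser


class TextExtractor(HTMLParser):
    """Code cleaning + content extraction: keep visible text, drop script/style."""

    def __init__(self):
        super().__init__()
        self.skip_depth = 0
        self.chunks = []

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self.skip_depth += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self.skip_depth > 0:
            self.skip_depth -= 1

    def handle_data(self, data):
        if self.skip_depth == 0:
            self.chunks.append(data)


def extract_text(html):
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    return " ".join(parser.chunks)


def segment(text):
    # Crude stand-in for a real Chinese word segmenter (assumption):
    # Latin/digit runs become tokens, each CJK character is its own token.
    return re.findall(r"[a-z0-9]+|[\u4e00-\u9fff]", text.lower())


def build_vocab(token_lists):
    words = sorted({tok for toks in token_lists for tok in toks})
    return {tok: i for i, tok in enumerate(words)}


def vectorize(tokens, vocab):
    # Term-frequency vector over the vocabulary; unknown tokens are ignored.
    counts = Counter(t for t in tokens if t in vocab)
    total = sum(counts.values()) or 1
    vec = [0.0] * len(vocab)
    for tok, n in counts.items():
        vec[vocab[tok]] = n / total
    return vec


def dot(w, x):
    return sum(wj * xj for wj, xj in zip(w, x))


def train_binary_svm(X, y, epochs=50, lr=0.5, lam=0.01):
    # Linear SVM without a bias term, trained by subgradient descent on the
    # L2-regularised hinge loss; y holds +1 / -1 labels.
    w = [0.0] * len(X[0])
    for _ in range(epochs):
        for xi, yi in zip(X, y):
            violated = yi * dot(w, xi) < 1
            w = [wj * (1 - lr * lam) for wj in w]
            if violated:
                w = [wj + lr * yi * xj for wj, xj in zip(w, xi)]
    return w


def train_one_vs_rest(X, labels):
    # One binary SVM per class (assumed multi-class scheme).
    return {c: train_binary_svm(X, [1 if l == c else -1 for l in labels])
            for c in sorted(set(labels))}


def classify(models, x):
    # Pick the class whose SVM gives the largest decision value.
    return max(models, key=lambda c: dot(models[c], x))


# Toy homepages (made up for illustration), one per top-level class.
pages = [
    ("<html><head><script>var a=1;</script></head>"
     "<body>news headlines report</body></html>", "news"),
    ("<body><style>p{color:red}</style>shop cart price discount</body>", "shopping"),
    ("<body>football match score league</body>", "sports"),
]
token_lists = [segment(extract_text(html)) for html, _ in pages]
vocab = build_vocab(token_lists)
X = [vectorize(toks, vocab) for toks in token_lists]
models = train_one_vs_rest(X, [label for _, label in pages])
print(classify(models, vectorize(segment("football score"), vocab)))  # prints: sports
```

Because only the homepage is processed and the classifier is linear, prediction costs one download and a handful of dot products, which is in keeping with the low-traffic, low-latency setting the abstract mentions. In practice a library SVM and a proper Chinese segmenter would replace the toy trainer and tokenizer used here.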