Web Spam Taxonomy Via Spam Intention Analysis

余慧佳,刘奕群,张敏,马少平,茹立云
DOI: https://doi.org/10.3969/j.issn.1003-0077.2009.02.014
2009-01-01
Abstract: With the rapid development of the Internet, spam pages produced by web spamming have become prevalent and seriously impair both the retrieval efficiency of search engines and the user experience. Anti-spam has become one of the most important challenges for search engines. State-of-the-art anti-spam techniques usually use Web page features, either content-based or hyperlink-structure-based, to construct Web spam classifiers, which cannot deal with different spam techniques simultaneously. This paper proposes a different kind of Web spam taxonomy, based on spam intention analysis, to provide useful information for intent-based detection of spam pages.