Fighting webspam: detecting spam on the graph via content and link features

Yu-Jiu Yang, Shuang-Hong Yang, Bao-Gang Hu
DOI: https://doi.org/10.1007/978-3-540-68125-0_112
2008-01-01
Abstract: We present a novel semi-supervised learning strategy for the Web spam problem. The proposed approach explores graph construction, which is the key to representing the semantic relationships among data, and emphasizes label propagation from multiple views under a consistency criterion. Furthermore, we infer labels for the remaining unlabeled nodes in a fused spectral space. Experiments on the Web Spam Challenge dataset validate the efficiency and effectiveness of the proposed method.
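
As a rough illustration of graph-based label propagation over more than one view, the sketch below averages the normalized affinities of a content graph and a link graph and then runs a standard consistency-style iteration, F <- alpha * S * F + (1 - alpha) * Y. This is not the authors' algorithm: the abstract gives no formulas, so the averaging rule, the toy graphs, the edge weights, and the names `affinity`, `normalize_affinity`, and `propagate_labels` are all assumptions made for illustration, and the spectral-space fusion step is not shown.

```python
import numpy as np


def affinity(n, edges):
    """Build a symmetric n x n affinity matrix from (i, j, weight) triples."""
    W = np.zeros((n, n))
    for i, j, w in edges:
        W[i, j] = W[j, i] = w
    return W


def normalize_affinity(W):
    """Symmetric normalization S = D^{-1/2} W D^{-1/2}; isolated nodes get zeros."""
    d = W.sum(axis=1)
    safe = np.where(d > 0, d, 1.0)
    d_inv_sqrt = np.where(d > 0, 1.0 / np.sqrt(safe), 0.0)
    return W * d_inv_sqrt[:, None] * d_inv_sqrt[None, :]


def propagate_labels(views, y, alpha=0.9, n_iter=200):
    """Propagate labels over the average of several normalized views.

    views: list of symmetric, non-negative affinity matrices (one per view).
    y: +1 for labeled spam, -1 for labeled normal, 0 for unlabeled.
    Averaging the views is an assumption of this sketch, not the paper's rule.
    """
    S = sum(normalize_affinity(W) for W in views) / len(views)
    Y = np.asarray(y, dtype=float)
    F = Y.copy()
    for _ in range(n_iter):
        F = alpha * (S @ F) + (1 - alpha) * Y
    return F


# Hypothetical toy data: six hosts forming two groups, {0, 1, 2} and {3, 4, 5}.
content_view = affinity(6, [(0, 1, 1.0), (1, 2, 1.0), (0, 2, 1.0),
                            (3, 4, 1.0), (4, 5, 1.0), (3, 5, 1.0),
                            (2, 3, 0.1)])  # weak cross-group similarity
link_view = affinity(6, [(0, 1, 1.0), (1, 2, 1.0), (3, 4, 1.0), (4, 5, 1.0)])

labels = [1, 0, 0, 0, 0, -1]  # host 0 labeled spam, host 5 labeled normal
scores = propagate_labels([content_view, link_view], labels)
predicted = np.sign(scores)  # positive score: spam, negative score: normal
```

In this toy setting the single spam label spreads to the other hosts in its group and the single normal label spreads to the other group, which is the behaviour a consistency criterion is meant to encourage: nearby nodes on the graph receive similar labels.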