Web Mining Based On Vips In Intention-Based Information Retrieval

Qiang Zhang,Xiaoxiao Jiang,Jiashen Sun
DOI: https://doi.org/10.1109/NLPKE.2009.5313791
2009-01-01
Abstract:This paper introduces a VIPS (Vision-based Page Segmentation) based web mining method which aims to user intents based retrieval. It firstly grasps information from web by making use of large search engines such as Baidu and so on, and then clusters the web pages basing on the intention-related features of web text. The main algorithm is described in detail and experiments are designed to grasp the query in Chinese from Baidu and Ask search engines. The results prove that the VIPS based method can achieve significant improvement comparing with some previous work.
What problem does this paper attempt to address?