Chinese Web Retrieval Test Collections: Construction, Analysis and Application

LI Jing-jing, YAN Hong-fei
DOI: https://doi.org/10.3969/j.issn.1003-0077.2008.01.005
2008-01-01
Abstract: With the rapid development of the World Wide Web, Web information retrieval (IR) has become a hot research topic, but research has been restricted by the lack of appropriate test collections. Following the framework of existing foreign test collections, we constructed the large-scale Chinese Web Test collections (CWT) and organized the SEWM Chinese Web search evaluation. Based on an investigation and analysis of current research, we describe the details of constructing each component and carry out statistical analysis and experiments. The methodology used in engineering CWT should be readily applicable to the construction of future Web corpora.