Exploring and Exploiting Proximity Statistic for Information Retrieval Model.

Yadong Zhu, Yuanhai Xue, Jiafeng Guo, Yanyan Lan, Xueqi Cheng, Xiaoming Yu
DOI: https://doi.org/10.1007/978-3-642-35341-3_1
2012-01-01
Abstract: Proximity among query terms has been recognized as useful for boosting retrieval performance. However, how to model proximity effectively and efficiently remains a challenging research problem. In this paper, we propose a novel proximity statistic, namely Phrase Frequency, to model term proximity systematically. We then propose a new proximity-enhanced retrieval model, named BM25PF, that combines the phrase frequency information with the basic BM25 model to rank documents. Extensive experiments on four standard TREC collections illustrate the effectiveness of the BM25PF model and also show the significant influence of phrase frequency on retrieval performance. © Springer-Verlag 2012.