ExcluIR: Exclusionary Neural Information Retrieval

Wenhao Zhang,Mengqi Zhang,Shiguang Wu,Jiahuan Pei,Zhaochun Ren,Maarten de Rijke,Zhumin Chen,Pengjie Ren
2024-04-26
Abstract:Exclusion is an important and universal linguistic skill that humans use to express what they do not want. However, in information retrieval community, there is little research on exclusionary retrieval, where users express what they do not want in their queries. In this work, we investigate the scenario of exclusionary retrieval in document retrieval for the first time. We present ExcluIR, a set of resources for exclusionary retrieval, consisting of an evaluation benchmark and a training set for helping retrieval models to comprehend exclusionary queries. The evaluation benchmark includes 3,452 high-quality exclusionary queries, each of which has been manually annotated. The training set contains 70,293 exclusionary queries, each paired with a positive document and a negative document. We conduct detailed experiments and analyses, obtaining three main observations: (1) Existing retrieval models with different architectures struggle to effectively comprehend exclusionary queries; (2) Although integrating our training data can improve the performance of retrieval models on exclusionary retrieval, there still exists a gap compared to human performance; (3) Generative retrieval models have a natural advantage in handling exclusionary queries. To facilitate future research on exclusionary retrieval, we share the benchmark and evaluation scripts on \url{
Information Retrieval
What problem does this paper attempt to address?
This paper focuses on an underexplored area in information retrieval: exclusionary retrieval. In exclusionary retrieval, users explicitly state the information they do not want to see in their queries. For example, a user may ask, "Apart from 'Avengers: Endgame', what other science fiction movies were released in 2019?" If the retrieval system fails to understand this exclusionary requirement, it may return results that include content the user does not want to see. The main contributions of this paper are as follows: 1. Creation of a resource collection called ExcluIR, which includes an evaluation benchmark and a training set to help retrieval models understand and handle exclusionary queries. The benchmark consists of 3,452 manually annotated high-quality exclusionary queries, while the training set contains 70,293 exclusionary queries, each paired with positive and negative documents. 2. Investigation of the performance of existing retrieval methods with different architectures (e.g., sparse retrieval, dense retrieval, and generative retrieval) on exclusionary retrieval. It is found that these models struggle with understanding and handling exclusionary queries, particularly the insufficient understanding of the true intent behind exclusionary queries. 3. Discovery of the unique advantage of generative retrieval models in handling exclusionary queries, while later interaction models like ColBERT perform poorly in this aspect. 4. Fine-tuning retrieval models with ExcluIR's training data can improve their performance in exclusionary retrieval, but the results still fall short of human performance. Through experiments and analysis, the paper offers valuable insights for future research challenges in exclusionary retrieval, including how to improve models to better understand and handle exclusionary queries.