Abstract:Passage retrieval is a fundamental task in many information systems, such as web search and question answering, where both efficiency and effectiveness are critical concerns. In recent years, neural retrievers based on pre-trained language models (PLM), such as dual-encoders, have achieved huge success. Yet, studies have found that the performance of dual-encoders are often limited due to the neglecting of the interaction information between queries and candidate passages. Therefore, various interaction paradigms have been proposed to improve the performance of vanilla dual-encoders. Particularly, recent state-of-the-art methods often introduce late-interaction during the model inference process. However, such late-interaction based methods usually bring extensive computation and storage cost on large corpus. Despite their effectiveness, the concern of efficiency and space footprint is still an important factor that limits the application of interaction-based neural retrieval models. To tackle this issue, we incorporate implicit interaction into dual-encoders, and propose I^3 retriever. In particular, our implicit interaction paradigm leverages generated pseudo-queries to simulate query-passage interaction, which jointly optimizes with query and passage encoders in an end-to-end manner. It can be fully pre-computed and cached, and its inference process only involves simple dot product operation of the query vector and passage vector, which makes it as efficient as the vanilla dual encoders. We conduct comprehensive experiments on MSMARCO and TREC2019 Deep Learning Datasets, demonstrating the I^3 retriever's superiority in terms of both effectiveness and efficiency. Moreover, the proposed implicit interaction is compatible with special pre-training and knowledge distillation for passage retrieval, which brings a new state-of-the-art performance.

A Neural Passage Model for Ad-hoc Document Retrieval.

DAPR: A Benchmark on Document-Aware Passage Retrieval

Reranking Passages with Coarse-to-Fine Neural Retriever Enhanced by List-Context Information

MRNN: A Multi-Resolution Neural Network with Duplex Attention for Document Retrieval in the Context of Question Answering

I^3 Retriever: Incorporating Implicit Interaction in Pre-trained Language Models for Passage Retrieval

A Multi-level Distillation based Dense Passage Retrieval Model

Modeling Diverse Relevance Patterns in Ad-hoc Retrieval

On Single and Multiple Representations in Dense Passage Retrieval

PAIR: Leveraging Passage-Centric Similarity Relation for Improving Dense Passage Retrieval.

A Neural Corpus Indexer for Document Retrieval

Neural Passage Quality Estimation for Static Pruning

Neural document expansion for ad-hoc information retrieval

Leveraging Semantic and Lexical Matching to Improve the Recall of Document Retrieval Systems: A Hybrid Approach

Investigating Passage-level Relevance and Its Role in Document-level Relevance Judgment

Dense Hierarchical Retrieval for Open-Domain Question Answering

Improving Language Estimation with the Paragraph Vector Model for Ad-Hoc Retrieval

A Neural Model for Joint Document and Snippet Ranking in Question Answering for Large Document Collections

Multi-Stage Conversational Passage Retrieval: An Approach to Fusing Term Importance Estimation and Neural Query Rewriting

A Method of Passage-Based Document Retrieval in Question Answering System

PARM: A Paragraph Aggregation Retrieval Model for Dense Document-to-Document Retrieval

Hybrid and Collaborative Passage Reranking.