Generative Retrieval as Multi-Vector Dense Retrieval

Shiguang Wu, Wenda Wei, Mengqi Zhang, Zhumin Chen, Jun Ma, Zhaochun Ren, Maarten de Rijke, Pengjie Ren
2024-03-31
Abstract: Generative retrieval generates identifiers of relevant documents in an end-to-end manner using a sequence-to-sequence architecture for a given query. The relation between generative retrieval and other retrieval methods, especially those based on matching within dense retrieval models, is not yet fully comprehended. Prior work has demonstrated that generative retrieval with atomic identifiers is equivalent to single-vector dense retrieval. Accordingly, generative retrieval exhibits behavior analogous to hierarchical search within a tree index in dense retrieval when using hierarchical semantic identifiers. However, prior work focuses solely on the retrieval stage without considering the deep interactions within the decoder of generative retrieval.
Information Retrieval, Artificial Intelligence
What problem does this paper attempt to address?
This paper examines the relationship between Generative Retrieval (GR) and Multi-Vector Dense Retrieval (MVDR). The authors argue that although the two methods have different architectures, they share the same framework for measuring the relevance of a document to a query. GR typically uses a sequence-to-sequence architecture to directly generate identifiers of relevant documents, while MVDR represents queries and documents with multiple vectors.

By analyzing the attention layer and prediction head of GR, the authors show that GR can be understood as a special case of MVDR: both compute relevance as a sum of products of query vectors, document vectors, and an alignment matrix. They then explore how GR applies this framework, using its own strategies for computing document token vectors and the alignment matrix. Experiments support these findings and show that both paradigms exhibit similar term-matching behavior in their alignment matrices.

The main contributions of this paper are:
1. Providing new insights into GR from the perspective of MVDR, showing that both use the same framework for measuring query-document relevance.
2. Exploring how GR applies this framework with different document encoding and alignment matrix strategies.
3. Conducting experiments that validate the conclusions and illustrate the commonalities and characteristics of term matching in the alignment matrices of both paradigms.

This work contributes to understanding the working mechanism of GR and provides a theoretical foundation for developing new retrieval models.
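To make the shared framework concrete, the relevance score can be read as rel(q, d) = sum over i, j of A[i, j] * <q_i, d_j>, where q_i are query token vectors, d_j are document token vectors, and A is an alignment matrix. The sketch below is illustrative only and is not the authors' code: the function names are hypothetical, the sparse alignment follows the well-known max-similarity rule used by late-interaction MVDR models, and the dense softmax alignment is merely an assumed stand-in for a learned, attention-like alignment such as the one the summary attributes to GR.

```python
import numpy as np


def unified_relevance(Q, D, A):
    """Relevance as sum_{i,j} A[i, j] * <Q[i], D[j]>.

    Q: (num_query_tokens, dim) query token vectors
    D: (num_doc_tokens, dim) document token vectors
    A: (num_query_tokens, num_doc_tokens) alignment matrix
    """
    similarities = Q @ D.T  # pairwise inner products
    return float(np.sum(A * similarities))


def maxsim_alignment(Q, D):
    """Sparse alignment: each query token aligns to its best-matching document token."""
    similarities = Q @ D.T
    A = np.zeros_like(similarities)
    A[np.arange(similarities.shape[0]), similarities.argmax(axis=1)] = 1.0
    return A


def softmax_alignment(Q, D):
    """Dense alignment (assumed stand-in): row-wise softmax over similarities."""
    similarities = Q @ D.T
    shifted = similarities - similarities.max(axis=1, keepdims=True)  # numerical stability
    weights = np.exp(shifted)
    return weights / weights.sum(axis=1, keepdims=True)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    Q = rng.normal(size=(3, 4))  # 3 query tokens, 4 dimensions
    D = rng.normal(size=(5, 4))  # 5 document tokens, 4 dimensions
    print("sparse (max-sim) score:", unified_relevance(Q, D, maxsim_alignment(Q, D)))
    print("dense (softmax) score: ", unified_relevance(Q, D, softmax_alignment(Q, D)))
```

Under these assumptions, the only thing that changes between the two scores is the alignment matrix A; the inner-product term is identical, which is the sense in which the two paradigms can be said to share one relevance framework.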