Abstract: Dense retrieval models represent queries and documents with one or more fixed-width vectors and retrieve relevant documents via nearest neighbor search. These models have recently improved retrieval performance and drawn increasing attention from the IR community. Among the variety of dense retrieval models, those that represent each text with multiple vectors achieve state-of-the-art ranking performance. However, the multi-vector representation scheme imposes a much larger storage overhead than single-vector representation, which may hinder its application in practical scenarios. Vector compression methods such as Product Quantization (PQ) can reduce the storage cost and improve retrieval efficiency, but the gap between the original embeddings and the quantized vectors may degrade retrieval performance. Improved dense retrieval models such as JPQ have recently been proposed to reduce storage space while maintaining ranking effectiveness by jointly training the encoder and the PQ index, and they have achieved promising improvements in the single-vector dense retrieval scenario. We therefore introduce this joint optimization framework to tackle the storage overhead of multi-vector models. The key idea is to Jointly optimize Multi-vector representations with Product Quantization (JMPQ). JMPQ prevents effectiveness degradation by jointly optimizing the query encoding and index compression processes. We evaluate JMPQ on publicly available ad-hoc retrieval benchmarks. Extensive experimental results show that JMPQ substantially reduces the memory footprint while achieving ranking effectiveness on par with, or even better than, its uncompressed counterpart.
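To make the storage argument concrete, the following is a minimal sketch of plain Product Quantization applied to one document's multi-vector representation. It is not the JMPQ method: the codebooks are fit with ordinary k-means and nothing is trained jointly with an encoder. The function names (`train_pq`, `encode_pq`, `decode_pq`) and all sizes (200 vectors of 32 dimensions, 4 subspaces, 16 centroids) are assumptions chosen for illustration, not values from the paper.

```python
import numpy as np


def train_pq(vectors, n_subspaces=4, n_centroids=16, n_iters=10, seed=0):
    """Fit one k-means codebook per subspace (plain PQ, no joint training)."""
    rng = np.random.default_rng(seed)
    n, dim = vectors.shape
    assert dim % n_subspaces == 0, "dimension must split evenly into subspaces"
    assert n_centroids <= 256, "codes are stored as uint8 in this sketch"
    sub_dim = dim // n_subspaces
    codebooks = np.empty((n_subspaces, n_centroids, sub_dim), dtype=vectors.dtype)
    for m in range(n_subspaces):
        sub = vectors[:, m * sub_dim:(m + 1) * sub_dim]
        # Initialise centroids from randomly chosen data points.
        centroids = sub[rng.choice(n, n_centroids, replace=False)].copy()
        for _ in range(n_iters):
            dists = ((sub[:, None, :] - centroids[None, :, :]) ** 2).sum(-1)
            assign = dists.argmin(1)
            for k in range(n_centroids):
                members = sub[assign == k]
                if len(members) > 0:  # keep the old centroid if a cluster is empty
                    centroids[k] = members.mean(0)
        codebooks[m] = centroids
    return codebooks


def encode_pq(vectors, codebooks):
    """Replace each sub-vector with the index of its nearest centroid."""
    n_subspaces, _, sub_dim = codebooks.shape
    codes = np.empty((vectors.shape[0], n_subspaces), dtype=np.uint8)
    for m in range(n_subspaces):
        sub = vectors[:, m * sub_dim:(m + 1) * sub_dim]
        dists = ((sub[:, None, :] - codebooks[m][None, :, :]) ** 2).sum(-1)
        codes[:, m] = dists.argmin(1)
    return codes


def decode_pq(codes, codebooks):
    """Reconstruct approximate vectors by concatenating the selected centroids."""
    parts = [codebooks[m][codes[:, m]] for m in range(codebooks.shape[0])]
    return np.concatenate(parts, axis=1)


# Hypothetical multi-vector document: 200 token vectors of dimension 32.
rng = np.random.default_rng(0)
doc_vectors = rng.standard_normal((200, 32)).astype(np.float32)

codebooks = train_pq(doc_vectors)
codes = encode_pq(doc_vectors, codebooks)
reconstructed = decode_pq(codes, codebooks)

print("raw bytes:  ", doc_vectors.nbytes)
print("code bytes: ", codes.nbytes)
print("mean squared reconstruction error:",
      float(np.mean((doc_vectors - reconstructed) ** 2)))
```

Each 32-dimensional float32 vector is stored as 4 one-byte codes instead of 128 bytes, with the codebooks shared across all vectors. The reconstruction error printed at the end is the gap between original and quantized embeddings that the abstract identifies as the source of effectiveness loss.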

Understanding the Multi-vector Dense Retrieval Models

CAMVR: Context-Adaptive Multi-View Representation Learning for Dense Retrieval

ESPN: Memory-Efficient Multi-Vector Information Retrieval

Efficient Multi-Vector Dense Retrieval Using Bit Vectors

3D Model Retrieval with Multi-Granular Semantics Based on Gaussian Process Classifier

Dense Sparse Retrieval: Using Sparse Language Models for Inference Efficient Dense Retrieval

Feature Representation for 3D Object Retrieval Based on Unconstrained Multi-View

On Single and Multiple Representations in Dense Passage Retrieval

Sparse, Dense, and Attentional Representations for Text Retrieval

Interpreting Dense Retrieval as Mixture of Topics

A Multi-level Distillation based Dense Passage Retrieval Model

Learning To Retrieve: How to Train a Dense Retrieval Model Effectively and Efficiently

Generative Retrieval as Multi-Vector Dense Retrieval

Joint Optimization of Multi-vector Representation with Product Quantization

Model-enhanced Vector Index

An Object-Level Feature Representation Model for the Multi-target Retrieval of Remote Sensing Images

Learning Diverse Document Representations with Deep Query Interactions for Dense Retrieval

Evaluating Token-Level and Passage-Level Dense Retrieval Models for Math Information Retrieval

A New Document Retrieval Model Using Dempster-Shafer Theory of Evidence

Universal Vision-Language Dense Retrieval: Learning A Unified Representation Space for Multi-Modal Retrieval

MUVERA: Multi-Vector Retrieval via Fixed Dimensional Encodings