QuadrupletBERT: an Efficient Model for Embedding-Based Large-Scale Retrieval

Peiyang Liu, Sen Wang, Xi Wang, Wei Ye, Shikun Zhang
DOI: https://doi.org/10.18653/v1/2021.naacl-main.292
2021-01-01
Abstract: The embedding-based large-scale query-document retrieval problem is a hot topic in the information retrieval (IR) field. Considering that pre-trained language models like BERT have achieved great success in a wide variety of NLP tasks, we present a QuadrupletBERT model for effective and efficient retrieval in this paper. Unlike most existing BERT-style retrieval models, which only focus on the ranking phase in retrieval systems, our model makes considerable improvements to the retrieval phase and leverages the distances between simple negative and hard negative instances to obtain better embeddings. Experimental results demonstrate that our QuadrupletBERT achieves state-of-the-art results in embedding-based large-scale retrieval tasks.