CAMVR: Context-Adaptive Multi-View Representation Learning for Dense Retrieval

zhilin liao,XIANGTING HOU,Dongfang Lou,Ningyu Zhang,Huajun Chen
DOI: https://doi.org/10.1109/IJCNN54540.2023.10192020
2023-01-01
Abstract:The recently proposed MVR (Multi-View Representation) model achieves remarkable performance in open-domain dense retrieval. In MVR, the document can match with multi-view queries by encoding the document into multiple representations. However, these representations tend to collapse into the same one when the percentage of documents answering multiple queries in training data is low. In this paper, we propose a CAMVR (Context-Adaptive Multi-View Representation) learning framework, which explicitly avoids the collapse problem by aligning each viewer token with different document snippets. In CAMVR, each viewer token is placed before each snippet to capture the local and global information with the consideration that answers of different view queries may scatter in one document. In addition, the view of the snippet containing the answer is used to explicitly supervise the learning process, from which the interpretability of view representation is provided. The extensive experiments show that CAMVR outperforms the existing models and achieves state-of-the-art results.
What problem does this paper attempt to address?