A Deep Top-K Relevance Matching Model for Ad-hoc Retrieval

Zhou Yang, Qingfeng Lan, Jiafeng Guo, Yixing Fan, Xiaofei Zhu, Yanyan Lan, Yue Wang, Xueqi Cheng
DOI: https://doi.org/10.1007/978-3-030-01012-6_2
2018-01-01
Abstract: In this paper, we propose a novel model named DTMM, which is specifically designed for ad-hoc retrieval. Given a query and a document, DTMM first builds a word-level interaction matrix based on the word embeddings of the query and the document. At the same time, we compress the embeddings of both document words and query words into a small dimension to learn the importance of each word. Specifically, the compressed query word embedding is projected into the term gating network, and the compressed document word embedding is concatenated into the interaction matrix. Then, we apply a top-k pooling layer (i.e., ordered k-max pooling) to the interaction matrix to obtain the essential top relevance signals. The top relevance signals are associated with each query term and projected into a multi-layer perceptron neural network to obtain the query term level matching score. Finally, the query term level matching scores are aggregated with the term gating network to produce the final relevance score. We have tested our model on two representative benchmark datasets. Experimental results show that our model can significantly outperform existing baseline models.
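The scoring pipeline described in the abstract can be illustrated with a minimal pure-Python sketch. This is not the authors' implementation. It assumes cosine similarity for the interaction matrix, a single tanh hidden layer for the perceptron, and a softmax over a linear projection of the raw query embeddings for the term gate. It also omits the embedding compression step and the concatenation of document word importance into the interaction matrix. All function and parameter names (interaction_matrix, top_k_pooling, gate_w, w1, b1, w2, b2) are hypothetical.

```python
import math


def cosine(u, v):
    """Cosine similarity of two vectors; 0.0 if either has zero norm."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def interaction_matrix(query_emb, doc_emb):
    """One row per query word, one column per document word (assumed: cosine)."""
    return [[cosine(q, d) for d in doc_emb] for q in query_emb]


def top_k_pooling(matrix, k):
    """Ordered k-max pooling: keep the k largest values of each row, descending."""
    return [sorted(row, reverse=True)[:k] for row in matrix]


def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def term_score(signals, w1, b1, w2, b2):
    """Assumed one-hidden-layer MLP mapping k signals to a scalar term score."""
    hidden = [
        math.tanh(sum(w * s for w, s in zip(row, signals)) + b)
        for row, b in zip(w1, b1)
    ]
    return sum(w * h for w, h in zip(w2, hidden)) + b2


def relevance_score(query_emb, doc_emb, k, gate_w, w1, b1, w2, b2):
    """Aggregate per-term scores with softmax term gates (hypothetical form)."""
    signals = top_k_pooling(interaction_matrix(query_emb, doc_emb), k)
    scores = [term_score(s, w1, b1, w2, b2) for s in signals]
    gates = softmax([sum(w * x for w, x in zip(gate_w, q)) for q in query_emb])
    return sum(g * s for g, s in zip(gates, scores))
```

Because the pooled signals are sorted, the perceptron sees a fixed-length input per query term regardless of document length. The sketch assumes each document has at least k words; a real implementation would likely pad shorter documents.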