Multi-Grained Knowledge Retrieval for End-to-End Task-Oriented Dialog

Fanqi Wan, Weizhou Shen, Ke Yang, Xiaojun Quan, Wei Bi
2023-05-17
Abstract: Retrieving proper domain knowledge from an external database lies at the heart of end-to-end task-oriented dialog systems to generate informative responses. Most existing systems blend knowledge retrieval with response generation and optimize them with direct supervision from reference responses, leading to suboptimal retrieval performance when the knowledge base becomes large-scale. To address this, we propose to decouple knowledge retrieval from response generation and introduce a multi-grained knowledge retriever (MAKER) that includes an entity selector to search for relevant entities and an attribute selector to filter out irrelevant attributes. To train the retriever, we propose a novel distillation objective that derives supervision signals from the response generator. Experiments conducted on three standard benchmarks with both small and large-scale knowledge bases demonstrate that our retriever performs knowledge retrieval more effectively than existing methods. Our code has been made publicly available at https://github.com/18907305772/MAKER.
Computation and Language
What problem does this paper attempt to address?
The paper addresses knowledge retrieval in end-to-end task-oriented dialog (E2E-TOD) systems, and in particular how to keep retrieval effective when the knowledge base is large. Existing E2E-TOD systems typically blend knowledge retrieval with response generation and optimize both through direct supervision from reference responses. With a large-scale knowledge base, this coupling leads to suboptimal retrieval. To solve this, the authors propose a multi-grained knowledge retriever (MAKER), which decouples knowledge retrieval from response generation and uses an entity selector and an attribute selector to retrieve relevant entities and their attributes from the knowledge base.

The main contributions of MAKER are:

1. **Decoupling Knowledge Retrieval and Response Generation**: Retrieval is separated from response generation, so the retriever is no longer optimized only indirectly through supervision from reference responses.
2. **Multi-Grained Knowledge Retrieval**: The entity selector searches for relevant entities, and the attribute selector then filters out irrelevant attributes within them, giving finer-grained retrieval.
3. **Training Method**: The cross-attention scores of the response generator serve as the supervision signal for training the retriever, a distillation objective that guides the retriever toward the knowledge the generator actually uses.

Experiments on three standard benchmarks, with both small and large-scale knowledge bases, show that MAKER retrieves knowledge more effectively than existing methods. According to this summary, the advantage over baseline methods is most pronounced on large-scale knowledge bases.
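The retrieval steps and the training signal described above can be sketched in a few lines of plain Python. This is a minimal illustration under stated assumptions, not the authors' implementation: the function names, the top-k entity selection, the threshold-based attribute filter, and the KL-divergence form of the distillation loss are all hypothetical choices made here for clarity. The source states only that the supervision comes from the response generator's cross-attention scores.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def top_k_entities(retriever_scores, k):
    """Entity selector (assumed form): indices of the k best-scoring entities."""
    order = sorted(range(len(retriever_scores)),
                   key=lambda i: retriever_scores[i], reverse=True)
    return order[:k]


def filter_attributes(attribute_scores, threshold=0.5):
    """Attribute selector (assumed form): keep attributes scoring >= threshold."""
    return [name for name, score in attribute_scores.items()
            if score >= threshold]


def distillation_loss(retriever_scores, attention_scores):
    """KL(target || retriever) over the retrieved entities (assumed objective).

    retriever_scores: the retriever's relevance score per retrieved entity.
    attention_scores: non-negative cross-attention mass the generator placed
        on each entity (hypothetical aggregation); treated as a fixed target.
    """
    total = sum(attention_scores)
    target = [a / total for a in attention_scores]
    predicted = softmax(retriever_scores)
    return sum(t * math.log(t / p)
               for t, p in zip(target, predicted) if t > 0.0)


# Usage with made-up numbers: four candidate entities, keep the best two.
scores = [2.0, 0.5, 1.0, -1.0]
kept = top_k_entities(scores, k=2)
attributes = filter_attributes({"name": 0.9, "phone": 0.2, "address": 0.7})
loss = distillation_loss([scores[i] for i in kept], [0.3, 0.7])
print(kept, attributes, round(loss, 3))
```

The loss is zero when the retriever's distribution over the retrieved entities already matches the generator's attention distribution, and grows as the two diverge. In a real system the gradient would flow only into the retriever, with the attention-derived target held fixed.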