Learning Retrieval Augmentation for Personalized Dialogue Generation

Qiushi Huang,Shuai Fu,Xubo Liu,Wenwu Wang,Tom Ko,Yu Zhang,Lilian Tang
DOI: https://doi.org/10.18653/v1/2023.emnlp-main.154
2024-06-27
Abstract:Personalized dialogue generation, focusing on generating highly tailored responses by leveraging persona profiles and dialogue context, has gained significant attention in conversational AI applications. However, persona profiles, a prevalent setting in current personalized dialogue datasets, typically composed of merely four to five sentences, may not offer comprehensive descriptions of the persona about the agent, posing a challenge to generate truly personalized dialogues. To handle this problem, we propose $\textbf{L}$earning Retrieval $\textbf{A}$ugmentation for $\textbf{P}$ersonalized $\textbf{D}$ial$\textbf{O}$gue $\textbf{G}$eneration ($\textbf{LAPDOG}$), which studies the potential of leveraging external knowledge for persona dialogue generation. Specifically, the proposed LAPDOG model consists of a story retriever and a dialogue generator. The story retriever uses a given persona profile as queries to retrieve relevant information from the story document, which serves as a supplementary context to augment the persona profile. The dialogue generator utilizes both the dialogue history and the augmented persona profile to generate personalized responses. For optimization, we adopt a joint training framework that collaboratively learns the story retriever and dialogue generator, where the story retriever is optimized towards desired ultimate metrics (e.g., BLEU) to retrieve content for the dialogue generator to generate personalized responses. Experiments conducted on the CONVAI2 dataset with ROCStory as a supplementary data source show that the proposed LAPDOG method substantially outperforms the baselines, indicating the effectiveness of the proposed method. The LAPDOG model code is publicly available for further exploration. <a class="link-external link-https" href="https://github.com/hqsiswiliam/LAPDOG" rel="external noopener nofollow">this https URL</a>
Computation and Language,Artificial Intelligence,Machine Learning
What problem does this paper attempt to address?
### The Problem Addressed by the Paper The paper primarily focuses on an issue in Personalized Dialogue Generation: the persona profiles in existing personalized dialogue datasets typically consist of 4 to 5 sentences. Such brief descriptions are insufficient to comprehensively reflect the characteristics of the personas, leading to generated dialogues that lack personalization. To address this issue, the authors propose a method called **Learning Retrieval Augmentation for Personalized DialOgue Generation (LAPDOG)**. Specifically, LAPDOG consists of two components: 1. **Story Retriever**: Retrieves relevant information from an external story dataset based on the given persona description. 2. **Dialogue Generator**: Generates more personalized responses by combining dialogue history with the augmented persona description. Through this approach, LAPDOG can enrich persona descriptions by acquiring additional information from external data sources, thereby better reflecting the persona's individuality during the generation process. Experimental results show that the LAPDOG method significantly outperforms baseline methods on the CONV AI2 dataset, demonstrating its effectiveness and potential.