Learning to Memorize Entailment and Discourse Relations for Persona-Consistent Dialogues

Ruijun Chen, Jin Wang, Liang-Chih Yu, Xuejie Zhang
DOI: https://doi.org/10.48550/arXiv.2301.04871
2023-02-25
Abstract: Maintaining engagement and consistency is particularly important in dialogue systems. Existing works have improved the performance of dialogue systems by intentionally learning interlocutor personas with sophisticated network structures. One issue with this approach is that it requires more personal corpora with annotations. Additionally, these models typically perform next-utterance prediction to generate a response but neglect the discourse coherence of the entire conversation. To address these issues, this study proposes a method of learning to memorize entailment and discourse relations for persona-consistent dialogue tasks. Entailment text pairs from a natural language inference dataset were used to learn latent entailment relations as external memories through a premise-to-hypothesis generation task. Furthermore, an internal memory with a similar architecture was applied to the discourse information in the dialogue. Placing orthogonality restrictions on these two memory spaces ensures that the latent entailment relations remain dialogue-independent. Both memories collaborate to obtain entailment and discourse representations for generation, allowing a deeper understanding of both consistency and coherence. Experiments on two large public datasets, PersonaChat and DSTC7-AVSD, demonstrated the effectiveness of the proposed method. Both automatic and human evaluations indicate that the proposed model outperforms several strong baselines in terms of both persona consistency and response coherence. Our source code is available at https://github.com/Chenrj233/LMEDR.
Computation and Language, Artificial Intelligence
What problem does this paper attempt to address?
This paper targets two weaknesses of dialogue systems: persona consistency and dialogue coherence.

1. **Persona Consistency**: Existing dialogue systems often generate responses that contradict the assigned persona or their own earlier statements. For example, asked "What is your job?" and "What do you do for a living?" in two turns of the same dialogue, a model may answer "I am a lawyer" and then "I am a doctor". Such inconsistency makes the dialogue less natural and erodes the user's trust.
2. **Dialogue Coherence**: Most existing dialogue generation models are trained for next-utterance prediction and rely on local context, neglecting coherence across the whole conversation. Dialogue coherence refers to the logical relationships and overall organization among the utterances in a dialogue, which is crucial for conveying information effectively.

To address these challenges, the paper proposes Learning to Memorize Entailment and Discourse Relations (LMEDR), which combines:

- **External Memory Module**: Learns and stores latent entailment relations from a Natural Language Inference (NLI) dataset through a premise-to-hypothesis generation task. These relations are independent of any specific dialogue and help keep generated responses consistent with the persona.
- **Internal Memory Module**: Uses a similar architecture to learn and store discourse information from the dialogue itself, capturing relationships across turns to support coherence.
- **Orthogonal Constraint**: Imposes orthogonality between the two memory spaces so that the latent entailment relations remain dialogue-independent.

Experiments on two public datasets, PersonaChat and DSTC7-AVSD, show that the method outperforms several strong baselines in both automatic and human evaluations of persona consistency and response coherence.
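
The interplay between the two memories and the orthogonal constraint can be illustrated with a minimal sketch. This is not the authors' implementation: the text here does not give the exact formulas, so the attention-style memory read, the squared-Frobenius-norm penalty, and the names `read_memory` and `orthogonality_penalty` are all assumptions chosen for illustration. It uses only the Python standard library and toy 4-dimensional memory slots.

```python
import math

def read_memory(query, memory):
    """Hypothetical memory read: softmax over dot-product scores,
    then a weighted sum of the memory slots."""
    scores = [sum(q * m for q, m in zip(query, slot)) for slot in memory]
    peak = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    weights = [e / total for e in exps]
    dim = len(memory[0])
    return [sum(w * slot[j] for w, slot in zip(weights, memory)) for j in range(dim)]

def orthogonality_penalty(ext_memory, int_memory):
    """Assumed form of the constraint: the sum of squared dot products
    between every external slot and every internal slot, i.e. the squared
    Frobenius norm of ext_memory @ int_memory^T. It is zero exactly when
    the two sets of slots are mutually orthogonal."""
    return sum(
        sum(a * b for a, b in zip(e, i)) ** 2
        for e in ext_memory
        for i in int_memory
    )

# Toy memories (assumed values): external slots for entailment relations,
# internal slots for discourse information.
ext_memory = [[1.0, 0.0, 0.0, 0.0], [0.0, 1.0, 0.0, 0.0]]
int_memory = [[0.0, 0.0, 1.0, 0.0], [0.0, 0.0, 0.0, 1.0]]
overlapping = [[1.0, 0.0, 1.0, 0.0], [0.0, 0.0, 0.0, 1.0]]

# A query (e.g. an encoded dialogue context) reads from the external memory.
query = [2.0, 0.0, 0.0, 0.0]
entailment_vec = read_memory(query, ext_memory)

penalty_orthogonal = orthogonality_penalty(ext_memory, int_memory)   # 0.0
penalty_overlapping = orthogonality_penalty(ext_memory, overlapping)  # 1.0
print(penalty_orthogonal, penalty_overlapping)
```

In a trained model, a penalty of this kind would presumably be added to the generation loss so that gradient updates push the two memory spaces apart; the weighting of that term is not specified here and is left out of the sketch.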