Abstract: Maintaining engagement and consistency is particularly important in dialogue systems. Existing works have improved the performance of dialogue systems by intentionally learning interlocutor personas with sophisticated network structures. One issue with this approach is that it requires more personal corpora with annotations. Additionally, these models typically perform next utterance prediction to generate a response but neglect discourse coherence across the entire conversation. To address these issues, this study proposes a method of learning to memorize entailment and discourse relations for persona-consistent dialogue tasks. Entailment text pairs from a natural language inference dataset were used to learn latent entailment relations as external memories through a premise-to-hypothesis generation task. Furthermore, an internal memory with a similar architecture was applied to the discourse information in the dialogue. Placing orthogonality restrictions on these two memory spaces ensures that the latent entailment relations remain dialogue-independent. Both memories collaborate to obtain entailment and discourse representations for generation, allowing a deeper understanding of both consistency and coherence. Experiments on two large public datasets, PersonaChat and DSTC7-AVSD, demonstrated the effectiveness of the proposed method. Both automatic and human evaluations indicate that the proposed model outperforms several strong baselines in terms of both persona consistency and response coherence. Our source code is available at https://github.com/Chenrj233/LMEDR.

What problem does this paper attempt to address?

The paper addresses two problems in dialogue systems: persona consistency and dialogue coherence.

1. **Persona Consistency**: Existing dialogue systems often fail to maintain a consistent persona, so a generated response may contradict the assigned character background or something the system said earlier. For example, asked "What is your job?" and "What do you do for a living?" in two turns of the same dialogue, a model may answer "I am a lawyer" to the first and "I am a doctor" to the second, even though the questions mean the same thing. Such inconsistency makes the dialogue less natural and reduces the user's trust.
2. **Dialogue Coherence**: Most existing dialogue generation models focus on predicting the next utterance and neglect coherence across the whole conversation. Dialogue coherence refers to the logical relationships among utterances and the overall organization of the dialogue, which is crucial for conveying information effectively. Models that attend only to local context miss this overall structure.

To address these challenges, the paper proposes a new method, Learning to Memorize Entailment and Discourse Relations (LMEDR), to improve both the persona consistency and the dialogue coherence of a dialogue system. Specifically:

- **External Memory Module**: Learns and stores latent entailment relations from a Natural Language Inference (NLI) dataset through a premise-to-hypothesis generation task. These relations are independent of any specific dialogue and are used to keep generated responses consistent with the persona.
- **Internal Memory Module**: Learns and stores discourse information from the dialogue itself, capturing the relationships among utterances to support dialogue coherence. It uses an architecture similar to that of the external memory.
- **Orthogonal Constraint**: Imposes an orthogonality restriction between the two memory spaces, which reduces the correlation between them and keeps the latent entailment relations dialogue-independent.

Experiments on two public datasets, PersonaChat and DSTC7-AVSD, show that the method outperforms several strong baselines in both automatic and human evaluation, in terms of persona consistency and response coherence.
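The idea of an orthogonality restriction between two memory spaces can be illustrated with a small sketch. The summary does not give the paper's exact loss or memory-read mechanism, so everything here is an assumption chosen for illustration: memories are plain lists of slot vectors, the penalty is the squared Frobenius norm of the cross-Gram matrix between external and internal slots (a common way to encourage orthogonality), and `read_memory` is a generic softmax attention read. The names `cross_gram`, `orthogonality_penalty`, and `read_memory` are hypothetical and are not taken from the LMEDR code.

```python
import math
from typing import List

Vector = List[float]
Matrix = List[Vector]  # one row per memory slot


def cross_gram(ext_mem: Matrix, int_mem: Matrix) -> Matrix:
    """Dot product between every external slot and every internal slot."""
    return [[sum(a * b for a, b in zip(e, d)) for d in int_mem] for e in ext_mem]


def orthogonality_penalty(ext_mem: Matrix, int_mem: Matrix) -> float:
    """Squared Frobenius norm of the cross-Gram matrix (assumed formulation).

    The value is zero exactly when every external slot is orthogonal to
    every internal slot, and grows as the two memory spaces overlap.
    """
    return sum(v * v for row in cross_gram(ext_mem, int_mem) for v in row)


def read_memory(query: Vector, memory: Matrix) -> Vector:
    """Generic soft attention read: softmax over slot scores, weighted sum of slots."""
    scores = [sum(q * m for q, m in zip(query, slot)) for slot in memory]
    top = max(scores)  # subtract the max for numerical stability
    weights = [math.exp(s - top) for s in scores]
    total = sum(weights)
    dim = len(memory[0])
    return [sum(w / total * slot[i] for w, slot in zip(weights, memory)) for i in range(dim)]


if __name__ == "__main__":
    # Toy memories: two slots each, four dimensions.
    external = [[1.0, 0.0, 0.0, 0.0], [0.0, 1.0, 0.0, 0.0]]
    internal_disjoint = [[0.0, 0.0, 1.0, 0.0], [0.0, 0.0, 0.0, 1.0]]
    internal_overlap = [[1.0, 0.0, 0.0, 0.0], [0.0, 0.0, 0.0, 1.0]]

    print(orthogonality_penalty(external, internal_disjoint))  # 0.0
    print(orthogonality_penalty(external, internal_overlap))   # 1.0
```

In a training setup of this kind, such a penalty would typically be added to the generation loss with a weighting coefficient, so that minimizing it pushes the two memories toward separate subspaces; whether LMEDR weights or formulates its restriction this way is not stated in the summary.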

Learning to Memorize Entailment and Discourse Relations for Persona-Consistent Dialogues
