Joint Dual Learning with Mutual Information Maximization for Natural Language Understanding and Generation in Dialogues

Shang-Yu Su,Yung-Sung Chuang,Yun-Nung Chen
DOI: https://doi.org/10.1109/taslp.2024.3364063
2024-01-01
Abstract:Modular conversational systems heavily rely on the performance of their natural language understanding (NLU) and natural language generation (NLG) components. NLU focuses on extracting core semantic concepts from input texts, while NLG constructs coherent sentences based on these extracted semantics. Inspired by information theory in digital communication, we introduce a one-way communication model that mirrors human conversations, comprising two distinct phases: (1) the conversion of thoughts into messages, similar to NLG, and (2) the comprehension of received messages, similar to NLU. This paper presents a novel algorithm that trains NLU and NLG collaboratively by concatenating their models and maximizing mutual information between inputs and outputs. This approach efficiently facilitates the transmission of semantics, leading to enhanced learning performance for both components. Our experimental results, based on three benchmark datasets, consistently demonstrate significant improvements for both NLU and NLG tasks, highlighting the practical promise of our proposed method.
engineering, electrical & electronic,acoustics
What problem does this paper attempt to address?