PLATO: Pre-trained Dialogue Generation Model with Discrete Latent Variable

Siqi Bao,Huang He,Fan Wang,Hua Wu,Haifeng Wang
2019-01-01
Abstract:Pre-training models have been proved effective for a wide range of naturallanguage processing tasks. Inspired by this, we propose a novel dialoguegeneration pre-training framework to support various kinds of conversations,including chit-chat, knowledge grounded dialogues, and conversational questionanswering. In this framework, we adopt flexible attention mechanisms to fullyleverage the bi-directional context and the uni-directional characteristic oflanguage generation. We also introduce discrete latent variables to tackle theinherent one-to-many mapping problem in response generation. Two reciprocaltasks of response generation and latent act recognition are designed andcarried out simultaneously within a shared network. Comprehensive experimentson three publicly available datasets verify the effectiveness and superiorityof the proposed framework.
What problem does this paper attempt to address?