Why Do Neural Dialog Systems Generate Short and Meaningless Replies? A Comparison Between Dialog and Translation

Bolin Wei, Shuai Lu, Lili Mou, Hao Zhou, Pascal Poupart, Ge Li, Zhi Jin
DOI: https://doi.org/10.1109/icassp.2019.8682634
2019-01-01
Abstract: This paper addresses the question: In neural dialog systems, why do sequence-to-sequence (Seq2Seq) neural networks generate short and meaningless replies in open-domain response generation? We conjecture that, because spoken language is highly variable, a single utterance in a dialog system may have multiple equally plausible replies, and that this one-to-many mapping causes the Seq2Seq model to perform poorly. To evaluate this conjecture, we propose a systematic way to mimic the dialog scenario in machine translation systems, using both real datasets and carefully constructed toy datasets. Experimental results show that we can reproduce the phenomenon of generating short and meaningless sentences in the translation setting.