Investigating deep reinforcement learning techniques in personalized dialogue generation

Yang Min, Qu Qiang, Lei Kai, Zhu Jia, Zhao Zhou, Chen Xiaojun, Joshua Zhexue Huang
DOI: https://doi.org/10.1137/1.9781611975321.71
2018-01-01
Abstract: In this paper, we propose a personalized dialogue generation system that combines reinforcement learning techniques with an attention-based hierarchical recurrent encoder-decoder model. First, we incorporate user-specific information into the decoder to capture each user's background information and speaking style. Second, we employ reinforcement learning to maximize future reward in dialogue, which enables our system to generate topic-coherent, informative, and grammatical responses. To this end, we propose three types of rewards that characterize good conversations. Finally, we compare the performance of three reinforcement learning methods in dialogue generation: policy gradient, Q-learning, and actor-critic algorithms. We conduct experiments on two dialogue datasets to verify the effectiveness of the proposed model. The results show that our model generates better personalized dialogues for different users. Quantitatively, our method outperforms state-of-the-art dialogue systems in terms of BLEU score, perplexity, and human evaluation. © 2018 by SIAM.
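To make "maximizing future reward" concrete, the following is a minimal sketch of the generic policy-gradient (REINFORCE) objective over the turns of a dialogue. It is not the paper's implementation: the discount factor `gamma`, the per-turn reward values, and the function names `discounted_returns` and `reinforce_loss` are illustrative assumptions, and the three reward types proposed in the paper are not reproduced here.

```python
import math


def discounted_returns(rewards, gamma=0.9):
    """Compute G_t = r_t + gamma * G_{t+1} for every turn t."""
    returns = []
    running = 0.0
    # Walk backwards so each return can reuse the one that follows it.
    for r in reversed(rewards):
        running = r + gamma * running
        returns.append(running)
    returns.reverse()
    return returns


def reinforce_loss(log_probs, rewards, gamma=0.9):
    """REINFORCE surrogate loss: -sum_t log pi(a_t | s_t) * G_t.

    log_probs[t] is the log-probability the policy assigned to the
    response generated at turn t; rewards[t] is the scalar reward
    received for that response.
    """
    returns = discounted_returns(rewards, gamma)
    return -sum(lp * g for lp, g in zip(log_probs, returns))


# Hypothetical three-turn dialogue (values are made up for illustration).
turn_rewards = [0.2, 0.5, 1.0]
turn_log_probs = [math.log(0.5), math.log(0.25), math.log(0.8)]
loss = reinforce_loss(turn_log_probs, turn_rewards)
```

Minimizing this loss raises the probability of responses that lead to higher cumulative reward later in the conversation, which is the general idea behind the policy-gradient variant named in the abstract. Q-learning and actor-critic methods estimate the future reward with a learned value function instead of the sampled return used here.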