Personalized Response Generation via Domain adaptation

Min Yang,Zhou Zhao,Wei Zhao,Xiaojun Chen,Jia Zhu,Lianqiang Zhou,Zigang Cao
DOI: https://doi.org/10.1145/3077136.3080706
2017-01-01
Abstract:In this paper, we propose a novel personalized response generation model via domain adaptation (PRG-DM). First, we learn the human responding style from large general data (without user-specific information). Second, we fine tune the model on a small size of personalized data to generate personalized responses with a dual learning mechanism. Moreover, we propose three new rewards to characterize good conversations that are personalized, informative and grammatical. We employ the policy gradient method to generate highly rewarded responses. Experimental results show that our model can generate better personalized responses for different users.
What problem does this paper attempt to address?