Dialog generation based on hierarchical encoding and deep reinforcement learning

Yuqing ZHAO,Yang XIANG
DOI: https://doi.org/10.11772/j.issn.1001-9081.2017.10.2813
2017-01-01
Abstract:Aiming at dialog generation problem,a dialog generation model based on hierarchical encoding and deep reinforcement learning,namely Enhanced Hierarchical Recurrent Encoder-Decoder (EHRED) was proposed to solve the problem that standard sequence to sequence (seq2seq) architectures are more likely to raise highly generic responses due to the Maximum Likelihood Estimate (MLE) loss function.A multi-round dialog model was built by hierarchical structure,and a hierarchical layer was added to enhance the memory of history dialog based on the standard seq2seq architecture,and then a language model was used to build reward function,replacing traditional MLE loss function with policy gradient method in deep reinforcement learning for training.Experimental results show that EHRED can generate responses with richer semantic information and improve by 5.7-11.1 percentage points in standard manual evaluation compared with the widely used traditional standard seq2seq Recurrent Neural Network (RNN) dialog generation model.
What problem does this paper attempt to address?