Learning to Customize Model Structures for Few-shot Dialogue Generation Tasks

Yiping Song,Zequn Liu,Wei Bi,Rui Yan,Ming Zhang
DOI: https://doi.org/10.18653/v1/2020.acl-main.517
2020-01-01
Abstract:Training the generative models with minimal corpus is one of the criticalchallenges for building open-domain dialogue systems. Existing methods tend touse the meta-learning framework which pre-trains the parameters on allnon-target tasks then fine-tunes on the target task. However, fine-tuningdistinguishes tasks from the parameter perspective but ignores themodel-structure perspective, resulting in similar dialogue models for differenttasks. In this paper, we propose an algorithm that can customize a uniquedialogue model for each task in the few-shot setting. In our approach, eachdialogue model consists of a shared module, a gating module, and a privatemodule. The first two modules are shared among all the tasks, while the thirdone will differentiate into different network structures to better capture thecharacteristics of the corresponding task. The extensive experiments on twodatasets show that our method outperforms all the baselines in terms of taskconsistency, response quality, and diversity.
What problem does this paper attempt to address?