MCPG: A Flexible Multi-Level Controllable Framework for Unsupervised Paraphrase Generation.

Yi Chen,Haiyun Jiang,Lemao Liu,Rui Wang,Shuming Shi,Ruifeng Xu
DOI: https://doi.org/10.18653/v1/2022.findings-emnlp.439
2022-01-01
Abstract:We present MCPG: a simple and effective approach for controllable unsupervised paraphrase generation, which is also flexible to adapt to specific domains without extra training.MCPG is controllable in different levels: local lexicons, global semantics, and universal styles.The unsupervised paradigm of MCPG combines factual keywords and diversified semantic embeddings as local lexical and global semantic constraints.The semantic embeddings are diversified by standard dropout, which is exploited for the first time to increase inference diversity by us.Moreover, MCPG is qualified with good domain adaptability by adding a transfer vector as a universal style constraint, which is refined from the exemplars retrieved from the corpus of the target domain in a training-free way.Extensive experiments show that MCPG outperforms state-of-the-art unsupervised baselines by a margin.Meanwhile, our domain-adapted MCPG also achieves competitive performance with strong supervised baselines even without training.
What problem does this paper attempt to address?