Non-autoregressive Transformer by Position Learning

Yu Bao,Hao Zhou,Jiangtao Feng,Mingxuan Wang,Shujian Huang,Jiajun Chen,Lei LI
DOI: https://doi.org/10.48550/arXiv.1911.10677
2019-11-25
Computation and Language
Abstract:Non-autoregressive models are promising on various text generation tasks. Previous work hardly considers to explicitly model the positions of generated words. However, position modeling is an essential problem in non-autoregressive text generation. In this study, we propose PNAT, which incorporates positions as a latent variable into the text generative process. Experimental results show that PNAT achieves top results on machine translation and paraphrase generation tasks, outperforming several strong baselines.
What problem does this paper attempt to address?