Zero-Shot Translation using Diffusion Models

Eliya Nachmani,Shaked Dovrat
DOI: https://doi.org/10.48550/arXiv.2111.01471
2021-11-02
Abstract:In this work, we show a novel method for neural machine translation (NMT), using a denoising diffusion probabilistic model (DDPM), adjusted for textual data, following recent advances in the field. We show that it's possible to translate sentences non-autoregressively using a diffusion model conditioned on the source sentence. We also show that our model is able to translate between pairs of languages unseen during training (zero-shot learning).
Computation and Language,Machine Learning
What problem does this paper attempt to address?