The Ubiqus English-Inuktitut System for WMT20

François Hernandez, Vincent Nguyen
DOI: https://doi.org/10.48550/arXiv.2011.09249
2020-11-18
Abstract: This paper describes Ubiqus' submission to the WMT20 English-Inuktitut shared news translation task. Our main system, and only submission, is based on a multilingual approach, jointly training a Transformer model on several agglutinative languages. The English-Inuktitut translation task is challenging at every step, from data selection, preparation and tokenization to quality evaluation down the line. Difficulties emerge both because of the peculiarities of the Inuktitut language and because of the low-resource context.
Computation and Language