Adapting BigScience Multilingual Model to Unseen Languages

Zheng-Xin Yong,Vassilina Nikoulina
DOI: https://doi.org/10.48550/arXiv.2204.04873
2022-04-11
Abstract:We benchmark different strategies of adding new languages (German and Korean) into the BigScience's pretrained multilingual language model with 1.3 billion parameters that currently supports 13 languages. We investigate the factors that affect the language adaptability of the model and the trade-offs between computational costs and expected performance.
Computation and Language,Artificial Intelligence,Machine Learning
What problem does this paper attempt to address?