Quantitative Evaluation of Alternative Translations in a Corpus of Highly Dissimilar Finnish Paraphrases

Li-Hsin Chang,Sampo Pyysalo,Jenna Kanerva,Filip Ginter
DOI: https://doi.org/10.48550/arXiv.2105.02477
2021-05-06
Computation and Language
Abstract:In this paper, we present a quantitative evaluation of differences between alternative translations in a large recently released Finnish paraphrase corpus focusing in particular on non-trivial variation in translation. We combine a series of automatic steps detecting systematic variation with manual analysis to reveal regularities and identify categories of translation differences. We find the paraphrase corpus to contain highly non-trivial translation variants difficult to recognize through automatic approaches.
What problem does this paper attempt to address?