Using crowdsourcing system for creating site-specific statistical machine translation engine

Alexander Kalinin, George Savchenko
DOI: https://doi.org/10.48550/arXiv.1409.5502
2014-09-19
Abstract: A crowdsourcing translation approach is an effective tool for globalization of site content, but it is also an important source of parallel linguistic data. For a given site processed with a crowdsourcing system, a sentence-aligned corpus can be fetched that covers a very narrow domain of terminology and language patterns: a site-specific domain. These data can be used for training and estimation of a site-specific statistical machine translation engine.
Computation and Language