Linguistic features of genre and method variation in translation: a computational perspective

Ekaterina Lapshinova-Koltunski,Marcos Zampieri
DOI: https://doi.org/10.1515/9783110595864-005
2018-04-09
Abstract:In this contribution we describe the use of text classification methods to investigate genre and method variation in an English - German translation corpus. For this purpose we use linguistically motivated features representing texts using a combination of part-of-speech tags arranged in bigrams, trigrams, and 4-grams. The classification method used in this study is a Bayesian classifier with Laplace smoothing. We use the output of the classifiers to carry out an extensive feature analysis on the main difference between genres and methods of translation.
What problem does this paper attempt to address?