Class-based Named Entity Translation in a Speech to Speech Translation System.

Sameer R. Maskey,Martin Cmejrek,Bowen Zhou,Yuqing Gao
DOI: https://doi.org/10.1109/slt.2008.4777888
2008-01-01
Abstract:Named Entity (NE) Translation is a challenging problem in Machine Translation (MT). Most of the training bi-text corpora for MT lack enough samples of NEs to cover the wide variety of contexts NEs can appear in. In this paper, we present a technique to translate NEs based on their NE types in addition to a phrase-based translation model. Our NE translation model is based on a syntax-based system similar to [1]; but we produce syntax-based rules with non-terminals as NE types instead of general non-terminals. Such class-based rules allow us to better generalize the context NEs. We show that our proposed method obtains an improvement of 0.66 BLEU score absolute as well as 0.26% in F-1-measure over the baseline of phrase-based model in NE test set.
What problem does this paper attempt to address?