Study on the feature-rich collocation translation

CHEN Yin, LI Sheng
DOI: https://doi.org/10.3321/j.issn:0367-6234.2007.11.027
2007-01-01
Abstract: This paper proposes a new method for collocation translation. We develop a collocation translation model that makes full use of the information available from both monolingual and bilingual corpora. Instead of relying heavily on bilingual parallel corpora, our approach can train translation models from monolingual corpora. The model exploits both collocation-internal information and contextual information. The EM algorithm is applied to estimate contextual word translation probabilities from a monolingual corpus. The model can also integrate features derived from bilingual corpora when they are available. Experiments show that our approach outperforms existing monolingual-corpus-based methods for collocation translation and achieves still better results when an available bilingual corpus is used.