SMT Domain Adaptation Based on Monolingual Context Information

CAO Jie,LV Yajuan,SU Jinsong,LIU Qun
DOI: https://doi.org/10.3969/j.issn.1003-0077.2010.06.008
2010-01-01
Abstract:Domain adaptation problem will arise when statistical machine translation(SMT) system is used to translate domain-specific texts.When the texts to be translated and the training data come from the same domain,SMT system can achieve good performance.Otherwise,the translation quality will degrade dramatically.In general,domain-specific parallel corpus is limited,while domain-mixed parallel corpus and domain-specific monolingual corpus are easy to obtain.According to the fact,this paper proposed a new translation model which utilized domain-mixed parallel corpus and domain-specific monolingual corpus to improve the domain translation quality.Experiments show that the proposed method improves translation performance in three IWSLT evaluation tests significantly.
What problem does this paper attempt to address?