Joint Alignment and Artificial Data Generation: an Empirical Study of Pivot-based Machine Transliteration

Min Zhang, Xiangyu Duan, Ming Liu, Yunqing Xia, Haizhou Li
2011-01-01
Abstract: In this paper, we first investigate two existing pivot strategies for statistical machine transliteration, namely the system-based and model-based strategies, to determine why the previous model-based strategy performs much worse than the system-based one. We then propose a joint alignment algorithm that optimizes transliteration alignments jointly across the source, pivot and target languages to improve the performance of the model-based strategy. In addition, we propose a novel synthetic data-based strategy, which artificially generates source-target data using the pivot language. Experimental results on benchmark data show that the proposed joint alignment optimization algorithm significantly improves the accuracy of the model-based strategy and that the proposed synthetic data-based strategy is very effective for pivot-based machine transliteration.