I Can Guess What You Mean: A Monolingual Query Enhancement for Machine Translation

Chenxi Pang,Hai Zhao,Zhongyi Li
DOI: https://doi.org/10.1007/978-3-319-47674-2_5
2016-01-01
Abstract:We introduce a monolingual query method with additional webpage data to improve the translation quality for more and more official use requirement of statistical machine translation outputs. The motivation behind this method is that we can improve the readability of sentence once for all if we replace translation sentences with the most related sentences generated by human. Based on vector space representations for translated sentences, we perform a query on search engine for additional reference text data. Then we rank all translation sentences to make necessary replacement from the query results. Various vector representations for sentence, TFIDF, latent semantic indexing, and neural network word embedding, are conducted and the experimental results show an alternative solution to enhance the current machine translation with a performance improvement about 0.5 BLEU in French-to-English task and 0.7 BLEU in English-to-Chinese task.
What problem does this paper attempt to address?