TOPICS TRANSLATION MODEL-BASED BILINGUAL TEXT ERRORS CORRECTION

Huan Chen,Qi Zhang
DOI: https://doi.org/10.3969/j.issn.1000-386x.2016.03.067
2016-01-01
Abstract:Along with the globalisation of information in recent years,multilingual mixing phenomena have become increasingly popular in social networks texts.It is quite common in Chinese texts that other languages are mixed.Since most of the existing natural language processing algorithm is the monolingual task-based,the multilingual mixed text can’t be well processed,therefore it is crucial to pre-process the text before carrying out other natural language processing tasks.For the lack of the corpus of bilingual alignment in network text semantic space,we proposed a topics translation model-based method,it calculates the probability of bilingual alignment of network text semantic space using the corpus in different semantic spaces,then incorporates neural network language model to translate the English in mixed network text to corresponding Chinese text.The experiment was set on a manual labelled test corpus.Experimental result indicated that through different comparative experiments it was proved that the proposed approach was effective and was able to improve translation accuracy.
What problem does this paper attempt to address?