The Technical Analyses of Named Entity Translation

Ying Liu
DOI: https://doi.org/10.2991/isci-15.2015.266
2015-01-01
Abstract:There are three methods: rule-based method, statistical method and web mining method for named entity translation. The rule-based method did not achieve satisfactory results. High-quality translation equivalents can be obtained from parallel corpora for statistical method, and a prerequisite is the availability of a large scale of annotated corpora. The comparable corpora are easier to obtain than parallel corpora. But translation extraction from comparable corpora achieves lower accuracy than that of parallel corpora. Web mining method can acquire the translation of high-frequency named entities and it is difficult to translate the low-frequency named entities.
What problem does this paper attempt to address?