Abstract: Accurate translation of entities (e.g., person names, organizations, geography) is important in neural machine translation (NMT): entities are usually more difficult to translate than other words, and translating them incorrectly greatly hurts the user experience. Previous work either treats entities in the same way as other words, which leads to inaccurate translation, or handles them in multiple steps (named entity recognition, translation, and replacing the entities back), which significantly increases inference latency. In this work, we propose an end-to-end algorithm that carefully handles the translation of entities. It differs from conventional NMT models in two main ways: (1) The encoder and the decoder are each equipped with an entity classifier, which is used to determine whether an input token is a named entity. In this way, the encoder and decoder are capable of treating named entities differently; (2) The translation loss of each target token is adaptively increased according to the probability that the target token is a named entity, which results in more accurate translation of entities. At inference time, these two parts are removed so that the translation model maintains the same inference speed as conventional NMT models. Empirical results on six translation tasks demonstrate that our method improves translation quality. Specifically, we improve BLEU by 1.7 points on Japanese-to-English translation and entity $F_{1}$ by 4.6 points on English-to-Chinese translation, without additional inference cost.
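
The entity-aware loss in point (2) can be illustrated with a minimal sketch. The abstract does not give the exact weighting formula, so the form used here, a per-token weight of `1 + alpha * p_entity` applied to the negative log-likelihood, is an assumption made for illustration, as are the function name `entity_weighted_nll` and the parameter `alpha`. In the described method the entity probabilities would come from the decoder-side entity classifier; here they are simply passed in as numbers.

```python
import math
from typing import Sequence


def entity_weighted_nll(
    token_probs: Sequence[float],
    entity_probs: Sequence[float],
    alpha: float = 1.0,
) -> float:
    """Mean negative log-likelihood with entity-aware per-token weights.

    token_probs[t]  : model probability of the reference target token t.
    entity_probs[t] : probability that target token t is a named entity.
    alpha           : hypothetical scaling factor (assumption, not from the paper).
    """
    if len(token_probs) != len(entity_probs):
        raise ValueError("token_probs and entity_probs must have equal length")
    if not token_probs:
        raise ValueError("at least one target token is required")

    total = 0.0
    for p_tok, p_ent in zip(token_probs, entity_probs):
        nll = -math.log(p_tok)            # standard translation loss for this token
        weight = 1.0 + alpha * p_ent      # assumed form: larger for likely entities
        total += weight * nll
    return total / len(token_probs)


# Usage: the second token is almost certainly an entity, so its loss counts more.
token_probs = [0.9, 0.5, 0.8]
entity_probs = [0.0, 1.0, 0.1]
plain_loss = entity_weighted_nll(token_probs, entity_probs, alpha=0.0)
weighted_loss = entity_weighted_nll(token_probs, entity_probs, alpha=1.0)
print(f"plain: {plain_loss:.4f}, entity-weighted: {weighted_loss:.4f}")
```

With `alpha = 0` the sketch reduces to the ordinary mean token loss. Because the weighting only affects training, dropping it (and the classifiers) at inference leaves decoding cost unchanged, which is consistent with the claim above.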

Efficient Entity Translation Mining

Efficient Entity Translation Mining: A Parallelized Graph Alignment Approach

Mining Name Translations from Entity Graph Mapping.

Cross-Lingual Entity Matching for Heterogeneous Online Wikis.

Research of Entity Matching Based on Multiple Heterogenous Data

Named entity translation method based on machine translation lexicon

An Approach to Extract Named Entity Translingual Equivalence

English-Chinese Name Translation Based on Web Mining

Translation of English-Chinese Person Name Based on Dictionary, Bilingual Corpus and Web Mining.

Development of Translation Database based on Chinese-English parallel corpora

A Local Information Perception Enhancement–Based Method for Chinese NER

Fusion of multiple features and ranking SVM for web-based English-Chinese OOV term translation

Chinese-English OOV Term Translation with Web Mining, Multiple Feature Fusion and Supervised Learning

Extract and Attend: Improving Entity Translation in Neural Machine Translation

End-to-end entity-aware neural machine translation

Learning Inter-Related Statistical Query Translation Models for English-Chinese Bi-Directional CLIR

Web-Based Terminology Translation Mining

Entity Matching Across Heterogeneous Sources

The Technical Analyses of Named Entity Translation

Unsupervised Deep Cross-Language Entity Alignment

Digital Mining Algorithm of English Translation Course Information Based on Digital Twin Technology