Abstract: Neural machine translation (NMT) has driven notable progress in machine translation since its emergence. However, because NMT uses a single neural network to convert between natural languages, it has two drawbacks with respect to reducing translation time: it is more sensitive to sentence length than statistical machine translation, and its end-to-end implementation does not make explicit use of linguistic knowledge to improve translation performance. In this study, several deep learning network models were built and their performance compared on English-Chinese translation, and the defects of each network were addressed with an attention mechanism. Recurrent neural networks are prone to vanishing and exploding gradients on long-distance sequences, and long short-term memory (LSTM) networks cannot reflect how information should be weighted across such sequences. A comparison of examples shows that introducing an attention mechanism improves the model's attention to context information while it generates the target-language sequence, which yields translations with higher fidelity and fluency. This study also proposes an NMT method based on a divide-and-conquer strategy. The method identifies and extracts the longest noun phrase in a sentence and retains a special identifier or core word in its place, which together with the rest of the sentence forms a sentence frame. The NMT system translates the longest noun phrase and the sentence frame separately, and the two translations are then recombined, which alleviates the poor performance of NMT on long sentences. Experimental results show that the BLEU score of the translations produced by the proposed method improves by 0.89 over the baseline method.
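
The divide-and-conquer procedure described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. The placeholder token `<NP>`, the function names, and the toy `translate` callable are hypothetical, and the span of the longest noun phrase is assumed to be supplied by an external chunker or parser, which the abstract does not specify.

```python
from typing import Callable, List, Tuple

# Hypothetical special identifier that stands in for the extracted noun phrase.
PLACEHOLDER = "<NP>"

Translator = Callable[[List[str]], List[str]]


def split_sentence(tokens: List[str], np_span: Tuple[int, int]) -> Tuple[List[str], List[str]]:
    """Split a sentence into (sentence frame, longest noun phrase).

    np_span is a half-open (start, end) token range; it is assumed to come
    from an external noun-phrase identifier.
    """
    start, end = np_span
    noun_phrase = tokens[start:end]
    frame = tokens[:start] + [PLACEHOLDER] + tokens[end:]
    return frame, noun_phrase


def translate_divide_and_conquer(
    tokens: List[str], np_span: Tuple[int, int], translate: Translator
) -> List[str]:
    """Translate frame and noun phrase separately, then recombine them."""
    frame, noun_phrase = split_sentence(tokens, np_span)
    frame_translation = translate(frame)
    np_translation = translate(noun_phrase)

    output: List[str] = []
    replaced = False
    for token in frame_translation:
        if token == PLACEHOLDER and not replaced:
            # Put the translated noun phrase back where the identifier sits.
            output.extend(np_translation)
            replaced = True
        else:
            output.append(token)
    if not replaced:
        # Assumption: if the translator dropped the identifier, append the phrase.
        output.extend(np_translation)
    return output


def toy_translate(tokens: List[str]) -> List[str]:
    """Stand-in for an NMT system: upper-cases tokens, keeps the identifier."""
    return [t if t == PLACEHOLDER else t.upper() for t in tokens]


sentence = "the very old red wooden house collapsed".split()
frame, phrase = split_sentence(sentence, (0, 6))
result = translate_divide_and_conquer(sentence, (0, 6), toy_translate)
```

In a real system `toy_translate` would be replaced by calls to a trained NMT model, and the model would need to be trained or constrained so that the identifier survives translation of the sentence frame.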
