Abstract: Neural machine translation (NMT) has driven rapid progress in machine translation since its emergence. However, because NMT relies on a single neural network to convert one natural language into another, it has two drawbacks: it is more sensitive to sentence length than statistical machine translation, and its end-to-end design makes no explicit use of linguistic knowledge to improve translation performance. This study builds several deep learning translation models, compares their performance in the English-Chinese direction, and uses an attention mechanism to address the shortcomings of each network. Recurrent neural networks are prone to vanishing and exploding gradients on long sequences, and long short-term memory (LSTM) networks cannot reflect how much weight each piece of information should carry in long sequences. A comparison of examples shows that introducing an attention mechanism increases the model's attention to context information while it generates the target-language sequence, yielding translations with higher fidelity and fluency. This study also proposes a neural machine translation method based on the divide-and-conquer strategy. The method identifies and extracts the longest noun phrase in a sentence and leaves a special identifier or core word in its place, so that the remainder of the sentence forms a sentence frame. The NMT system translates the longest noun phrase and the sentence frame separately and then recombines the translations, which alleviates the poor performance of NMT on long sentences. Experimental results show that the proposed method improves the BLEU score by 0.89 over the baseline method.
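
The divide-and-conquer procedure described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. It assumes that noun-phrase spans are already available from some external chunker, that the special identifier is a token named `<NP>` (a hypothetical name; the abstract does not give one), and that the translator copies this identifier through unchanged. The `toy_translate` function is a stand-in that merely uppercases tokens so the control flow can be followed without a real NMT system.

```python
from typing import Callable, List, Tuple

# Hypothetical special identifier; the abstract does not name the actual token.
PLACEHOLDER = "<NP>"

Span = Tuple[int, int]  # (start, end) token indices, end exclusive
Translator = Callable[[List[str]], List[str]]


def longest_span(spans: List[Span]) -> Span:
    """Return the noun-phrase span that covers the most tokens."""
    return max(spans, key=lambda s: s[1] - s[0])


def split_sentence(tokens: List[str], np_spans: List[Span]) -> Tuple[List[str], List[str]]:
    """Split a sentence into (sentence frame, longest noun phrase)."""
    start, end = longest_span(np_spans)
    phrase = tokens[start:end]
    frame = tokens[:start] + [PLACEHOLDER] + tokens[end:]
    return frame, phrase


def translate_divide_and_conquer(
    tokens: List[str], np_spans: List[Span], translate: Translator
) -> List[str]:
    """Translate frame and longest noun phrase separately, then recombine."""
    if not np_spans:
        return translate(tokens)
    frame, phrase = split_sentence(tokens, np_spans)
    frame_out = translate(frame)
    if PLACEHOLDER not in frame_out:
        # Assumption: if the identifier was lost, fall back to the whole sentence.
        return translate(tokens)
    phrase_out = translate(phrase)
    result: List[str] = []
    for tok in frame_out:
        if tok == PLACEHOLDER:
            result.extend(phrase_out)  # put the translated phrase back in place
        else:
            result.append(tok)
    return result


def toy_translate(tokens: List[str]) -> List[str]:
    """Stand-in translator: uppercases tokens and keeps the identifier intact."""
    return [t if t == PLACEHOLDER else t.upper() for t in tokens]


sentence = "the old man in the red coat bought a book".split()
np_spans = [(0, 7), (8, 10)]  # assumed chunker output for illustration
frame, phrase = split_sentence(sentence, np_spans)
output = translate_divide_and_conquer(sentence, np_spans, toy_translate)
```

In this sketch the frame is the four-token sequence `<NP> bought a book`, so the translator sees two short inputs instead of one ten-token sentence. A real system would also need to handle reordering of the identifier in the target language, which the sketch covers only by substituting the phrase wherever the identifier appears in the translated frame.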

Improved training of neural trans-dimensional random field language models with dynamic noise-contrastive estimation.

Learning neural trans-dimensional random field language models with noise-contrastive estimation

Improving and Scaling Trans-dimensional Random Field Language Models.

Model Interpolation with Trans-dimensional Random Field Language Models for Speech Recognition

Learning Trans-Dimensional Random Fields with Applications to Language Modeling

Trans-dimensional Random Fields for Language Modeling.

Integrating Discrete and Neural Features via Mixed-Feature Trans-dimensional Random Field Language Models

Improved Neural Language Model Fusion for Streaming Recurrent Neural Network Transducer

Investigating the Effect of Language Models in Sequence Discriminative Training for Neural Transducers

Neural Machine Translation with Noisy Lexical Constraints.

Learning FOFE Based FNN-LMs with Noise Contrastive Estimation and Part-of-speech Features

Improving RNN transducer with normalized jointer network

Improving Accented Mandarin Speech Recognition by Using Recurrent Neural Network Based Language Model Adaptation

Discriminative method for recurrent neural network language models

Frequency-Aware Contrastive Learning for Neural Machine Translation

NEWLSTM: an Optimized Long Short-Term Memory Language Model for Sequence Prediction.

Deep Learning-Based English-Chinese Translation Research

Lattice-Based Recurrent Neural Network Encoders for Neural Machine Translation

Minimum Bayes Risk Training of RNN-Transducer for End-to-End Speech Recognition

Improving Mandarin End-to-End Speech Recognition with Word N-gram Language Model

Revisiting Simple Neural Probabilistic Language Models