Abstract: Neural network language models (LMs) have been shown to be effective in improving the performance of statistical machine translation (SMT) systems. However, state-of-the-art neural network LMs usually use only the words before the current position as context and neglect global topic information, which can help machine translation (MT) systems select better translation candidates from a higher-level perspective. In this work, we propose improving the state-of-the-art feedforward neural language model with topic information. Two main issues need to be tackled when adding topics to neural network LMs for SMT: one is how to incorporate topics into the neural network; the other is how to obtain the target-side topic distribution before translation. We incorporate topics by appending the topic distribution to the input layer of a feedforward LM. We adopt a multinomial logistic regression (MLR) model to predict the target-side topic distribution from source-side information. Moreover, we propose a feedforward neural network model to learn joint representations on the source side for topic prediction. LM experiments demonstrate that the perplexity on the validation set can be greatly reduced by the topic-enhanced feedforward LM, and that the prediction of target-side topics can be improved dramatically when the MLR model is equipped with the joint source representations. A final MT experiment, conducted on a large-scale Chinese-English dataset, shows that our feedforward LM with predicted topics improves translation performance over a strong baseline.
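
The two mechanisms described in the abstract, appending a topic distribution to the input layer of a feedforward LM and predicting that distribution with a softmax (MLR) model over source-side features, can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration and is not taken from the paper: the class and function names, the layer sizes, the tanh hidden layer, the random untrained weights, and the toy source feature vector.

```python
import math
import random


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def affine(weights, bias, x):
    """Compute W x + b, where weights is a list of rows."""
    return [sum(w * xi for w, xi in zip(row, x)) + b
            for row, b in zip(weights, bias)]


def rand_matrix(rows, cols, rng):
    """Small random matrix; stands in for trained parameters (assumption)."""
    return [[rng.uniform(-0.1, 0.1) for _ in range(cols)] for _ in range(rows)]


def predict_topics(source_features, weights, bias):
    """MLR: softmax over an affine map of (hypothetical) source-side features."""
    return softmax(affine(weights, bias, source_features))


class TopicFeedforwardLM:
    """Feedforward LM whose input is [context embeddings ; topic distribution]."""

    def __init__(self, vocab_size, embed_dim, context_size,
                 num_topics, hidden_dim, seed=0):
        rng = random.Random(seed)
        self.embeddings = rand_matrix(vocab_size, embed_dim, rng)
        # The input layer is widened by num_topics to hold the topic vector.
        input_dim = context_size * embed_dim + num_topics
        self.w_hidden = rand_matrix(hidden_dim, input_dim, rng)
        self.b_hidden = [0.0] * hidden_dim
        self.w_out = rand_matrix(vocab_size, hidden_dim, rng)
        self.b_out = [0.0] * vocab_size

    def next_word_probs(self, context_ids, topic_dist):
        # Concatenate the embeddings of the preceding words ...
        x = [v for i in context_ids for v in self.embeddings[i]]
        # ... and append the topic distribution to the same input vector.
        x = x + list(topic_dist)
        hidden = [math.tanh(z) for z in affine(self.w_hidden, self.b_hidden, x)]
        return softmax(affine(self.w_out, self.b_out, hidden))


# Toy usage: 4 source features, 3 topics, vocabulary of 10 words.
rng = random.Random(1)
mlr_weights = rand_matrix(3, 4, rng)
topics = predict_topics([1.0, 0.0, 0.5, 0.2], mlr_weights, [0.0] * 3)

lm = TopicFeedforwardLM(vocab_size=10, embed_dim=4, context_size=2,
                        num_topics=3, hidden_dim=5)
probs = lm.next_word_probs([1, 7], topics)
```

In this sketch the topic vector is simply concatenated with the context embeddings, so the only architectural change relative to a plain feedforward LM is a wider input layer. At translation time the predicted `topics` would stand in for the target-side distribution, which is not yet observable.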

Adaptation of Language Models for SMT Using Neural Networks with Topic Information.
