Abstract: In the past decade, statistical machine translation (SMT) has advanced from word-based SMT to phrase- and syntax-based SMT. Although this advancement has produced significant improvements in BLEU scores, crucial meaning errors and a lack of cross-sentence connections at the discourse level still hurt the quality of SMT-generated translations. More recently, we have witnessed two active movements in SMT research: one towards combining semantics and SMT in an attempt to generate translations that are not only grammatical but also meaning-preserving, and the other towards exploring discourse knowledge for document-level machine translation in order to capture inter-sentence dependencies. The emergence of semantic SMT is due to the combination of two factors: the necessity of semantic modeling in SMT, and the renewed interest in the semantics community in designing models tailored to relevant NLP/SMT applications. The former is represented by numerous recent studies exploring word sense disambiguation, semantic role labeling, bilingual semantic representations, and semantic evaluation for SMT. The latter is reflected in the CoNLL shared tasks and the SemEval and Senseval exercises of recent years. The need to capture cross-sentence dependencies for document-level SMT has triggered a resurgence of interest in modeling translation from the perspective of discourse. Discourse phenomena such as coherence relations, discourse topics, and lexical cohesion, which are beyond the scope of conventional sentence-level n-grams, have recently been considered and explored in the context of SMT. This tutorial aims to provide a timely and combined introduction to recent work along these two trends, as discourse is inherently connected with semantics. The tutorial has three parts. The first part critically reviews phrase- and syntax-based SMT.
The second part is devoted to lines of research oriented towards semantic SMT, including a brief introduction to semantics, lexical and shallow semantics tailored to SMT, semantic representations in SMT, semantically motivated evaluation, and advanced topics on deep semantic learning for SMT. The third part is dedicated to recent work on SMT with discourse, including a brief review of discourse studies from linguistic and computational viewpoints, discourse research from monolingual to multilingual settings, discourse-based SMT, and a few advanced topics. The tutorial is targeted at researchers in the SMT, semantics, and discourse communities. In particular, the expected audience comes from two groups: 1) researchers and students in the SMT community who want to design cutting-edge models and algorithms for semantic SMT with various semantic knowledge and representations, and who would like to advance SMT from sentence-by-sentence translation to document-level translation with discourse information; 2) researchers and students from the semantics and discourse communities who are interested in developing models and methods and adapting them to SMT.


Semantics, Discourse and Statistical Machine Translation.
