Neural machine translation with Gumbel Tree-LSTM based encoder
Chao Su,Heyan Huang,Shumin Shi,Ping Jian,Xuewen Shi
DOI: https://doi.org/10.1016/j.jvcir.2020.102811
IF: 2.887
2020-01-01
Journal of Visual Communication and Image Representation
Abstract:Neural machine translation has improved the translation accuracy greatly and received great attention of the machine translation community. Tree-based translation models aim to model the syntactic or semantic relation among long-distance words or phrases in a sentence. However, it faces the difficulties of expensive manual annotation cost and poor automatic annotation accuracy. In this paper, we focus on how to encode a source sentence into a vector in a unsupervised-tree way and then decode it into a target sentence. Our model incorporates Gumbel Tree-LSTM, which can learn how to compose tree structures from plain text without any tree annotation. We evaluate the proposed model on both spoken and news corpora, and show that the performance of our proposed model outperforms the attentional seq2seq model and the Transformer base model. (c) 2020 Published by Elsevier Inc.