Data-driven Fuzzy Target Representation for Intelligent Translation System

Kehai Chen,Muyun Yang,Tiejun Zhao,Min Zhang
DOI: https://doi.org/10.1109/tfuzz.2022.3167129
IF: 12.253
2022-01-01
IEEE Transactions on Fuzzy Systems
Abstract:The encoder-decoder framework has been widely used in various practical artificial intelligence cyber-physical systems including intelligent translation systems. The decoding process in such a framework usually demands the target-side representation, which is often completed by an auto-aggressive decoder to capture the target context information at the current time step. However, the auto-aggressive decoder only models the previously generated partial target fragment and fails in modeling the global contextual information. In this paper, we propose a new data-driven fuzzy context representation strategy to simulate the global target information. Specifically, we design two fuzzy methods to the global target contextual information: i) bag-of-word of target language generated via a softmax layer from the source sentence representation; ii) whole target sentence retrieved from the translation memory according to the source sentence representation. Both methods facilitate the auto-aggressive decoder to attend to the global target context at the current time-step and thereby learn a more effective context vector for enhancing the generation of target translation. Extensive experiments on two machine translation tasks demonstrated that the proposed method obtained a 3% improvement of BLEU scores over a strong baseline.
What problem does this paper attempt to address?