Dynamic Fusion Nearest Neighbor Machine Translation Via Dempster-Shafer Theory

Zongheng Yang,Hongxu Hou,Shuo Sun,Nier Wu,Yisong Wang,Weichen Jian,Pengcong Wang
DOI: https://doi.org/10.1007/978-981-19-7960-6_9
2022-01-01
Abstract:kNN-MT has been recently proposed, uses a token-level k-nearest neighbor approach to retrieve similar sentences, obtaining knowledge guidance from an external memory module, and then combined with the prediction results of the translation model, which greatly improves the accuracy of machine translation. However, kNN-MT uses simple linear interpolation in the fusion of retrieval probability and translation probability, which can not dynamically adjust the fusion ratio according to the matching degree of the retrieved sentences. Moreover, different fusion ratios need to be explored in different translation scenarios, and the translation effect will be affected when the retrieved sentences have a low matching degree or contain noise. In this paper, we propose an approach via Dempster-Shafer theory (DST) to dynamically fuse different probability distributions to suit different scenarios. We demonstrate that our approach is more significantly improved and more robust than the traditional kNN-MT, and we explore the application of kNN-MT in low-resource translation scenarios for the first time.
What problem does this paper attempt to address?