Target-Agnostic Gender-Aware Contrastive Learning for Mitigating Bias in Multilingual Machine Translation

Minwoo Lee,Hyukhun Koh,Kang-il Lee,Dongdong Zhang,Minsung Kim,Kyomin Jung
DOI: https://doi.org/10.48550/arXiv.2305.14016
2023-05-23
Computation and Language
Abstract:Gender bias is a significant issue in machine translation, leading to ongoing research efforts in developing bias mitigation techniques. However, most works focus on debiasing of bilingual models without consideration for multilingual systems. In this paper, we specifically target the unambiguous gender bias issue of multilingual machine translation models and propose a new mitigation method based on a novel perspective on the problem. We hypothesize that the gender bias in unambiguous settings is due to the lack of gender information encoded into the non-explicit gender words and devise a scheme to encode correct gender information into their latent embeddings. Specifically, we employ Gender-Aware Contrastive Learning, GACL, based on gender pseudo-labels to encode gender information on the encoder embeddings. Our method is target-language-agnostic and applicable to already trained multilingual machine translation models through post-fine-tuning. Through multilingual evaluation, we show that our approach improves gender accuracy by a wide margin without hampering translation performance. We also observe that incorporated gender information transfers and benefits other target languages regarding gender accuracy. Finally, we demonstrate that our method is applicable and beneficial to models of various sizes.
What problem does this paper attempt to address?
The paper aims to address the issue of gender bias in multilingual machine translation. Specifically, the researchers found that existing multilingual neural machine translation systems exhibit significant gender bias when handling translations with explicit gender, especially in terms of performance differences across different target language directions. To tackle this problem, they proposed a new method—Gender-Aware Contrastive Learning (GACL), which mitigates bias by encoding correct gender information into the latent representations of non-explicit gender vocabulary. This method is not only applicable to already trained multilingual machine translation models but also significantly improves gender accuracy without compromising translation performance. Moreover, this debiasing effect can be transferred to other untrained target languages. Additionally, experimental results show that this method is effective for models of different scales. In summary, the main contributions of the paper are: 1. Investigating the relationship between translation performance and gender accuracy in existing multilingual machine translation systems. 2. Evaluating for the first time the effectiveness of gender debiasing techniques in multilingual machine translation models and demonstrating that the debiasing effect can be transferred across different languages. 3. Proposing a new Gender-Aware Contrastive Loss (GACL), which is target language-independent and effectively reduces gender bias in multilingual machine translation models. 4. Demonstrating through experiments that this contrastive learning method performs well across various model architectures with minimal impact on actual translation performance.