Target-Agnostic Gender-Aware Contrastive Learning for Mitigating Bias in Multilingual Machine Translation

Minwoo Lee,Hyukhun Koh,Kang-il Lee,Dongdong Zhang,Minsung Kim,Kyomin Jung

DOI: https://doi.org/10.48550/arXiv.2305.14016

2023-05-23

Computation and Language

Abstract:Gender bias is a significant issue in machine translation, leading to ongoing research efforts in developing bias mitigation techniques. However, most works focus on debiasing of bilingual models without consideration for multilingual systems. In this paper, we specifically target the unambiguous gender bias issue of multilingual machine translation models and propose a new mitigation method based on a novel perspective on the problem. We hypothesize that the gender bias in unambiguous settings is due to the lack of gender information encoded into the non-explicit gender words and devise a scheme to encode correct gender information into their latent embeddings. Specifically, we employ Gender-Aware Contrastive Learning, GACL, based on gender pseudo-labels to encode gender information on the encoder embeddings. Our method is target-language-agnostic and applicable to already trained multilingual machine translation models through post-fine-tuning. Through multilingual evaluation, we show that our approach improves gender accuracy by a wide margin without hampering translation performance. We also observe that incorporated gender information transfers and benefits other target languages regarding gender accuracy. Finally, we demonstrate that our method is applicable and beneficial to models of various sizes.

What problem does this paper attempt to address?

The paper aims to address the issue of gender bias in multilingual machine translation. Specifically, the researchers found that existing multilingual neural machine translation systems exhibit significant gender bias when handling translations with explicit gender, especially in terms of performance differences across different target language directions. To tackle this problem, they proposed a new method—Gender-Aware Contrastive Learning (GACL), which mitigates bias by encoding correct gender information into the latent representations of non-explicit gender vocabulary. This method is not only applicable to already trained multilingual machine translation models but also significantly improves gender accuracy without compromising translation performance. Moreover, this debiasing effect can be transferred to other untrained target languages. Additionally, experimental results show that this method is effective for models of different scales. In summary, the main contributions of the paper are: 1. Investigating the relationship between translation performance and gender accuracy in existing multilingual machine translation systems. 2. Evaluating for the first time the effectiveness of gender debiasing techniques in multilingual machine translation models and demonstrating that the debiasing effect can be transferred across different languages. 3. Proposing a new Gender-Aware Contrastive Loss (GACL), which is target language-independent and effectively reduces gender bias in multilingual machine translation models. 4. Demonstrating through experiments that this contrastive learning method performs well across various model architectures with minimal impact on actual translation performance.

Target-Agnostic Gender-Aware Contrastive Learning for Mitigating Bias in Multilingual Machine Translation

Mitigating Gender Bias in Machine Translation through Adversarial Learning

MABEL: Attenuating Gender Bias using Textual Entailment Data

Fine-grained Gender Control in Machine Translation with Large Language Models

Reducing Gender Bias in Neural Machine Translation as a Domain Adaptation Problem

Gender Bias in Multilingual Embeddings and Cross-Lingual Transfer

GenderCARE: A Comprehensive Framework for Assessing and Reducing Gender Bias in Large Language Models

Beyond Binary Gender: Evaluating Gender-Inclusive Machine Translation with Ambiguous Attitude Words

Local Contrastive Editing of Gender Stereotypes

Reducing Gender Bias in Machine Translation through Counterfactual Data Generation

Locating and Mitigating Gender Bias in Large Language Models

On Measuring Gender Bias in Translation of Gender-neutral Pronouns

Identifying and Adapting Transformer-Components Responsible for Gender Bias in an English Language Model

Mitigating Data Imbalance and Representation Degeneration in Multilingual Machine Translation

What is Your Favorite Gender, MLM? Gender Bias Evaluation in Multilingual Masked Language Models

Mitigating Gender Bias in Contextual Word Embeddings

Evaluating Gender Bias in Machine Translation

Gender Bias in Multilingual Neural Machine Translation: The Architecture Matters

Investigating Markers and Drivers of Gender Bias in Machine Translations

Gender Lost In Translation: How Bridging The Gap Between Languages Affects Gender Bias in Zero-Shot Multilingual Translation