'Since Lawyers are Males..': Examining Implicit Gender Bias in Hindi Language Generation by LLMs

Ishika Joshi, Ishita Gupta, Adrita Dey, Tapan Parikh

2024-09-20

Abstract: Large Language Models (LLMs) are increasingly being used to generate text across various languages, for tasks such as translation, customer support, and education. Despite these advancements, LLMs show notable gender biases in English, which become even more pronounced when generating content in relatively underrepresented languages like Hindi. This study explores implicit gender biases in Hindi text generation and compares them to those in English. We developed Hindi datasets inspired by WinoBias to examine stereotypical patterns in responses from models like GPT-4o and Claude-3 Sonnet. Our results reveal a significant gender bias of 87.8% in Hindi, compared to 33.4% in English GPT-4o generation, with Hindi responses frequently relying on gender stereotypes related to occupations, power hierarchies, and social class. This research underscores the variation in gender biases across languages and provides considerations for navigating these biases in generative AI systems.

Computation and Language, Artificial Intelligence, Human-Computer Interaction

What problem does this paper attempt to address?

The paper examines the implicit gender bias that large language models (LLMs) exhibit when generating Hindi text and compares it with the gender bias in their English output. The authors developed two Hindi test datasets, HinStereo-100 and HEAStereo-50, adapted from existing bias detection frameworks (such as WinoBias) to capture the linguistic structures of Hindi. Using these datasets, they tested models such as GPT-4o and Claude-3 Sonnet. The study found that gender bias in Hindi-generated text is significantly higher than in English (87.8% vs. 33.4%), and that this bias is often tied to occupations, power hierarchies, and social class. For example, when handling gender-stereotyped professions (such as doctors and nurses), the models tend to generate text that follows those stereotypes. The research also explored how the gendered grammatical features of Hindi affect model behavior and how these biases are amplified in multilingual settings. In conclusion, the study shows that gender bias differs across languages and highlights the limitations of existing debiasing methods for highly gendered languages like Hindi, offering insights for the development of more inclusive and culturally sensitive AI systems.
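As a rough illustration of how a headline figure such as "87.8% vs. 33.4%" could be derived, the sketch below computes the share of model responses that annotators marked as stereotypical. The label names, the `stereotype_rate` helper, and the toy data are all hypothetical and are not taken from the paper, whose actual annotation scheme and scoring may differ.

```python
from collections import Counter

# Hypothetical annotation labels; the paper's actual labeling scheme may differ.
STEREOTYPICAL = "stereotypical"
ANTI_STEREOTYPICAL = "anti_stereotypical"
NEUTRAL = "neutral"


def stereotype_rate(labels):
    """Return the percentage of responses labeled as stereotypical."""
    if not labels:
        raise ValueError("labels must not be empty")
    counts = Counter(labels)
    return 100.0 * counts[STEREOTYPICAL] / len(labels)


# Toy labels for illustration only; these are NOT the paper's data.
toy_hindi = [STEREOTYPICAL] * 7 + [NEUTRAL] * 2 + [ANTI_STEREOTYPICAL]
toy_english = [STEREOTYPICAL] * 3 + [NEUTRAL] * 6 + [ANTI_STEREOTYPICAL]

print(f"Hindi (toy): {stereotype_rate(toy_hindi):.1f}%")
print(f"English (toy): {stereotype_rate(toy_english):.1f}%")
```

Reporting a single percentage per language keeps the comparison simple, but it hides which prompts drive the gap; a per-occupation breakdown would be a natural extension of the same counting approach.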

"Fifty Shades of Bias": Normative Ratings of Gender Bias in GPT Generated English Text

Unveiling Gender Bias in Terms of Profession Across LLMs: Analyzing and Addressing Sociological Implications

Gender Bias in Large Language Models across Multiple Languages

Efficient Gender Debiasing of Pre-trained Indic Language Models

Gender bias and stereotypes in Large Language Models

Investigating Cross-Linguistic Gender Bias in Hindi-English Across Domains

Evaluating Gender Bias in Hindi-English Machine Translation

Causally Testing Gender Bias in LLMs: A Case Study on Occupational Bias

Akal Badi ya Bias: An Exploratory Study of Gender Bias in Hindi Language Technology

Evaluating Gender Bias of LLMs in Making Morality Judgements

Evaluation of Large Language Models: STEM education and Gender Stereotypes

Evaluating Gender, Racial, and Age Biases in Large Language Models: A Comparative Analysis of Occupational and Crime Scenarios

Indian-BhED: A Dataset for Measuring India-Centric Biases in Large Language Models

With a Grain of SALT: Are LLMs Fair Across Social Dimensions?

Mitigating Gender Stereotypes in Hindi and Marathi

Assessing Gender Bias in LLMs: Comparing LLM Outputs with Human Perceptions and Official Statistics

Revealing Hidden Bias in AI: Lessons from Large Language Models

The African Woman is Rhythmic and Soulful: An Investigation of Implicit Biases in LLM Open-ended Text Generation

Gender Bias in LLM-generated Interview Responses