Sociodemographic Bias in Language Models: A Survey and Forward Path

Vipul Gupta,Pranav Narayanan Venkit,Shomir Wilson,Rebecca J. Passonneau
2024-08-14
Abstract:Sociodemographic bias in language models (LMs) has the potential for harm when deployed in real-world settings. This paper presents a comprehensive survey of the past decade of research on sociodemographic bias in LMs, organized into a typology that facilitates examining the different aims: types of bias, quantifying bias, and debiasing techniques. We track the evolution of the latter two questions, then identify current trends and their limitations, as well as emerging techniques. To guide future research towards more effective and reliable solutions, and to help authors situate their work within this broad landscape, we conclude with a checklist of open questions.
Computation and Language,Artificial Intelligence,Machine Learning
What problem does this paper attempt to address?
The problem that this paper attempts to solve is the sociodemographic bias in language models (LMs). Specifically, the paper focuses on the potential negative social impacts that may be caused by unfair performance for different social groups when these models are applied in the real world due to the biases in the training data. By comprehensively reviewing relevant research in the past decade, the paper aims to: 1. **Classify bias types**: Identify and classify different types of sociodemographic biases, such as gender bias, racial bias, etc. 2. **Quantify biases**: Explore how to measure these biases, including using intrinsic methods (such as distance - based metrics, probing indicators) and extrinsic methods (such as performance - based metrics, prompt - based metrics). 3. **Debiasing techniques**: Summarize existing debiasing techniques, including methods in the fine - tuning, training, and inference stages, and discuss the effectiveness and limitations of these techniques. The paper also points out the current research trends and limitations and makes some suggestions to guide future research directions to make them more effective and reliable. Through these efforts, the paper hopes to provide researchers with a clear framework for better understanding and dealing with the sociodemographic bias problem in language models.