Born With a Silver Spoon? Investigating Socioeconomic Bias in Large Language Models

Smriti Singh, Shuvam Keshari, Vinija Jain, Aman Chadha
2024-04-17
Abstract: Socioeconomic bias in society exacerbates disparities, influencing access to opportunities and resources based on individuals' economic and social backgrounds. This pervasive issue perpetuates systemic inequalities, hindering the pursuit of inclusive progress as a society. In this paper, we investigate the presence of socioeconomic bias, if any, in large language models. To this end, we introduce a novel dataset SilverSpoon, consisting of 3000 samples that illustrate hypothetical scenarios that involve underprivileged people performing ethically ambiguous actions due to their circumstances, and ask whether the action is ethically justified. Further, this dataset has a dual-labeling scheme and has been annotated by people belonging to both ends of the socioeconomic spectrum. Using SilverSpoon, we evaluate the degree of socioeconomic bias expressed in large language models and the variation of this degree as a function of model size. We also perform qualitative analysis to analyze the nature of this bias. Our analysis reveals that while humans disagree on which situations require empathy toward the underprivileged, most large language models are unable to empathize with the socioeconomically underprivileged regardless of the situation. To foster further research in this domain, we make SilverSpoon and our evaluation harness publicly available.
Computers and Society, Artificial Intelligence, Computation and Language
What problem does this paper attempt to address?
The paper asks whether large language models (LLMs) exhibit socioeconomic bias. To investigate, the authors construct a dataset named SilverSpoon, which contains 3,000 samples. Each sample describes a hypothetical scenario in which an underprivileged person performs an ethically ambiguous action because of their circumstances, and asks whether that action is ethically justified. The dataset uses a dual-labeling scheme: it is annotated by people from both ends of the socioeconomic spectrum, so that model responses can be compared against both perspectives. Using SilverSpoon, the study measures the degree of socioeconomic bias that LLMs express and how that degree varies with model size, and it adds a qualitative analysis of the nature of the bias. The analysis finds that while human annotators disagree on which situations call for empathy toward the underprivileged, most LLMs fail to empathize with the socioeconomically underprivileged regardless of the situation.