AGR: Age Group fairness Reward for Bias Mitigation in LLMs

Shuirong Cao,Ruoxi Cheng,Zhiqiang Wang
2024-09-06
Abstract:LLMs can exhibit age biases, resulting in unequal treatment of individuals across age groups. While much research has addressed racial and gender biases, age bias remains little explored. The scarcity of instruction-tuning and preference datasets for age bias hampers its detection and measurement, and existing fine-tuning methods seldom address age-related fairness. In this paper, we construct age bias preference datasets and instruction-tuning datasets for RLHF. We introduce ARG, an age fairness reward to reduce differences in the response quality of LLMs across different age groups. Extensive experiments demonstrate that this reward significantly improves response accuracy and reduces performance disparities across age groups. Our source code and datasets are available at the anonymous \href{https://anonymous.4open.science/r/FairRLHF-D445/readme.md}{link}.
Machine Learning,Artificial Intelligence,Computation and Language
What problem does this paper attempt to address?
The problem this paper attempts to address is the age bias present in large language models (LLMs). Specifically, LLMs may exhibit unequal treatment when processing information from different age groups, leading to differences in response quality among these groups. Although there has been extensive research on racial and gender biases, studies on age bias are relatively scarce. The lack of instruction tuning and preference datasets specifically targeting age bias makes it more challenging to detect and measure age bias. Additionally, existing fine-tuning methods rarely focus on age-related fairness. To tackle this challenge, the authors constructed an age bias preference dataset and an instruction tuning dataset, and introduced an age fairness reward mechanism called AGR (Age Group fairness Reward), aimed at reducing the disparity in response quality among different age groups. Through extensive experiments, the authors demonstrated that AGR can significantly improve response accuracy and reduce performance gaps between different age groups.