Few-Shot Fairness: Unveiling LLM's Potential for Fairness-Aware Classification

Garima Chhikara, Anurag Sharma, Kripabandhu Ghosh, Abhijnan Chakraborty
2024-02-29
Abstract: Employing Large Language Models (LLM) in various downstream applications such as classification is crucial, especially for smaller companies lacking the expertise and resources required for fine-tuning a model. Fairness in LLMs helps ensure inclusivity, equal representation based on factors such as race, gender and promotes responsible AI deployment. As the use of LLMs has become increasingly prevalent, it is essential to assess whether LLMs can generate fair outcomes when subjected to considerations of fairness. In this study, we introduce a framework outlining fairness regulations aligned with various fairness definitions, with each definition being modulated by varying degrees of abstraction. We explore the configuration for in-context learning and the procedure for selecting in-context demonstrations using RAG, while incorporating fairness rules into the process. Experiments conducted with different LLMs indicate that GPT-4 delivers superior results in terms of both accuracy and fairness compared to other models. This work is one of the early attempts to achieve fairness in prediction tasks by utilizing LLMs through in-context learning.
Computation and Language
What problem does this paper attempt to address?
The problem this paper attempts to solve is how to achieve fairness when using large language models (LLMs) for classification tasks. Specifically, the authors explore whether LLMs can understand and apply different fairness criteria through in-context learning when handling classification tasks. The paper notes that although LLMs perform well in various downstream applications, they may produce unfair results due to biases in the training data. The main objective of the research is therefore to evaluate whether LLMs can effectively incorporate fairness criteria into their predictions when given specific fairness prompts, thereby reducing or eliminating bias in those predictions.

To achieve this goal, the authors:

1. **Propose a framework**: The framework defines rules aligned with different fairness concepts and introduces these rules into the prediction process of LLMs through in-context learning.
2. **Compare the performance of different LLMs**: The experiments use three state-of-the-art LLMs (Llama-70b, GPT-4 and Gemini) and evaluate their performance in zero-shot and few-shot settings.
3. **Analyze the trade-off between accuracy and fairness**: The study investigates whether LLMs can achieve higher fairness while maintaining accuracy in different settings.
4. **Publish the experimental results**: The authors publish the prediction results for more than 1,000 test instances to promote future research.

Overall, this paper aims to explore and verify the potential of LLMs to achieve fairness through in-context learning in classification tasks, which matters for ensuring the accountability and inclusiveness of AI systems.
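To make the accuracy-fairness trade-off concrete, the sketch below scores a set of binary predictions on both axes. It is only an illustration: the summary does not say which fairness definitions the paper measures, so demographic parity (the gap in positive-prediction rates between groups) is an assumption chosen here because it is a common group-fairness metric. The function names, labels, and group values are all hypothetical.

```python
from typing import Hashable, Sequence


def accuracy(y_true: Sequence[int], y_pred: Sequence[int]) -> float:
    """Fraction of predictions that match the ground-truth labels."""
    if len(y_true) != len(y_pred) or len(y_true) == 0:
        raise ValueError("y_true and y_pred must be non-empty and equal length")
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def demographic_parity_difference(
    y_pred: Sequence[int], groups: Sequence[Hashable]
) -> float:
    """Largest gap in positive-prediction rate across groups (0 = parity).

    Assumption: demographic parity stands in for whichever fairness
    definitions the paper actually evaluates.
    """
    if len(y_pred) != len(groups) or len(y_pred) == 0:
        raise ValueError("y_pred and groups must be non-empty and equal length")
    totals: dict = {}
    positives: dict = {}
    for pred, group in zip(y_pred, groups):
        totals[group] = totals.get(group, 0) + 1
        positives[group] = positives.get(group, 0) + (1 if pred == 1 else 0)
    rates = [positives[g] / totals[g] for g in totals]
    return max(rates) - min(rates)


# Hypothetical predictions from a classifier over two protected groups.
y_true = [1, 0, 1, 0, 1, 0, 1, 0]
y_pred = [1, 0, 1, 1, 0, 0, 1, 0]
groups = ["a", "a", "a", "a", "b", "b", "b", "b"]

print(accuracy(y_true, y_pred))                       # 0.75
print(demographic_parity_difference(y_pred, groups))  # 0.5
```

In this toy case the classifier is 75% accurate but gives group "a" a positive prediction three times as often as group "b". A study of the trade-off would compare such pairs of numbers across prompting setups, for example zero-shot versus few-shot.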