Ensuring Fairness with Transparent Auditing of Quantitative Bias in AI Systems

Chih-Cheng Rex Yuan,Bow-Yaw Wang
2024-08-25
Abstract:With the rapid advancement of AI, there is a growing trend to integrate AI into decision-making processes. However, AI systems may exhibit biases that lead decision-makers to draw unfair conclusions. Notably, the COMPAS system used in the American justice system to evaluate recidivism was found to favor racial majority groups; specifically, it violates a fairness standard called equalized odds. Various measures have been proposed to assess AI fairness. We present a framework for auditing AI fairness, involving third-party auditors and AI system providers, and we have created a tool to facilitate systematic examination of AI systems. The tool is open-sourced and publicly available. Unlike traditional AI systems, we advocate a transparent white-box and statistics-based approach. It can be utilized by third-party auditors, AI developers, or the general public for reference when judging the fairness criterion of AI systems.
Computers and Society,Artificial Intelligence,Human-Computer Interaction
What problem does this paper attempt to address?
The paper attempts to address the issue of ensuring the fairness of Artificial Intelligence (AI) systems and proposes a transparent auditing framework to evaluate quantitative biases in AI systems. Specifically, the paper focuses on the following aspects: 1. **Background and Motivation**: - With the rapid development of AI technology, more and more decision-making processes are beginning to rely on AI systems. However, these AI systems may have biases, leading to unfair decisions. - For example, the COMPAS system used in the U.S. judicial system has been found to be unfair to minorities and disadvantaged groups, particularly violating the fairness standard of "equal opportunity." 2. **Research Objectives**: - Propose a comprehensive framework for auditing the fairness of AI systems. - This framework involves third-party auditors and AI system providers, aiming to evaluate the fairness of AI systems through a transparent approach. - Develop an open-source tool to help third-party auditors, AI developers, or the public assess the fairness standards of AI systems. 3. **Main Contributions**: - Define multiple fairness metrics, such as disparate impact, demographic parity, conditional statistical parity, overall accuracy equality, mean difference, equal opportunity, predictive equality, conditional use accuracy equality, predictive parity, equal calibration, positive balance, and negative balance. - Provide a Python package that supports common data set formats (such as CSV), enabling users to conveniently conduct fairness assessments. - Validate the effectiveness and accuracy of the framework through practical applications, such as analyzing the COMPAS dataset from ProPublica. 4. **Application Example**: - The paper conducts a detailed analysis of the COMPAS dataset using the proposed framework, confirming that African American defendants indeed face unfair scoring in the COMPAS system, consistent with ProPublica's findings. In summary, the paper aims to help various parties better evaluate and ensure the fairness of AI systems by proposing a transparent and systematic auditing framework, thereby reducing bias and unfair phenomena.