Enhancing Group Fairness in Online Settings Using Oblique Decision Forests

Somnath Basu Roy Chowdhury, Nicholas Monath, Ahmad Beirami, Rahul Kidambi, Avinava Dubey, Amr Ahmed, Snigdha Chaturvedi
2024-04-28
Abstract: Fairness, especially group fairness, is an important consideration in the context of machine learning systems. The most commonly adopted group fairness-enhancing techniques are in-processing methods that rely on a mixture of a fairness objective (e.g., demographic parity) and a task-specific objective (e.g., cross-entropy) during the training process. However, when data arrives in an online fashion -- one instance at a time -- optimizing such fairness objectives poses several challenges. In particular, group fairness objectives are defined using expectations of predictions across different demographic groups. In the online setting, where the algorithm has access to a single instance at a time, estimating the group fairness objective requires additional storage and significantly more computation (e.g., forward/backward passes) than the task-specific objective at every time step. In this paper, we propose Aranyani, an ensemble of oblique decision trees, to make fair decisions in online settings. The hierarchical tree structure of Aranyani enables parameter isolation and allows us to efficiently compute the fairness gradients using aggregate statistics of previous decisions, eliminating the need for additional storage and forward/backward passes. We also present an efficient framework to train Aranyani and theoretically analyze several of its properties. We conduct empirical evaluations on 5 publicly available benchmarks (including vision and language datasets) to show that Aranyani achieves a better accuracy-fairness trade-off compared to baseline approaches.
Machine Learning
What problem does this paper attempt to address?
This paper addresses the problem of achieving group fairness in an online setting, where data arrives one instance at a time. The authors identify two related challenges in optimizing group fairness objectives online:

1. **Definition of group fairness objectives**: Group fairness objectives (such as demographic parity) are defined through the expected values of predictions for different demographic groups. In an online setting, the algorithm can access only one instance at a time, so estimating these objectives requires additional storage and significantly more computation (such as forward/backward passes) than the task objective.
2. **High computational cost**: Optimizing a group fairness loss online usually means storing all (or at least part) of the input instances seen so far and running forward and backward passes over them at every learning step, which is computationally expensive.

To address these problems, the authors propose Aranyani, an ensemble of oblique decision trees. Aranyani exploits the tree structure to estimate the group fairness gradient efficiently by maintaining aggregate statistics of local node-level decisions, which eliminates the need for additional storage and extra forward/backward passes. The authors also provide an efficient training framework that works with modern automatic differentiation libraries and accelerators, and they theoretically analyze several properties of Aranyani.

### Formula Representation

- **Demographic Parity (DP)**: \[ \text{DP}=\left|\mathbb{E}[f(x)\mid a = 0]-\mathbb{E}[f(x)\mid a = 1]\right| \] where \(a\) is the protected attribute (such as gender or race), and \(f(x)\) is the predicted output of the model.
- **Optimization Goal (Offline Setting)**: \[ \min_f L(f(x),y)\quad\text{subject to}\quad\left|\mathbb{E}[f(x)\mid a = 0]-\mathbb{E}[f(x)\mid a = 1]\right|<\epsilon \] where \(L(\cdot,\cdot)\) is the task loss function (such as cross-entropy loss), and \(\epsilon\) is the allowed fairness error.
- **Node-level Fairness Constraints**: \[ F_{ij}=\mathbb{E}[n_{ij}(x)\mid a = 0]-\mathbb{E}[n_{ij}(x)\mid a = 1] \] where \(n_{ij}(x)\) is the output of the \(j\)-th node in the \(i\)-th layer, so \(F_{ij}\) is the demographic parity gap measured at a single node.
- **Relaxed Optimization Objective**: \[ \min_f\left\{L(f(x),y)+\lambda\sum_{i,j}H_\delta(F_{ij})\right\} \] where \(H_\delta\) is the Huber loss, used as a smooth surrogate for the L1 norm of \(F_{ij}\), and \(\lambda\) weights the fairness penalty against the task loss.

With these components, Aranyani can pursue group fairness more efficiently in an online setting while maintaining task performance.
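
The role of the aggregate statistics can be illustrated with a small sketch. It is not the authors' implementation: it only shows how a gap of the form \(F_{ij}\) can be tracked from per-group running sums and counts, without storing past instances, and how the Huber penalty \(H_\delta\) is evaluated on that gap. The names `RunningGroupMean` and `huber`, the choice \(\delta = 0.1\), and the toy stream are all assumptions made for illustration. The Huber form used is the standard one (quadratic within \(\delta\) of zero, linear outside).

```python
from dataclasses import dataclass, field


def huber(z: float, delta: float = 0.1) -> float:
    """Standard Huber loss: quadratic for |z| <= delta, linear beyond it."""
    if abs(z) <= delta:
        return 0.5 * z * z
    return delta * (abs(z) - 0.5 * delta)


@dataclass
class RunningGroupMean:
    """Hypothetical helper: per-group sum and count of one node's outputs.

    Only two sums and two counts are kept, so memory does not grow
    with the number of instances seen.
    """
    total: dict = field(default_factory=lambda: {0: 0.0, 1: 0.0})
    count: dict = field(default_factory=lambda: {0: 0, 1: 0})

    def update(self, output: float, a: int) -> None:
        """Fold one (node output, protected attribute) pair into the statistics."""
        self.total[a] += output
        self.count[a] += 1

    def gap(self) -> float:
        """Signed gap E[out | a=0] - E[out | a=1]; 0.0 until both groups are seen."""
        if self.count[0] == 0 or self.count[1] == 0:
            return 0.0
        return self.total[0] / self.count[0] - self.total[1] / self.count[1]


# Toy stream of (node output, protected attribute) pairs, one instance at a time.
stream = [(0.9, 0), (0.2, 1), (0.7, 0), (0.4, 1)]

stats = RunningGroupMean()
for output, a in stream:
    stats.update(output, a)

node_gap = stats.gap()                    # group means are 0.8 and 0.3
dp_estimate = abs(node_gap)               # demographic parity gap at this node
penalty = huber(node_gap, delta=0.1)      # one term of the relaxed objective's sum
```

In a full model, one such statistic would be kept per tree node and the penalties summed over all \((i, j)\) before being scaled by \(\lambda\). The sketch covers only the forward computation of the penalty; how the fairness gradient itself is assembled from aggregate statistics is described in the paper and is not reproduced here.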