Jing Li, Yinghua Yao, Yuangang Pan, Xuanqian Wang, Ivor W. Tsang, Xiuju Fu
Abstract: Worst-case fairness with off-the-shelf demographics achieves group parity by maximizing the model utility of the worst-off group. Nevertheless, demographic information is often unavailable in practical scenarios, which impedes the use of such a direct max-min formulation. Recent advances have reframed this learning problem by introducing the lower bound of minimal partition ratio, denoted as $\alpha$, as side information, referred to as "$\alpha$-sized worst-case fairness" in this paper. We first justify the practical significance of this setting by presenting noteworthy evidence from the data privacy perspective, which has been overlooked by existing research. Without imposing specific requirements on loss functions, we propose reweighting the training samples based on their intrinsic importance to fairness. Given the global nature of the worst-case formulation, we further develop a stochastic learning scheme to simplify the training process without compromising model performance. Additionally, we address the issue of outliers and provide a robust variant to handle potential outliers during model training. Our theoretical analysis and experimental observations reveal the connections between the proposed approaches and existing "fairness-through-reweighting" studies, with extensive experimental results on fairness benchmarks demonstrating the superiority of our methods.
What problem does this paper attempt to address?
This paper addresses how to improve worst-case fairness when demographic information is unavailable. Specifically, it studies how to optimize worst-case group fairness when sensitive attributes (such as race or gender) are unobserved, using a lower bound α on the minimal partition ratio as side information. The main points are:
1. **Challenges of Missing Demographic Information**:
- In practice, demographic information (such as race or gender) is often absent from the training data for legal or privacy reasons. This prevents direct use of the max-min formulation to ensure group fairness.
2. **α-Sized Worst-Case Fairness**:
- To address this, the paper uses a lower bound α on the minimal partition ratio as side information, which constrains the group size. This setting is called "α-sized worst-case fairness", and it allows worst-case fairness to be optimized indirectly, without group labels.
3. **Intrinsic Reweighting (IRW)**:
- The paper proposes a new reweighting method, Intrinsic Reweighting (IRW), which assigns weights according to each training sample's actual contribution to worst-case fairness. Unlike existing methods, IRW depends not only on the loss value but also on gradient information, which reflects sample importance more reliably.
4. **Handling Outliers**:
- The worst-case formulation is vulnerable to outliers. The paper therefore proposes a robust variant, IRWO (Intrinsic Reweighting with Outliers handling), which uses gradient information to identify and remove potential outliers, improving the robustness and performance of the model.
5. **Theoretical Analysis and Experimental Verification**:
- The paper supports the proposed methods with theoretical analysis and extensive experiments, and reports superior performance on multiple fairness benchmark datasets.
### Formula Summary
- The worst-case fairness optimization problem can be expressed as:
\[
\theta^* = \arg\max_{\theta} \min_{k \in [K]} \mathrm{ACC}_k(\theta)
\]
where \(\mathrm{ACC}_k(\theta)\) is the accuracy of the classifier \(f_\theta\) on group \(G_k\), one of the \(K\) groups.
- The objective function after introducing the lower bound α on the minimal partition ratio:
\[
\hat{J}_{N\alpha}(\theta; \alpha) = \frac{1}{N\alpha} \sum_{i=1}^{N\alpha} \ell(\theta; z_{(i)})
\]
where \(\ell(\theta; z_{(i)})\) is the \(i\)-th largest loss value among the \(N\) training samples, so the objective averages the \(N\alpha\) largest losses.
- The weight update formula in the Intrinsic Reweighting method:
\[
w_{t,i} = \frac{[v_{t,i}]_+}{\sum_j [v_{t,j}]_+ + \delta\left(\sum_j [v_{t,j}]_+\right)}
\]
where \(v_{t,i} = (\nabla_\theta F(\theta_t))^\top \nabla_\theta \ell(\theta_t; z_i)\), \([a]_+ = \max(a, 0)\), and \(\delta(a) = 1\) if \(a = 0\), otherwise \(\delta(a) = 0\). The \(\delta\) term keeps the denominator nonzero when every \([v_{t,j}]_+\) is zero.
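The three formulas above can be sketched in plain Python to make the arithmetic concrete. This is a minimal illustration and not the authors' implementation: the function names are hypothetical, `grad_F` is simply passed in as a vector because this summary does not define \(F\), and rounding \(N\alpha\) to the nearest integer is an assumption made so that the top-loss count is a whole number.

```python
def worst_group_accuracy(correct, groups):
    """min_k ACC_k: accuracy of the worst-off group (requires group labels)."""
    totals, hits = {}, {}
    for c, g in zip(correct, groups):
        totals[g] = totals.get(g, 0) + 1
        hits[g] = hits.get(g, 0) + int(c)
    return min(hits[g] / totals[g] for g in totals)


def alpha_sized_objective(losses, alpha):
    """Average of the N*alpha largest per-sample losses.

    Assumption: N*alpha is rounded to the nearest integer, with a floor of 1.
    """
    k = max(1, round(len(losses) * alpha))
    top = sorted(losses, reverse=True)[:k]
    return sum(top) / k


def irw_weights(grad_F, sample_grads):
    """w_i = [v_i]_+ / (sum_j [v_j]_+ + delta(sum_j [v_j]_+)),
    where v_i is the inner product of grad_F and the i-th sample gradient.
    """
    v_plus = [
        max(0.0, sum(a * b for a, b in zip(grad_F, g)))
        for g in sample_grads
    ]
    total = sum(v_plus)
    # delta(total) is 1 only when total == 0, which avoids division by zero.
    denom = total + (1.0 if total == 0 else 0.0)
    return [v / denom for v in v_plus]
```

For example, with `grad_F = [1.0, 0.0]` and sample gradients `[[2.0, 5.0], [-1.0, 3.0], [2.0, -4.0]]`, the inner products are 2, -1 and 2, so `irw_weights` returns `[0.5, 0.0, 0.5]`: the sample whose gradient opposes `grad_F` receives zero weight. Note that only `worst_group_accuracy` needs group labels; the other two functions work from per-sample losses and gradients alone.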
Taken together, these methods and formulas address how to optimize worst-case fairness in the absence of demographic information, and the outlier-handling variant makes the approach more robust.