Beyond Submodularity: A Unified Framework of Randomized Set Selection with Group Fairness Constraints

Shaojie Tang,Jing Yuan
DOI: https://doi.org/10.48550/arXiv.2304.06596
2023-04-13
Abstract:Machine learning algorithms play an important role in a variety of important decision-making processes, including targeted advertisement displays, home loan approvals, and criminal behavior predictions. Given the far-reaching impact of these algorithms, it is crucial that they operate fairly, free from bias or prejudice towards certain groups in the population. Ensuring impartiality in these algorithms is essential for promoting equality and avoiding discrimination. To this end we introduce a unified framework for randomized subset selection that incorporates group fairness constraints. Our problem involves a global utility function and a set of group utility functions for each group, here a group refers to a group of individuals (e.g., people) sharing the same attributes (e.g., gender). Our aim is to generate a distribution across feasible subsets, specifying the selection probability of each feasible set, to maximize the global utility function while meeting a predetermined quota for each group utility function in expectation. Note that there may not necessarily be any direct connections between the global utility function and each group utility function. We demonstrate that this framework unifies and generalizes many significant applications in machine learning and operations research. Our algorithmic results either improves the best known result or provide the first approximation algorithms for new applications.
Machine Learning,Artificial Intelligence,Computers and Society,Data Structures and Algorithms
What problem does this paper attempt to address?
The core problem that this paper attempts to solve is to ensure the fairness of machine - learning algorithms in the decision - making process, especially to avoid bias or discrimination against certain groups. Specifically, the author proposes a unified random subset selection framework, which combines group fairness constraints to ensure that each group is appropriately represented in the final result. ### Problem Background With the wide application of machine - learning algorithms in important decision - making processes such as targeted advertising display, housing loan approval, and criminal behavior prediction, the issue of whether these algorithms are biased or discriminatory has attracted wide attention. Ensuring the fairness and unbiasedness of these algorithms is crucial for promoting equality and social justice. To this end, the author introduces a new framework, aiming to maximize the global utility under group fairness constraints through random subset selection. ### Specific Problem Description In the paper, a global utility function \( f: 2^V \to \mathbb{R}_{\geq 0} \) and a set of group utility functions \( g_1, g_2, \dots, g_m: 2^V \to \mathbb{R}_{\geq 0} \) are defined, where \( V \) is a set of items and \( m \) is the number of groups. Each group utility function \( g_t(S) \) measures the utility obtained by the \( t \)-th group from the solution set \( S \). The goal is to maximize the expected value of the global utility function on the premise that the expected utility of each group is not lower than the preset threshold \( \alpha_t \). ### Mathematical Representation Given a set of feasible subsets \( F \), we need to find a distribution \( x \in [0,1]^F \) such that: \[ \max_{x \in [0,1]^F} \sum_{S \in F} x_S f(S) \] while satisfying the following constraints: \[ \sum_{S \in F} x_S g_t(S) \geq \alpha_t, \quad \forall t \in [m] \] \[ \sum_{S \in F} x_S \leq 1 \] ### Main Contributions 1. **Algorithm Framework**: The author develops a polynomial - time algorithm framework based on the ellipsoid method to solve the above - mentioned optimization problem. 2. **Approximation Algorithm**: A polynomial - time \( (\rho, \mu) \)-approximation algorithm is proposed, which is applicable to multiple application scenarios, such as fair - aware submodular maximization, sequential submodular maximization with group constraints, and stochastic combinatorial programming under market share constraints. 3. **Theoretical Analysis**: It is proved that when the global utility function \( f \) is a non - negative monotone submodular function and the group utility functions \( g_t \) are modular functions, a feasible and optimal \( (1 - 1/e, 1) \)-approximate solution can be obtained. ### Application Extension The author also explores other commonly used fairness metrics and proposes effective algorithms to solve these problems. For example: - The upper bound \( \beta_t \) of the expected utility of each group is introduced. - The fairness metric in which the pairwise utility difference between different groups does not exceed the parameter \( \gamma \) is explored. Through these extensions, the author shows the wide applicability and flexibility of his framework, which can handle fairness problems in various practical applications.