Abstract:Problem definition: Data analytics models and machine learning algorithms are increasingly deployed to support consequential decision-making processes, from deciding which applicants will receive job offers and loans to university enrollments and medical interventions. However, recent studies show these models may unintentionally amplify human bias and yield significant unfavorable decisions to specific groups. Methodology/results: We propose a distributionally robust classification model with a fairness constraint that encourages the classifier to be fair in the equality of opportunity criterion. We use a type-[Formula: see text] Wasserstein ambiguity set centered at the empirical distribution to represent distributional uncertainty and derive a conservative reformulation for the worst-case equal opportunity unfairness measure. We show that the model is equivalent to a mixed binary conic optimization problem, which standard off-the-shelf solvers can solve. We propose a convex, hinge-loss-based model for large problem instances whose reformulation does not incur binary variables to improve scalability. Moreover, we also consider the distributionally robust learning problem with a generic ground transportation cost to hedge against the label and sensitive attribute uncertainties. We numerically examine the performance of our proposed models on five real-world data sets related to individual analysis. Compared with the state-of-the-art methods, our proposed approaches significantly improve fairness with negligible loss of predictive accuracy in the testing data set. Managerial implications: Our paper raises awareness that bias may arise when predictive models are used in service and operations. It generally comes from human bias, for example, imbalanced data collection or low sample sizes, and is further amplified by algorithms. Incorporating fairness constraints and the distributionally robust optimization (DRO) scheme is a powerful way to alleviate algorithmic biases. Funding: This work was supported by the National Science Foundation [Grants 2342505 and 2343869] and the Chinese University of Hong Kong [Grant 4055191]. Supplemental Material: The online appendices are available at https://doi.org/10.1287/msom.2022.0230 .

FairDR: Ensuring Fairness in Mixed Data of Fairly and Unfairly Treated Instances.

FairRec: Fairness Testing for Deep Recommender Systems

Fairness Through Equality of Effort

Is it Still Fair? A Comparative Evaluation of Fairness Algorithms through the Lens of Covariate Drift

Explaining Algorithmic Fairness Through Fairness-Aware Causal Path Decomposition

FairIF: Boosting Fairness in Deep Learning via Influence Functions with Validation Set Sensitive Attributes

AIM: Attributing, Interpreting, Mitigating Data Unfairness

AdapFair: Ensuring Continuous Fairness for Machine Learning Operations

FairFML: Fair Federated Machine Learning with a Case Study on Reducing Gender Disparities in Cardiac Arrest Outcome Prediction

Unified Group Fairness on Federated Learning

FaiR-N: Fair and Robust Neural Networks for Structured Data

Fairness-enhancing mixed effects deep learning improves fairness on in- and out-of-distribution clustered (non-iid) data

FairFix: Enhancing Fairness of Pre-Trained Deep Neural Networks with Scarce Data Resources

Fairness Without Harm: An Influence-Guided Active Sampling Approach

Fairness with Adaptive Weights.

Wasserstein Robust Classification with Fairness Constraints

Bridging Fairness Gaps: A (Conditional) Distance Covariance Perspective in Fairness Learning

Chasing Fairness Under Distribution Shift: A Model Weight Perturbation Approach

Does Machine Bring in Extra Bias in Learning? Approximating Fairness in Models Promptly

Data vs. Model Machine Learning Fairness Testing: An Empirical Study

FADE: Towards Fairness-aware Augmentation for Domain Generalization via Classifier-Guided Score-based Diffusion Models