Abstract:Although machine learning algorithms demonstrate impressive performance, their trustworthiness remains a critical issue, particularly concerning fairness when implemented in real-world applications. Many notions of group fairness aim to minimize disparities in performance across protected groups. However, it can inadvertently reduce performance in certain groups, leading to sub-optimal outcomes. In contrast, Min-max group fairness notion prioritizes the improvement for the worst-performing group, thereby advocating a utility-promoting approach to fairness. However, it has been proven that existing efforts to achieve Min-max fairness exhibit limited effectiveness. In response to this challenge, we leverage the recently proposed "Neural Collapse'' framework to re-examine Empirical Risk Minimization (ERM) training, specifically investigating the root causes of poor performance in minority groups. The layer-peeled model is employed to decompose a network into two parts: an encoder to learn latent representation, and a subsequent classifier, with a systematic characterization of their training behaviors being conducted. Our analysis reveals that while classifiers achieve maximum separation, the separability of representations is insufficient, particularly for minority groups. This indicates the sub-optimal performance in minority groups stems from less separable representations, rather than classifiers. To tackle this issue, we introduce a novel strategy that incorporates a frozen classifier to directly enhance representation. Furthermore, we introduce two easily implemented loss functions to guide the learning process. The experimental assessments carried out on real-world benchmark datasets spanning the domains of Computer Vision, Natural Language Processing, and Tabular data demonstrate that our approach outperforms existing state-of-the-art methods in promoting the Min-max fairness notion.

Exploring deep neural networks via layer-peeled model: Minority collapse in imbalanced training

Layer-Peeled Model: Toward Understanding Well-Trained Deep Neural Networks

Neural Collapse in Deep Linear Networks: From Balanced to Imbalanced Data

The Exploration of Neural Collapse under Imbalanced Data

Towards Deeper Insights into Deep Learning from Imbalanced Data.

Rethinking the Usage of Batch Normalization and Dropout in the Training of Deep Neural Networks

Neural Collapse Inspired Debiased Representation Learning for Min-max Fairness

Inducing Neural Collapse in Imbalanced Learning: Do We Really Need a Learnable Classifier at the End of Deep Neural Network?

Neural Collapse versus Low-rank Bias: Is Deep Neural Collapse Really Optimal?

Neural Collapse for Cross-entropy Class-Imbalanced Learning with Unconstrained ReLU Feature Model

Wide Neural Networks Trained with Weight Decay Provably Exhibit Neural Collapse

Deep Neural Collapse Is Provably Optimal for the Deep Unconstrained Features Model

Neural Collapse Under MSE Loss: Proximity to and Dynamics on the Central Path

Towards Understanding Neural Collapse: The Effects of Batch Normalization and Weight Decay

The Persistence of Neural Collapse Despite Low-Rank Bias: An Analytic Perspective Through Unconstrained Features

Beyond Unconstrained Features: Neural Collapse for Shallow Neural Networks with General Data

Imbalanced Deep Learning by Minority Class Incremental Rectification

A Layer-Wise Theoretical Framework for Deep Learning of Convolutional Neural Networks

Neural Collapse in the Intermediate Hidden Layers of Classification Neural Networks

Limitations of Neural Collapse for Understanding Generalization in Deep Learning

Prevalence of Neural Collapse during the terminal phase of deep learning training