Abstract:Label noise is now a common problem in many applications, which may lead to significant learning performance degeneration. To deal with the label noise, Active Label Correction (ALC) was proposed to query the true labels for a small subset of instances. As the true labels costs can be high, the focus of ALC is to maximally improve the learning performance with minimal query costs. Existing ALC methods mainly proceed by querying the most likely mislabeled instances, or using criteria derived from standard active learning. In this paper, we focus on deep neural network models and show that due to their intrinsic memorization effect, the true labels of a large proportion of mislabeled instances can be correctly predicted with early stopped training, even under severe noise. Inspired by this, we propose to train deep label noise learning models robustly with dual ALC (DALC): on one hand, we select the most useful instances for classifier improvement and query their true labels from external experts; on the other hand, due to the active data sampling bias, the label noise model estimation can be highly biased, which may in turn hurt the classifier learning. To alleviate this issue, we propose to identify the instances that are most likely predicted with true labels by the classifier, and take the predictions as their true labels. By integrating the two sources of true labels, we experiment on multiple benchmark datasets with various label noise rate and show the effectiveness of the proposed DALC on both the classification accuracy and the label noise model estimation. The code is available at https://github.com/lilylisy/mlj21DALC.

Confidence Adaptive Regularization for Deep Learning with Noisy Labels

FGCM: Noisy Label Learning via Fine-Grained Confidence Modeling

Early-Learning Regularization Prevents Memorization of Noisy Labels

Searching to Exploit Memorization Effect in Deep Learning with Noisy Labels.

Learning with Noisy Labels Via Self-supervised Adversarial Noisy Masking

On Better Detecting and Leveraging Noisy Samples for Learning with Severe Label Noise

Unleashing the Potential of Regularization Strategies in Learning with Noisy Labels

Class-Independent Regularization for Learning with Noisy Labels.

Feature-Induced Label Distribution for Learning with Noisy Labels

Learning with Neighbor Consistency for Noisy Labels

A Simple yet Effective Baseline for Robust Deep Learning with Noisy Labels

Mitigating Memorization in Sample Selection for Learning with Noisy Labels

MILD: Modeling the Instance Learning Dynamics for Learning with Noisy Labels

Improving deep label noise learning with dual active label correction

Robust early-learning: Hindering the memorization of noisy labels

Learning with Noisy Labels Via Sparse Regularization

Two Wrongs Don't Make a Right: Combating Confirmation Bias in Learning with Label Noise.

Learning Confident Classifiers in the Presence of Label Noise

An Ensemble Model for Combating Label Noise

Safeguarded Dynamic Label Regression for Noisy Supervision

Neighborhood Collective Estimation for Noisy Label Identification and Correction