Abstract:Clustering-based methods, which alternate between the generation of pseudo labels and the optimization of the feature extraction network, play a dominant role in both unsupervised learning (USL) and unsupervised domain adaptive (UDA) person re-identification (Re-ID). To alleviate the adverse effect of noisy pseudo labels, the existing methods either abandon unreliable labels or refine the pseudo labels via mutual learning or label propagation. However, a great many erroneous labels are still accumulated because these methods mostly adopt traditional unsupervised clustering algorithms which rely on certain assumptions on data distribution and fail to capture the distribution of complex real-world data. In this paper, we propose the plug-and-play graph-based pseudo label correction network (GLC) to refine the pseudo labels in the manner of supervised clustering. GLC is trained to perceive the varying data distribution at each epoch of the self-training with the supervision of initial pseudo labels generated by any clustering method. It can learn to rectify the initial noisy labels by means of the relationship constraints between samples on the k Nearest Neighbor (kNN) graph and early-stop training strategy. Specifically, GLC learns to aggregate node features from neighbors and predict whether the nodes should be linked on the graph. Besides, GLC is optimized with 'early stop' before the noisy labels are severely memorized to prevent overfitting to noisy pseudo labels. Consequently, GLC improves the quality of pseudo labels though the supervision signals contain some noise, leading to better Re-ID performance. Extensive experiments in USL and UDA person Re-ID on Market-1501 and MSMT17 show that our method is widely compatible with various clustering-based methods and promotes the state-of-the-art performance consistently.

Identifying and Correcting Mislabeled Training Instances

Identifying and Correcting Mislabled Training Instances Using Bayes

COMIRE: A Consistence-Based Mislabeled Instances Removal Method

OT Cleaner: Label Correction As Optimal Transport

Interactive Correction of Mislabeled Training Data

Learning Discrimination from Contaminated Data: Multi-Instance Learning for Unsupervised Anomaly Detection

FGCM: Noisy Label Learning via Fine-Grained Confidence Modeling

Error-Bounded Correction of Noisy Labels

Detecting Label Errors in Token Classification Data

Improved Naive Bayes with Mislabeled Data.

Identifying Mislabeled Data using the Area Under the Margin Ranking

Confident Learning: Estimating Uncertainty in Dataset Labels

A Mutually Attentive Co-Training Framework for Semi-Supervised Recognition

Classification with Label Noise: a Markov Chain Sampling Framework.

Plug-and-Play Pseudo Label Correction Network for Unsupervised Person Re-identification

Two Wrongs Don't Make a Right: Combating Confirmation Bias in Learning with Label Noise.

Synergistic Network Learning and Label Correction for Noise-robust Image Classification

Learning to Detect Noisy Labels Using Model-Based Features

An joint end-to-end framework for learning with noisy labels

Mislabeled examples detection viewed as probing machine learning models: concepts, survey and extensive benchmark

An Empirical Study of Automated Mislabel Detection in Real World Vision Datasets