Abstract:With the rise of artificial intelligence in recent years,along with the improvement of hardware computing power,deep learning has emerged as the new paradigm for artificial intelligence algorithms.In realistic multi-class classification scenarios,deep learning relies heavily on the availability of massive manually labeled data;the limitations of labeling costs and privacy protections,however,often make it difficult to obtain adequate amounts of appropriately labeled data for deep learning.Recently,crowdsourcing and web crawling have provided an easy way to collect large amounts of labeled data,but they are limited by the inevitable introduction of label noise.As deep neural networks have a high capacity to fit noisy labels,it is challenging to train deep networks robustly with noisy labels.For robust learning,existing works commonly rely explicitly or implicitly on a given set of anchor points,i.e.,instances that almost certainly belong to the true classes.Unfortunately,anchor points are difficult to obtain in practice,which makes these works fragile.To address this problem,in this paper,we build an anchor-free statistically consistent algorithm in the presence of label noise by creatively transforming the multi-class label-noise learning problem into a mixture proportion estimation(MPE)problem.This paper makes the following contributions:(i)we for the first time generalize the existing Regrouping-MPE(R-MPE)method that is only suitable for two-component scenarios,and propose a multi-component oriented R-MPE(MR-MPE)method without relying on the common irreducible assumption;and(ii)from a theoretical perspective,we demonstrate that the anchor point hypothesis for label-noise learning is equivalent to the irreducible hypothesis for MPE problems in the context of multi-class classification.Therefore,an anchor-free statistically consistent label-noise learning algorithm is subsequently constructed based on the proposed MR-MPE method.In this paper,comparative experiments with existing algorithms are conducted on both synthetic noisy datasets and real-world noisy datasets.The results demonstrate that the proposed algorithm performs most effectively on multiple datasets.Additionally,the robustness of the proposed algorithm is verified when anchor points are removed.

Label-noise Learning Via Mixture Proportion Estimation

Multi-Label Noise Transition Matrix Estimation with Label Correlations: Theory and Algorithm

Estimating Per-Class Statistics for Label Noise Learning

A Holistic View of Label Noise Transition Matrix in Deep Learning and Beyond

Multi-class Label Noise Learning via Loss Decomposition and Centroid Estimation

Classification with Label Noise: a Markov Chain Sampling Framework.

Estimating Noisy Class Posterior with Part-level Labels for Noisy Label Learning

Potential Energy based Mixture Model for Noisy Label Learning

Learning from Noisy Labels with Decoupled Meta Label Purifier

Instance-dependent Label Distribution Estimation for Learning with Label Noise

Instance-specific Label Distribution Regularization for Learning with Label Noise

Regroup Median Loss for Combating Label Noise

Part-dependent Label Noise: Towards Instance-dependent Label Noise

Proxy-based Robust Deep Metric Learning in the Presence of Label Noise

An Active Learning Approach for Multi-Label Image Classification with Sample Noise

Robust Long-Tailed Learning under Label Noise

An Efficient and Provable Approach for Mixture Proportion Estimation Using Linear Independence Assumption.

A Parametrical Model for Instance-Dependent Label Noise

An joint end-to-end framework for learning with noisy labels

Learning from Label Proportions by Learning with Label Noise

Positive Label Is All You Need for Multi-Label Classification