Abstract: Label noise is ubiquitous in real-world scenarios; it often misleads the training algorithm and degrades classification performance. Many approaches have therefore been proposed to correct the loss function given corrupted labels. Among them, one line of work achieves this goal by unbiasedly estimating the data centroid, which plays an important role in constructing an unbiased risk estimator for minimization. However, these methods usually handle the noisy labels of all classes at once, so the local information carried by each class is ignored, which often leads to unsatisfactory performance. To address this defect, this paper presents a novel robust learning algorithm dubbed "Class-Wise Denoising" (CWD), which tackles the noisy labels in a class-wise way to ease the overall noise correction task. Specifically, two virtual auxiliary sets are constructed by presuming, respectively, that the positive labels and the negative labels in the training set are clean, so that the original false-negative and false-positive labels are tackled separately. As a result, an improved centroid estimator can be designed, which helps to yield a more accurate risk estimator. Theoretically, we prove that: 1) the variance of centroid estimation can often be reduced by CWD compared with existing methods that use an unbiased centroid estimator; and 2) the performance of CWD trained on the noisy set converges to that of the optimal classifier trained on the clean set at a rate of O(1/√n), where n is the number of training examples. These sound theoretical properties enable CWD to produce improved classification performance under label noise, which is also demonstrated by comparisons with ten representative state-of-the-art methods on a variety of benchmark datasets.
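
To make the notion of an unbiased centroid estimator concrete, the following is a minimal sketch of the kind of all-classes-at-once baseline the abstract contrasts CWD against. It is not the CWD estimator itself, whose construction the abstract does not spell out. The sketch assumes binary labels in {-1, +1}, class-conditional label noise, and known flip rates. The function name `unbiased_centroid` and the parameters `rho_pos` and `rho_neg` are hypothetical and chosen only for illustration.

```python
import numpy as np


def unbiased_centroid(X, y_noisy, rho_pos, rho_neg):
    """Estimate the clean centroid E[y * x] from noisy labels.

    Assumptions (not taken from the paper):
      - labels are in {-1, +1}
      - rho_pos = P(noisy = -1 | clean = +1)
      - rho_neg = P(noisy = +1 | clean = -1)
      - both flip rates are known and rho_pos + rho_neg < 1
    """
    X = np.asarray(X, dtype=float)
    y_noisy = np.asarray(y_noisy, dtype=float)

    denom = 1.0 - rho_pos - rho_neg
    if denom <= 0:
        raise ValueError("rho_pos + rho_neg must be smaller than 1")

    # Flip rate of the class matching the observed label, and of the other class.
    rho_same = np.where(y_noisy > 0, rho_pos, rho_neg)
    rho_opp = np.where(y_noisy > 0, rho_neg, rho_pos)

    # Per-example correction so that the expectation over the label noise
    # equals the clean quantity y * x.
    weights = (1.0 - rho_opp + rho_same) / denom

    return np.mean((weights * y_noisy)[:, None] * X, axis=0)
```

With both flip rates set to zero, the function reduces to the plain sample mean of y * x. The correction is applied to every example with the same formula regardless of its class, which is the all-at-once treatment the abstract describes as ignoring class-local information.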

Class noise detection by multiple voting

Noise is the Fatal Poison: A Noise-aware Network for Noisy Dataset Classification

Label noise detection under the Noise at Random model with ensemble filters

Rethinking Noisy Label Learning in Real-world Annotation Scenarios from the Noise-type Perspective

Certainty weighted voting-based noise correction for crowdsourcing

Cost-sensitive probability for weighted voting in an ensemble model for multi-class classification problems

A boosting method to detect noisy data

A Parallel Impulse-Noise Detection Algorithm Based On Ensemble Learning For Switching Median Filters

A Unified Framework for Connecting Noise Modeling to Boost Noise Detection

Bagged Voting Ensembles

The Dynamic of Consensus in Deep Networks and the Identification of Noisy Labels

Novel hybrid ensemble credit scoring model with stacking-based noise detection and weight assignment

A Bi-level Formulation for Label Noise Learning with Spectral Cluster Discovery

Class-Wise Denoising for Robust Learning under Label Noise

Ensemble Network-Based Distillation for Hyperspectral Image Classification in the Presence of Label Noise

Decoding class dynamics in learning with noisy labels

An Ensemble Noise-Robust K-fold Cross-Validation Selection Method for Noisy Labels

mCRF and mRD: Two Classification Methods Based on a Novel Multiclass Label Noise Filtering Learning Framework

Ensemble Classifier Design Based on Perturbation Binary Salp Swarm Algorithm for Classification

Supply chain finance credit risk assessment using support vector machine–based ensemble improved with noise elimination

Improving Speaker Verification with Noise-Aware Label Ensembling and Sample Selection: Learning and Correcting Noisy Speaker Labels