Abstract: In noisy label learning, estimating noisy class posteriors plays a fundamental role for developing consistent classifiers, as it forms the basis for estimating clean class posteriors and the transition matrix. Existing methods typically learn noisy class posteriors by training a classification model with noisy labels. However, when labels are incorrect, these models may be misled to overemphasize the feature parts that do not reflect the instance characteristics, resulting in significant errors in estimating noisy class posteriors. To address this issue, this paper proposes to augment the supervised information with part-level labels, encouraging the model to focus on and integrate richer information from various parts. Specifically, our method first partitions features into distinct parts by cropping instances, yielding part-level labels associated with these various parts. Subsequently, we introduce a novel single-to-multiple transition matrix to model the relationship between the noisy and part-level labels, which incorporates part-level labels into a classifier-consistent framework. Utilizing this framework with part-level labels, we can learn the noisy class posteriors more precisely by guiding the model to integrate information from various parts, ultimately improving the classification performance. Our method is theoretically sound, while experiments show that it is empirically effective in synthetic and real-world noisy benchmarks.

What problem does this paper attempt to address?

The paper addresses a key issue in Noisy Label Learning (NLL): how to estimate the noisy class posterior more accurately. This estimate matters because it is the basis for building consistent classifiers, and for estimating both the clean class posterior and the transition matrix. The paper points out that when labels are incorrect, a model trained directly on them may be misled into overemphasizing feature parts that are tied to the wrong label and do not reflect the true characteristics of the sample. The result is a significant error in the estimated noisy class posterior, which then propagates to classifier construction and transition matrix estimation.

To solve this problem, the authors propose a method called "Part-Level Multi-labeling" (PLM). It introduces part-level labels that guide the model to attend to different parts of a sample's features, rather than relying too heavily on a few misleading ones. Specifically, PLM first crops each sample into different parts and uses a trained network to assign a label to each part, producing multiple part-level labels per sample. It then introduces a single-to-multiple transition matrix that models the relationship between the single noisy label and the multiple part-level labels, so that the noisy label and the part-level supervision can be used for joint training within a classifier-consistent framework.

According to the paper, this approach estimates the noisy class posterior more accurately and can serve as a component of loss correction methods to improve existing NLL methods. Experimental results show that PLM significantly improves the accuracy of noisy class posterior estimation and enhances classification performance on various synthetic and real-world datasets.
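The cropping and part-labeling step described above can be illustrated with a minimal sketch. It is not the authors' implementation: the function names `crop_parts`, `part_level_labels`, and `toy_predict_proba` are hypothetical, the sliding-window cropping scheme is an assumption, and the toy classifier merely stands in for the trained network mentioned in the summary. The single-to-multiple transition matrix is not modeled here.

```python
import numpy as np


def crop_parts(image, part_size, stride):
    """Slide a square window over an H x W x C image and return the crops.

    Assumption: a regular sliding window; the paper's cropping scheme may differ.
    """
    height, width = image.shape[:2]
    parts = []
    for top in range(0, height - part_size + 1, stride):
        for left in range(0, width - part_size + 1, stride):
            parts.append(image[top:top + part_size, left:left + part_size])
    return parts


def part_level_labels(parts, predict_proba, num_classes):
    """Return a multi-hot vector marking each class predicted for some part."""
    labels = np.zeros(num_classes, dtype=np.int64)
    for part in parts:
        probs = predict_proba(part)
        labels[int(np.argmax(probs))] = 1
    return labels


def toy_predict_proba(part, num_classes=3):
    """Hypothetical stand-in for a trained network.

    It buckets the mean pixel intensity (assumed to lie in [0, 1]) into a
    class and returns a one-hot probability vector.
    """
    cls = min(int(float(part.mean()) * num_classes), num_classes - 1)
    probs = np.zeros(num_classes)
    probs[cls] = 1.0
    return probs


# A 4 x 4 single-channel image: dark left half, bright right half.
image = np.zeros((4, 4, 1))
image[:, 2:] = 1.0

parts = crop_parts(image, part_size=2, stride=2)
labels = part_level_labels(parts, toy_predict_proba, num_classes=3)
print(len(parts), labels.tolist())
```

In this toy case the dark and bright crops receive different labels, so a single instance ends up with more than one part-level label. That is the kind of additional supervision the summary describes being combined with the one noisy label.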

Estimating Noisy Class Posterior with Part-level Labels for Noisy Label Learning