Enhancing Label Sharing Efficiency in Complementary-Label Learning with Label Augmentation

Wei-I Lin, Gang Niu, Hsuan-Tien Lin, Masashi Sugiyama
DOI: https://doi.org/10.48550/arXiv.2305.08344
2023-05-15
Abstract: Complementary-label Learning (CLL) is a form of weakly supervised learning that trains an ordinary classifier using only complementary labels, which are the classes that certain instances do not belong to. While existing CLL studies typically use novel loss functions or training techniques to solve this problem, few studies focus on how complementary labels collectively provide information to train the ordinary classifier. In this paper, we fill the gap by analyzing the implicit sharing of complementary labels on nearby instances during training. Our analysis reveals that the efficiency of implicit label sharing is closely related to the performance of existing CLL models. Based on this analysis, we propose a novel technique that enhances the sharing efficiency via complementary-label augmentation, which explicitly propagates additional complementary labels to each instance. We carefully design the augmentation process to enrich the data with new and accurate complementary labels, which provide CLL models with fresh and valuable information to enhance the sharing efficiency. We then verify our proposed technique by conducting thorough experiments on both synthetic and real-world datasets. Our results confirm that complementary-label augmentation can systematically improve empirical performance over state-of-the-art CLL models.
Machine Learning
### What problem does this paper attempt to solve?

This paper addresses a key problem in **Complementary-Label Learning (CLL)**: how to improve the efficiency with which complementary labels are shared during training. CLL is a weakly supervised learning setting that trains a classifier using only complementary labels (i.e., classes to which an instance does not belong). Most existing CLL research focuses on designing new loss functions or training techniques; less attention is paid to how complementary labels collectively provide information to train an ordinary classifier.

The main contributions of the paper are:

1. **Analysis of implicit label sharing**: The authors find that the success of CLL models depends on the implicit label sharing brought about by the smoothness of the neural network, and that the efficiency of this implicit sharing is closely related to model performance.
2. **An explicit label augmentation method**: To further improve label-sharing efficiency, the authors propose a new technique, **complementary-label augmentation**, which enriches the data with new, valuable information by explicitly propagating additional complementary labels to each instance.
3. **Verification of the method**: Through experiments on synthetic and real-world datasets, the authors verify that complementary-label augmentation systematically improves the performance of existing state-of-the-art CLL models.

### Specific problem description

In CLL, since only complementary labels are available for training, the model must infer the correct class from this incomplete information. Existing methods, however, share labels inefficiently, which limits model performance. For this reason, the paper proposes an explicit label augmentation method that improves the efficiency of label sharing and thereby the classification accuracy of the model.

### Summary of mathematical formulas

1. **Definition of implicit label-sharing efficiency**:

   \[ \text{Implicit label-sharing efficiency} = 1-\frac{1}{N}\sum_{i = 1}^{N}\frac{1}{K - 2}\sum_{\bar{y}'\notin\{y_i,\bar{y}_i\}}f_{\bar{y}'}(x_i) \]

2. **The augmented loss function**:

   \[ R'(g;\ell)=\alpha\cdot\frac{1}{N}\sum_{i = 1}^{N}\ell(\bar{y}_i,g(x_i))+(1 - \alpha)\cdot\frac{1}{N}\sum_{i = 1}^{N}\sum_{k = 1}^{N_K}w_{i,k}\ell(\bar{y}_{i,k},g(x_i)) \]

3. **Matrix form of multi-step label augmentation**:

   \[ Z=\alpha\bar{Y}+(1 - \alpha)WZ \]

   where \(\bar{Y}\) is the original complementary-label matrix, \(W\) is the affinity matrix, and \(Z\) is the augmented soft-label matrix.

Together, these formulas describe how the paper uses explicit label augmentation to improve the performance of CLL models.
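To make the matrix form \(Z=\alpha\bar{Y}+(1-\alpha)WZ\) concrete, the following is a minimal NumPy sketch, not the authors' implementation. It assumes that \(\bar{Y}\) is an \(N \times K\) one-hot matrix of complementary labels, that \(W\) is a row-normalized \(N \times N\) affinity matrix, and that \(0 < \alpha \le 1\). Under those assumptions, rearranging the equation gives the closed form \(Z=\alpha\,(I-(1-\alpha)W)^{-1}\bar{Y}\), and repeatedly applying the update converges to the same matrix. The function names and the random toy data are hypothetical and chosen only for illustration.

```python
import numpy as np


def propagate_fixed_point(Y_bar, W, alpha, n_steps=200):
    """Iterate Z <- alpha * Y_bar + (1 - alpha) * W @ Z for n_steps steps."""
    Z = Y_bar.astype(float).copy()
    for _ in range(n_steps):
        Z = alpha * Y_bar + (1.0 - alpha) * (W @ Z)
    return Z


def propagate_closed_form(Y_bar, W, alpha):
    """Solve (I - (1 - alpha) * W) Z = alpha * Y_bar directly."""
    n = W.shape[0]
    return alpha * np.linalg.solve(np.eye(n) - (1.0 - alpha) * W, Y_bar)


# Toy problem (assumed sizes): N instances, K classes.
rng = np.random.default_rng(0)
N, K = 6, 4

# Hypothetical affinity matrix: random non-negative weights, no self-loops,
# rows normalized to sum to 1 (an assumption, not taken from the summary).
A = rng.random((N, N))
np.fill_diagonal(A, 0.0)
W = A / A.sum(axis=1, keepdims=True)

# One complementary label per instance, encoded as a one-hot row of Y_bar.
comp_labels = rng.integers(0, K, size=N)
Y_bar = np.eye(K)[comp_labels]

Z_iter = propagate_fixed_point(Y_bar, W, alpha=0.5)
Z_closed = propagate_closed_form(Y_bar, W, alpha=0.5)

# The two routes should agree up to numerical tolerance.
print(np.allclose(Z_iter, Z_closed))
```

With a row-normalized \(W\) and one-hot rows in \(\bar{Y}\), each row of \(Z\) sums to 1, so it can be read as a soft complementary-label distribution for that instance. A smaller \(\alpha\) gives more weight to labels propagated from neighboring instances.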