Adaptive Negative Evidential Deep Learning for Open-set Semi-supervised Learning

Yang Yu,Danruo Deng,Furui Liu,Yueming Jin,Qi Dou,Guangyong Chen,Pheng-Ann Heng
2024-04-14
Abstract:Semi-supervised learning (SSL) methods assume that labeled data, unlabeled data and test data are from the same distribution. Open-set semi-supervised learning (Open-set SSL) considers a more practical scenario, where unlabeled data and test data contain new categories (outliers) not observed in labeled data (inliers). Most previous works focused on outlier detection via binary classifiers, which suffer from insufficient scalability and inability to distinguish different types of uncertainty. In this paper, we propose a novel framework, Adaptive Negative Evidential Deep Learning (ANEDL) to tackle these limitations. Concretely, we first introduce evidential deep learning (EDL) as an outlier detector to quantify different types of uncertainty, and design different uncertainty metrics for self-training and inference. Furthermore, we propose a novel adaptive negative optimization strategy, making EDL more tailored to the unlabeled dataset containing both inliers and outliers. As demonstrated empirically, our proposed method outperforms existing state-of-the-art methods across four datasets.
Machine Learning,Artificial Intelligence,Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
### What problems does this paper attempt to solve? This paper aims to solve the challenges in open - set semi - supervised learning (Open - set SSL), especially when unlabeled data and test data contain new classes (i.e., outliers or OOD samples) that have not been seen in the labeled data. Specifically, the paper focuses on the following two main problems: 1. **Limitations of existing methods**: - **Insufficient uncertainty quantification**: Most existing methods use binary classifiers for anomaly detection. These classifiers can only provide point - estimated probabilities and cannot distinguish different types of uncertainty (such as epistemic uncertainty and aleatoric uncertainty). - **Poor robustness to complex datasets**: When dealing with multi - class information, softmax - based binary classifiers perform poorly in identifying outliers. 2. **How to effectively utilize the information of unlabeled data**: - Existing methods fail to fully utilize the information contained in unlabeled data to improve the model. Especially when distinguishing outliers from normal values, they fail to effectively use this information for model regularization. ### The method proposed in the paper To solve the above problems, the paper proposes a new framework named Adaptive Negative Evidential Deep Learning (ANEDL). The main contributions of ANEDL include: - **Introducing Evidential Deep Learning (EDL) as an anomaly detector**: By regarding the network output as evidence to parameterize the Dirichlet distribution, different types of uncertainty are quantified, and different uncertainty measures are designed for self - training and inference. - **Proposing an adaptive negative optimization strategy**: For unlabeled datasets containing normal and abnormal values, explicitly adjust the EDL detector to output low evidence values to reduce the influence of uncertain categories, and encourage the model to pay more attention to these uncertain categories through adaptive loss weights. - **Jointly optimizing Softmax and EDL**: Combine the Softmax head for representation learning and classification probability prediction, and at the same time use the EDL head to quantify uncertainty, thereby improving classification performance. ### Experimental results The paper conducted extensive experiments on four public datasets (CIFAR - 10, CIFAR - 100, ImageNet - 30, and Mini - ImageNet), and the results show that ANEDL outperforms the existing state - of - the - art methods in different settings. ### Summary This paper solves the challenges of anomaly detection and classification in open - set semi - supervised learning by introducing Evidential Deep Learning and an adaptive negative optimization strategy, significantly improving the performance and robustness of the model.