Semi-supervised Concept Bottleneck Models

Lijie Hu,Tianhao Huang,Huanyi Xie,Chenyang Ren,Zhengyu Hu,Lu Yu,Di Wang
2024-06-27
Abstract:Concept Bottleneck Models (CBMs) have garnered increasing attention due to their ability to provide concept-based explanations for black-box deep learning models while achieving high final prediction accuracy using human-like concepts. However, the training of current CBMs heavily relies on the accuracy and richness of annotated concepts in the dataset. These concept labels are typically provided by experts, which can be costly and require significant resources and effort. Additionally, concept saliency maps frequently misalign with input saliency maps, causing concept predictions to correspond to irrelevant input features - an issue related to annotation alignment. To address these limitations, we propose a new framework called SSCBM (Semi-supervised Concept Bottleneck Model). Our SSCBM is suitable for practical situations where annotated data is scarce. By leveraging joint training on both labeled and unlabeled data and aligning the unlabeled data at the concept level, we effectively solve these issues. We proposed a strategy to generate pseudo labels and an alignment loss. Experiments demonstrate that our SSCBM is both effective and efficient. With only 20% labeled data, we achieved 93.19% (96.39% in a fully supervised setting) concept accuracy and 75.51% (79.82% in a fully supervised setting) prediction accuracy.
Computer Vision and Pattern Recognition,Artificial Intelligence,Machine Learning
What problem does this paper attempt to address?
The main problem that this paper attempts to address is the high dependency of existing Concept Bottleneck Models (CBMs) on annotated concept data during training, which leads to several key issues: 1. **High annotation cost**: High-quality concept labels usually require expert input, which is not only time-consuming but also expensive. 2. **Scarcity of annotated data**: In practical applications, obtaining a large amount of annotated data is often unrealistic, especially in critical fields such as healthcare and finance. 3. **Alignment issues between concepts and input features**: In existing CBMs methods, concept saliency maps often do not align with input saliency maps, leading to concept predictions corresponding to irrelevant input features. To address these issues, the paper proposes a new framework—Semi-Supervised Concept Bottleneck Model (SSCBM). SSCBM effectively addresses the aforementioned problems through the following approaches: - **Joint training with a small amount of annotated data and a large amount of unannotated data**: In this way, SSCBM can maintain high performance even in the case of scarce annotated data. - **Generating pseudo-labels**: Using the K-Nearest Neighbors (KNN) algorithm to generate pseudo-labels for unannotated data, thereby making full use of these data during training. - **Alignment loss**: Introducing alignment loss to ensure the alignment between concept embeddings and input images, thereby improving the interpretability and prediction accuracy of the model. Experimental results show that SSCBM can achieve a concept accuracy of 93.19% and a prediction accuracy of 75.51% using only 20% of the annotated data, which is close to the performance under fully supervised settings. This indicates that SSCBM has high practical value in real-world applications.