Abstract:Concept Bottleneck Models (CBMs) have garnered increasing attention due to their ability to provide concept-based explanations for black-box deep learning models while achieving high final prediction accuracy using human-like concepts. However, the training of current CBMs heavily relies on the accuracy and richness of annotated concepts in the dataset. These concept labels are typically provided by experts, which can be costly and require significant resources and effort. Additionally, concept saliency maps frequently misalign with input saliency maps, causing concept predictions to correspond to irrelevant input features - an issue related to annotation alignment. To address these limitations, we propose a new framework called SSCBM (Semi-supervised Concept Bottleneck Model). Our SSCBM is suitable for practical situations where annotated data is scarce. By leveraging joint training on both labeled and unlabeled data and aligning the unlabeled data at the concept level, we effectively solve these issues. We proposed a strategy to generate pseudo labels and an alignment loss. Experiments demonstrate that our SSCBM is both effective and efficient. With only 20% labeled data, we achieved 93.19% (96.39% in a fully supervised setting) concept accuracy and 75.51% (79.82% in a fully supervised setting) prediction accuracy.

What problem does this paper attempt to address?

The main problem that this paper attempts to address is the high dependency of existing Concept Bottleneck Models (CBMs) on annotated concept data during training, which leads to several key issues: 1. **High annotation cost**: High-quality concept labels usually require expert input, which is not only time-consuming but also expensive. 2. **Scarcity of annotated data**: In practical applications, obtaining a large amount of annotated data is often unrealistic, especially in critical fields such as healthcare and finance. 3. **Alignment issues between concepts and input features**: In existing CBMs methods, concept saliency maps often do not align with input saliency maps, leading to concept predictions corresponding to irrelevant input features. To address these issues, the paper proposes a new framework—Semi-Supervised Concept Bottleneck Model (SSCBM). SSCBM effectively addresses the aforementioned problems through the following approaches: - **Joint training with a small amount of annotated data and a large amount of unannotated data**: In this way, SSCBM can maintain high performance even in the case of scarce annotated data. - **Generating pseudo-labels**: Using the K-Nearest Neighbors (KNN) algorithm to generate pseudo-labels for unannotated data, thereby making full use of these data during training. - **Alignment loss**: Introducing alignment loss to ensure the alignment between concept embeddings and input images, thereby improving the interpretability and prediction accuracy of the model. Experimental results show that SSCBM can achieve a concept accuracy of 93.19% and a prediction accuracy of 75.51% using only 20% of the annotated data, which is close to the performance under fully supervised settings. This indicates that SSCBM has high practical value in real-world applications.

Semi-supervised Concept Bottleneck Models

Eliminating Information Leakage in Hard Concept Bottleneck Models with Supervised, Hierarchical Concept Learning

Incremental Residual Concept Bottleneck Models

On the Concept Trustworthiness in Concept Bottleneck Models

Concept Bottleneck Model with Additional Unsupervised Concepts

Can we Constrain Concept Bottleneck Models to Learn Semantically Meaningful Input Features?

Stochastic Concept Bottleneck Models

Post-hoc Concept Bottleneck Models

Decoupling Concept Bottleneck Model

Concept Bottleneck Models Without Predefined Concepts

Improving Concept Alignment in Vision-Language Concept Bottleneck Models

Label-Free Concept Bottleneck Models

Sparse Concept Bottleneck Models: Gumbel Tricks in Contrastive Learning

Editable Concept Bottleneck Models

Aligning Visual and Semantic Interpretability through Visually Grounded Concept Bottleneck Models

The Decoupling Concept Bottleneck Model

Energy-Based Concept Bottleneck Models: Unifying Prediction, Concept Intervention, and Probabilistic Interpretations

Bayesian Concept Bottleneck Models with LLM Priors

VLG-CBM: Training Concept Bottleneck Models with Vision-Language Guidance

Coarse-to-Fine Concept Bottleneck Models

Counterfactual Concept Bottleneck Models