Abstract: The performance of neural networks increases steadily, but our understanding of their decision-making lags behind. Concept Bottleneck Models (CBMs) address this issue by incorporating human-understandable concepts into the prediction process, thereby enhancing transparency and interpretability. Since existing approaches often rely on large language models (LLMs) to infer concepts, their results may contain inaccurate or incomplete mappings, especially in complex visual domains. We introduce visually Grounded Concept Bottleneck Models (GCBM), which derive concepts on the image level using segmentation and detection foundation models. Our method generates inherently interpretable concepts, which can be grounded in the input image using attribution methods, allowing interpretations to be traced back to the image plane. We show that GCBM concepts are meaningful interpretability vehicles, which aid our understanding of model embedding spaces. GCBMs allow users to control the granularity, number, and naming of concepts, providing flexibility and are easily adaptable to new datasets without pre-training or additional data needed. Prediction accuracy is within 0.3-6% of the linear probe and GCBMs perform especially well for fine-grained classification interpretability on CUB, due to their dataset specificity. Our code is available at https://github.com/KathPra/GCBM.

What problem does this paper attempt to address?

This paper addresses the lack of transparency and interpretability in the decision-making of neural networks. Although deep neural networks (DNNs) keep improving across tasks, the opacity of their decision-making undermines trust in the models. In safety-critical systems in particular, this lack of interpretability limits their use.

#### Specific problem description

1. **Improve model transparency and interpretability**
   - The "black-box" nature of neural networks makes their decision-making hard to understand. Concept Bottleneck Models (CBMs) improve interpretability by routing predictions through human-understandable concepts.
2. **Limitations of existing methods**
   - Many existing CBM methods rely on large language models (LLMs) to generate concepts, which can lead to inaccurate or incomplete mappings, especially in complex visual domains. LLMs may also introduce biases that degrade concept quality and the explanatory power of the model.
3. **Proposed solution**
   - The authors introduce **visually Grounded Concept Bottleneck Models (GCBM)**, which derive concepts at the image level using segmentation and detection foundation models. The resulting concepts are inherently interpretable and can be traced back to the input image with attribution methods.
4. **Flexibility and adaptability**
   - GCBM lets users control the granularity, number, and naming of concepts, and adapts to new datasets without pre-training or additional data. This makes it well suited to domain-specific data.
5. **Evaluation and verification**
   - The authors evaluate GCBM across multiple datasets and concept generation methods, highlighting its strengths in fine-grained classification and cross-domain generalization.

### Summary

The paper tackles the transparency and interpretability of neural network decision-making by introducing visually Grounded Concept Bottleneck Models (GCBM). Especially in complex visual domains, GCBM avoids the biases that come with relying on large language models and provides more reliable and intuitive explanations.
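
The concept-bottleneck idea summarized above can be sketched in a few lines of NumPy. This is a minimal illustration under stated assumptions, not the authors' implementation: the concept vectors are assumed to be given (random stand-ins here for embeddings of image regions), images and concepts are assumed to share one embedding space, cosine similarity is an assumed choice of concept activation, and all function names are hypothetical.

```python
import numpy as np


def l2_normalize(x, axis=-1, eps=1e-12):
    """Scale vectors to unit length along `axis`."""
    return x / (np.linalg.norm(x, axis=axis, keepdims=True) + eps)


def concept_activations(image_embs, concept_embs):
    """Cosine similarity of each image (n, d) to each concept (k, d) -> (n, k)."""
    return l2_normalize(image_embs) @ l2_normalize(concept_embs).T


def predict(image_embs, concept_embs, W, b):
    """Bottleneck: classes are predicted from concept activations only."""
    acts = concept_activations(image_embs, concept_embs)  # (n, k)
    logits = acts @ W + b                                 # (n, c)
    return acts, logits


def top_concepts(acts_row, W, class_idx, names, top=3):
    """Rank concepts by their additive contribution to one class logit."""
    contrib = acts_row * W[:, class_idx]
    order = np.argsort(contrib)[::-1][:top]
    return [(names[i], float(contrib[i])) for i in order]


# Toy data: random stand-ins for real image and concept embeddings (assumption).
rng = np.random.default_rng(0)
image_embs = rng.normal(size=(4, 8))    # 4 images, 8-dim embeddings
concept_embs = rng.normal(size=(5, 8))  # 5 concepts in the same space
W = rng.normal(size=(5, 3))             # linear layer: 5 concepts -> 3 classes
b = np.zeros(3)
names = [f"concept_{i}" for i in range(5)]  # hypothetical concept names

acts, logits = predict(image_embs, concept_embs, W, b)
explanation = top_concepts(acts[0], W, class_idx=1, names=names, top=2)
```

Because the final layer is linear, each class logit decomposes exactly into per-concept contributions plus the bias, which is what lets a prediction be explained in terms of named concepts.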

Aligning Visual and Semantic Interpretability through Visually Grounded Concept Bottleneck Models

VLG-CBM: Training Concept Bottleneck Models with Vision-Language Guidance

Sparse Concept Bottleneck Models: Gumbel Tricks in Contrastive Learning

Language in a Bottle: Language Model Guided Concept Bottlenecks for Interpretable Image Classification

Coarse-to-Fine Concept Bottleneck Models

Label-Free Concept Bottleneck Models

AdaCBM: An Adaptive Concept Bottleneck Model for Explainable and Accurate Diagnosis

Stochastic Concept Bottleneck Models

Can we Constrain Concept Bottleneck Models to Learn Semantically Meaningful Input Features?

Post-hoc Concept Bottleneck Models

Improving Concept Alignment in Vision-Language Concept Bottleneck Models

Beyond Concept Bottleneck Models: How to Make Black Boxes Intervenable?

Explain via Any Concept: Concept Bottleneck Model with Open Vocabulary Concepts

Concept Bottleneck Models Without Predefined Concepts

Incremental Residual Concept Bottleneck Models

Interpretable-by-Design Text Understanding with Iteratively Generated Concept Bottleneck

Concept Bottleneck with Visual Concept Filtering for Explainable Medical Image Classification

Concept Bottleneck Model with Additional Unsupervised Concepts

Concept Bottleneck Large Language Models

I saw, I conceived, I concluded: Progressive Concepts as Bottlenecks

Conceptual Learning via Embedding Approximations for Reinforcing Interpretability and Transparency