Abstract:Existing methods, such as concept bottleneck models (CBMs), have been successful in providing concept-based interpretations for black-box deep learning models. They typically work by predicting concepts given the input and then predicting the final class label given the predicted concepts. However, (1) they often fail to capture the high-order, nonlinear interaction between concepts, e.g., correcting a predicted concept (e.g., "yellow breast") does not help correct highly correlated concepts (e.g., "yellow belly"), leading to suboptimal final accuracy; (2) they cannot naturally quantify the complex conditional dependencies between different concepts and class labels (e.g., for an image with the class label "Kentucky Warbler" and a concept "black bill", what is the probability that the model correctly predicts another concept "black crown"), therefore failing to provide deeper insight into how a black-box model works. In response to these limitations, we propose Energy-based Concept Bottleneck Models (ECBMs). Our ECBMs use a set of neural networks to define the joint energy of candidate (input, concept, class) tuples. With such a unified interface, prediction, concept correction, and conditional dependency quantification are then represented as conditional probabilities, which are generated by composing different energy functions. Our ECBMs address both limitations of existing CBMs, providing higher accuracy and richer concept interpretations. Empirical results show that our approach outperforms the state-of-the-art on real-world datasets.

What problem does this paper attempt to address?

The paper attempts to address several key issues in existing Concept Bottleneck Models (CBMs): 1. **Interpretability**: Existing CBMs fail to effectively quantify the complex relationships between different concepts and category labels. For example, in an image labeled as "Kentucky Warbler," what is the probability that the model correctly identifies a "black beak"? Therefore, they fall short in explaining the workings of black-box models. 2. **Intervention Capability**: Existing CBMs often struggle to handle complex interactions between concepts. Thus, correcting a misidentified concept (such as "yellow chest") does not necessarily improve the accuracy of other highly related concepts (such as "yellow abdomen"), resulting in lower accuracy for both individual concepts and the final category label. 3. **Performance**: Current CBM variants exhibit a trade-off between model performance and interpretability. However, an ideal interpretable model should leverage the synergy between performance and interpretability to achieve the best combination of both. To address these limitations, the authors propose Energy-based Concept Bottleneck Models (ECBMs). ECBMs use a set of neural networks to define the joint energy of input \( x \), concept \( c \), and category label \( y \). Through this unified interface, tasks such as prediction, concept correction, and conditional dependency quantification can naturally be represented as conditional probabilities, which can be generated by combining different energy functions. ECBMs address two major limitations of existing CBMs, providing higher accuracy and richer concept explanations. Specifically, the contributions of ECBMs include: - Going beyond typical concept prediction by identifying concept correction and conditional explanation as valuable tools for providing concept-based explanations. - Proposing ECBMs, the first general method to unify concept prediction, concept correction, and conditional explanation as conditional probabilities. - Utilizing the unified interface of ECBMs to derive a set of algorithms that compute different conditional probabilities by combining different energy functions. - Experimental results show that ECBMs significantly outperform existing methods on real-world datasets. The code is available on [GitHub](https://github.com/xmed-lab/ECBM).

Energy-Based Concept Bottleneck Models: Unifying Prediction, Concept Intervention, and Probabilistic Interpretations

EQ-CBM: A Probabilistic Concept Bottleneck with Energy-based Models and Quantized Vectors

Incremental Residual Concept Bottleneck Models

Stochastic Concept Bottleneck Models

Editable Concept Bottleneck Models

Post-hoc Concept Bottleneck Models

Probabilistic Concept Bottleneck Models

Semi-supervised Concept Bottleneck Models

Decoupling Concept Bottleneck Model

Counterfactual Concept Bottleneck Models

The Decoupling Concept Bottleneck Model

Concept Bottleneck Models Without Predefined Concepts

Can we Constrain Concept Bottleneck Models to Learn Semantically Meaningful Input Features?

On the Concept Trustworthiness in Concept Bottleneck Models

Sparse Concept Bottleneck Models: Gumbel Tricks in Contrastive Learning

Interactive Concept Bottleneck Models

Bayesian Concept Bottleneck Models with LLM Priors

Do Concept Bottleneck Models Respect Localities?

Coarse-to-Fine Concept Bottleneck Models

Beyond Concept Bottleneck Models: How to Make Black Boxes Intervenable?

Concept Embedding Models: Beyond the Accuracy-Explainability Trade-Off