Abstract:Concept Bottleneck Model (CBM) is a kind of powerful interpretable neural network, which utilizes high-level concepts to explain model decisions and interact with humans. However, CBM cannot always work as expected due to the troublesome collection and commonplace insufficiency of high-level concepts in real-world scenarios. In this paper, we theoretically reveal that insufficient concept information will induce the mixture of explicit and implicit information, which further leads to the inherent dilemma of concept and label distortions in CBM. Motivated by the proposed theorem, we present Decoupling Concept Bottleneck Model (DCBM), a novel concept-based model decoupling heterogeneous information into explicit and implicit concepts, while still retaining high prediction performance and interpretability. Extensive experiments expose the success in the alleviation of concept/label distortions, where DCBM achieves state-of-the-art performances in both concept and label learning tasks. Especially for situations where concepts are insufficient, DCBM significantly outperforms other models based on concept bottleneck and respectively achieves error rates 24.95% and 20.09% lower than other CBMs on concept/label prediction. Moreover, to express effective human-machine interactions for DCBM, we devise two algorithms based on mutual information (MI) estimation, including forward intervention and backward rectification, which can automatically correct labels and trace back to wrong concepts. The construction of the interaction regime can be formulated as a light min-max optimization problem achieved within minutes. Multiple experiments show that such interactions can effectively promote concept/label accuracy.

Decoupling Concept Bottleneck Model

The Decoupling Concept Bottleneck Model

Post-hoc Concept Bottleneck Models

Incremental Residual Concept Bottleneck Models

Stochastic Concept Bottleneck Models

Semi-supervised Concept Bottleneck Models

Label-Free Concept Bottleneck Models

On the Concept Trustworthiness in Concept Bottleneck Models

Eliminating Information Leakage in Hard Concept Bottleneck Models with Supervised, Hierarchical Concept Learning

Counterfactual Concept Bottleneck Models

Coarse-to-Fine Concept Bottleneck Models

Probabilistic Concept Bottleneck Models

Can we Constrain Concept Bottleneck Models to Learn Semantically Meaningful Input Features?

AdaCBM: An Adaptive Concept Bottleneck Model for Explainable and Accurate Diagnosis

Energy-Based Concept Bottleneck Models: Unifying Prediction, Concept Intervention, and Probabilistic Interpretations

Concept Bottleneck Models Without Predefined Concepts

Relational Concept Bottleneck Models

Benchmarking and Enhancing Disentanglement in Concept-Residual Models

Concept Bottleneck Model with Additional Unsupervised Concepts

Learning to Intervene on Concept Bottlenecks

Editable Concept Bottleneck Models