Do intermediate feature coalitions aid explainability of black-box models?

Minal Suresh Patil, Kary Främling
2023-06-02
Abstract: This work introduces the notion of intermediate concepts based on a levels structure to aid explainability for black-box models. The levels structure is a hierarchical structure in which each level corresponds to a partition of the dataset's features (i.e., a partition of the player set). The level of coarseness increases from the trivial partition, which comprises only singletons, to the partition that contains only the grand coalition. In addition, a domain expert can establish meronomies, i.e., part-whole relationships, which can be used to generate explanations at an abstract level. We illustrate the usability of this approach on a real-world car model example and the Titanic dataset, where intermediate concepts aid explainability at different levels of abstraction.
Machine Learning, Artificial Intelligence
What problem does this paper attempt to address?
### The Problem the Paper Attempts to Solve

This paper explores whether intermediate feature coalitions can improve the interpretability of black-box models. Specifically, it proposes intermediate concepts based on a levels structure: a hierarchy in which each level is a partition of the dataset's features (i.e., a partition of the player set). The partitions become coarser from level to level, starting with the trivial partition that contains only singletons and ending with the partition that contains only the grand coalition. Additionally, domain experts can establish meronomies (part-whole relationships) that are used to generate explanations at an abstract level. The paper demonstrates the usability of this approach at different levels of abstraction on a real-world car model example and the Titanic dataset.
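
To make the levels structure concrete, the following minimal sketch represents each level as a list of feature coalitions and checks the properties described above: every level partitions the feature set, the first level contains only singletons, the last level contains only the grand coalition, and each level is coarser than the one before it. The feature names and the intermediate grouping are hypothetical illustrations loosely inspired by the Titanic dataset, not the groupings used in the paper, and the helper names (`is_partition`, `is_coarser`, `is_levels_structure`) are assumptions introduced here.

```python
from typing import FrozenSet, List, Sequence

Partition = List[FrozenSet[str]]


def is_partition(level: Partition, players: Sequence[str]) -> bool:
    """Return True if the blocks are non-empty, disjoint, and cover all players."""
    seen = set()
    for block in level:
        if not block or seen & block:
            return False
        seen |= block
    return seen == set(players)


def is_coarser(coarse: Partition, fine: Partition) -> bool:
    """Return True if every block of `fine` lies inside some block of `coarse`."""
    return all(any(block <= big for big in coarse) for block in fine)


def is_levels_structure(levels: List[Partition], players: Sequence[str]) -> bool:
    """Check the properties of a levels structure as described in the text."""
    if not levels:
        return False
    if any(not is_partition(level, players) for level in levels):
        return False
    if any(len(block) != 1 for block in levels[0]):  # lowest level: singletons only
        return False
    if len(levels[-1]) != 1:  # highest level: the grand coalition only
        return False
    return all(is_coarser(levels[i + 1], levels[i]) for i in range(len(levels) - 1))


# Hypothetical features and expert-defined intermediate concepts (assumptions).
features = ["age", "sex", "pclass", "fare", "sibsp", "parch"]
levels = [
    [frozenset({f}) for f in features],      # level 0: singletons
    [
        frozenset({"age", "sex"}),           # e.g. a "person" concept
        frozenset({"pclass", "fare"}),       # e.g. a "ticket" concept
        frozenset({"sibsp", "parch"}),       # e.g. a "family" concept
    ],                                       # level 1: intermediate coalitions
    [frozenset(features)],                   # level 2: grand coalition
]

print(is_levels_structure(levels, features))
```

In this sketch the middle level plays the role of the intermediate concepts: an explanation method could attribute importance to each coalition as a whole rather than to individual features. Reordering the levels so that the grand coalition is not last, for example, makes the check fail.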