Abductive explanations of classifiers under constraints: Complexity and properties

Martin Cooper,Leila Amgoud
2024-09-19
Abstract:Abductive explanations (AXp's) are widely used for understanding decisions of classifiers. Existing definitions are suitable when features are independent. However, we show that ignoring constraints when they exist between features may lead to an explosion in the number of redundant or superfluous AXp's. We propose three new types of explanations that take into account constraints and that can be generated from the whole feature space or from a sample (such as a dataset). They are based on a key notion of coverage of an explanation, the set of instances it explains. We show that coverage is powerful enough to discard redundant and superfluous AXp's. For each type, we analyse the complexity of finding an explanation and investigate its formal properties. The final result is a catalogue of different forms of AXp's with different complexities and different formal guarantees.
Artificial Intelligence
What problem does this paper attempt to address?
The main problem that this paper attempts to solve is: when existing abductive explanations (AXp) deal with classifier decisions, they ignore the constraints between features, resulting in redundant or excessive explanations. Specifically: 1. **Limitations of the existing definition**: - The existing AXp definition assumes that features are independent and ignores the constraints between features. - Ignoring the constraints may lead to an exponential increase in the number of explanations, and many of these explanations may be redundant or superfluous. 2. **The impact of introducing dependency constraints**: - Dependency constraints (DC) mean that the values of some feature values will affect the values of other features, for example, "a pregnant person must be female". - If these dependency constraints are ignored, unnecessary explanations may be generated, increasing the complexity and redundancy of the explanations. 3. **Goals and contributions**: - The paper proposes three new coverage - based prime implicant explanations (CPI - Xp) to better handle the constraints between features. - These new methods not only consider integrity constraints (IC), but also specifically consider dependency constraints, thereby reducing the number of redundant explanations. - The paper analyzes the computational complexity of these new explanation types and proposes methods for generating explanations from sample data to reduce computational costs. ### Specific problems and solutions #### 1. Redundant explanations - **Problem**: Ignoring dependency constraints will lead to the generation of unnecessary explanations. - **Solution**: By introducing dependency constraints, ensure that the generated explanations do not contain redundant information. For example, in the example, `{(f2,0)}` is a redundant explanation because it can be derived from `f1∧¬f2→⊥`. #### 2. Explosion in the number of explanations - **Problem**: Ignoring the constraints may lead to an exponential increase in the number of explanations. - **Solution**: By introducing dependency constraints, the number of explanations can be significantly reduced. For example, in the example, there were originally several AXpc in combination, but by introducing dependency constraints, there is finally only one CPI - Xp `{(fn,1)}`. #### 3. Explanations in sample data - **Problem**: When the classifier is a black - box model, calculating all possible explanations is very time - consuming. - **Solution**: By only considering the instances in the sample data, the computational complexity can be greatly reduced. For example, the time complexities of d - AXp and d - CPI - Xp are O(mn) and O(mn^2) respectively, which are much lower than the original methods. ### Summary This paper aims to improve the existing abductive explanation methods by introducing dependency constraints, reducing redundant explanations and controlling the growth of the number of explanations. At the same time, the paper also proposes a method for generating explanations based on sample data to improve computational efficiency. This not only solves the limitations of existing methods but also provides more effective tools for practical applications.