Abstract:Decision trees (DTs) epitomize the ideal of interpretability of machine learning (ML) models. The interpretability of decision trees motivates explainability approaches by so-called intrinsic interpretability, and it is at the core of recent proposals for applying interpretable ML models in high-risk applications. The belief in DT interpretability is justified by the fact that explanations for DT predictions are generally expected to be succinct. Indeed, in the case of DTs, explanations correspond to DT paths. Since decision trees are ideally shallow, and so paths contain far fewer features than the total number of features, explanations in DTs are expected to be succinct, and hence interpretable. This paper offers both theoretical and experimental arguments demonstrating that, as long as interpretability of decision trees equates with succinctness of explanations, then decision trees ought not be deemed interpretable. The paper introduces logically rigorous path explanations and path explanation redundancy, and proves that there exist functions for which decision trees must exhibit paths with explanation redundancy that is arbitrarily larger than the actual path explanation. The paper also proves that only a very restricted class of functions can be represented with DTs that exhibit no explanation redundancy. In addition, the paper includes experimental results substantiating that path explanation redundancy is observed ubiquitously in decision trees, including those obtained using different tree learning algorithms, but also in a wide range of publicly available decision trees. The paper also proposes polynomial-time algorithms for eliminating path explanation redundancy, which in practice require negligible time to compute. Thus, these algorithms serve to indirectly attain irreducible, and so succinct, explanations for decision trees. Furthermore, the paper includes novel results related with duality and enumeration of explanations, based on using SAT solvers as witness-producing NP-oracles.

Explaining Random Forests As Single Decision Trees Through Distance Functional Optimization

Explainable decision forest: Transforming a decision forest into an interpretable tree

FROD: an Efficient Framework for Optimizing Decision Trees in Packet Classification

Trading Complexity for Sparsity in Random Forest Explanations

Example-based Explanations for Random Forests using Machine Unlearning

Enhanced Local Explainability and Trust Scores with Random Forest Proximities

Explaining the Success of AdaBoost and Random Forests as Interpolating Classifiers

Very fast, approximate counterfactual explanations for decision forests

Understanding Random Forests: From Theory to Practice

Case-based Explainability for Random Forest: Prototypes, Critics, Counter-factuals and Semi-factuals

BELLATREX: Building Explanations through a LocaLly AccuraTe Rule EXtractor

Random Forests with Economic Roots: Explaining Machine Learning in Hedonic Imputation

From unbiased MDI Feature Importance to Explainable AI for Trees

Demystifying Functional Random Forests: Novel Explainability Tools for Model Transparency in High-Dimensional Spaces

On Tackling Explanation Redundancy in Decision Trees

Trees, Forests, Chickens, and Eggs: When and Why to Prune Trees in a Random Forest

From local explanations to global understanding with explainable AI for trees

Explainable Data-Driven Optimization: From Context to Decision and Back Again

Inherently Interpretable Tree Ensemble Learning

On Explaining Random Forests with SAT

Improving the Validity of Decision Trees as Explanations