Abstract: Decision trees (DTs) epitomize the ideal of interpretability of machine learning (ML) models. The interpretability of decision trees motivates explainability approaches by so-called intrinsic interpretability, and it is at the core of recent proposals for applying interpretable ML models in high-risk applications. The belief in DT interpretability is justified by the fact that explanations for DT predictions are generally expected to be succinct. Indeed, in the case of DTs, explanations correspond to DT paths. Since decision trees are ideally shallow, and so paths contain far fewer features than the total number of features, explanations in DTs are expected to be succinct, and hence interpretable. This paper offers both theoretical and experimental arguments demonstrating that, as long as interpretability of decision trees is equated with succinctness of explanations, decision trees ought not to be deemed interpretable. The paper introduces logically rigorous path explanations and path explanation redundancy, and proves that there exist functions for which decision trees must exhibit paths with explanation redundancy that is arbitrarily larger than the actual path explanation. The paper also proves that only a very restricted class of functions can be represented with DTs that exhibit no explanation redundancy. In addition, the paper includes experimental results substantiating that path explanation redundancy is observed ubiquitously in decision trees, both in trees obtained with different tree learning algorithms and in a wide range of publicly available decision trees. The paper also proposes polynomial-time algorithms for eliminating path explanation redundancy, which in practice require negligible time to compute. Thus, these algorithms serve to indirectly attain irreducible, and so succinct, explanations for decision trees. Furthermore, the paper includes novel results related to duality and enumeration of explanations, based on using SAT solvers as witness-producing NP-oracles.
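
To make the notion of path explanation redundancy concrete, the following is a minimal sketch, not the paper's algorithm. It assumes Boolean features and a hypothetical nested-tuple tree encoding; the names `reachable_classes` and `irreducible_explanation` are illustrative only. The idea is a simple deletion-based pass: a literal on a path is dropped if, with that feature left free, every leaf still consistent with the remaining literals predicts the same class. Each check is a single tree traversal, so the whole pass is polynomial in the tree size.

```python
from typing import Dict, List, Set, Tuple, Union

# Hypothetical encoding (an assumption, not taken from the paper): a tree is
# either a leaf holding the predicted class, or a tuple
# (feature, subtree_if_0, subtree_if_1) over Boolean features.
Tree = Union[int, Tuple[str, "Tree", "Tree"]]


def reachable_classes(tree: Tree, fixed: Dict[str, int]) -> Set[int]:
    """Classes of all leaves consistent with the fixed feature values."""
    if not isinstance(tree, tuple):
        return {tree}
    feature, low, high = tree
    if feature in fixed:
        # The feature is fixed: follow only the matching branch.
        return reachable_classes(high if fixed[feature] else low, fixed)
    # The feature is free: both branches remain possible.
    return reachable_classes(low, fixed) | reachable_classes(high, fixed)


def irreducible_explanation(tree: Tree, path: List[Tuple[str, int]]) -> Dict[str, int]:
    """Drop path literals one at a time while the prediction stays forced."""
    fixed = dict(path)
    (target,) = reachable_classes(tree, fixed)  # a full path reaches one leaf
    for feature, _ in path:
        trial = {f: v for f, v in fixed.items() if f != feature}
        if reachable_classes(tree, trial) == {target}:
            fixed = trial  # the literal was redundant
    return fixed


# Tree for f = x1 OR x2: test x1 first, then x2 when x1 = 0.
TREE: Tree = ("x1", ("x2", 0, 1), 1)
PATH = [("x1", 0), ("x2", 1)]  # path with prediction 1

print(irreducible_explanation(TREE, PATH))  # {'x2': 1}
```

In this toy tree the path tests two features, yet `x2 = 1` alone forces the prediction, so the literal on `x1` is redundant. The result is subset-minimal with respect to the order in which literals are tried; it is not guaranteed to be the smallest possible explanation.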
