Decision Predicate Graphs: Enhancing Interpretability in Tree Ensembles

Leonardo Arrighi,Luca Pennella,Gabriel Marques Tavares,Sylvio Barbon Junior
2024-04-03
Abstract:Understanding the decisions of tree-based ensembles and their relationships is pivotal for machine learning model interpretation. Recent attempts to mitigate the human-in-the-loop interpretation challenge have explored the extraction of the decision structure underlying the model taking advantage of graph simplification and path emphasis. However, while these efforts enhance the visualisation experience, they may either result in a visually complex representation or compromise the interpretability of the original ensemble model. In addressing this challenge, especially in complex scenarios, we introduce the Decision Predicate Graph (DPG) as a model-agnostic tool to provide a global interpretation of the model. DPG is a graph structure that captures the tree-based ensemble model and learned dataset details, preserving the relations among features, logical decisions, and predictions towards emphasising insightful points. Leveraging well-known graph theory concepts, such as the notions of centrality and community, DPG offers additional quantitative insights into the model, complementing visualisation techniques, expanding the problem space descriptions, and offering diverse possibilities for extensions. Empirical experiments demonstrate the potential of DPG in addressing traditional benchmarks and complex classification scenarios.
Machine Learning,Artificial Intelligence
What problem does this paper attempt to address?
The paper mainly addresses the issue of improving the interpretability of tree ensemble models. By introducing the Decision Predicate Graph (DPG), the paper proposes a model-agnostic tool that aims to provide a method for global interpretation of tree-based ensemble models. The DPG structure captures the details of the model and dataset, emphasizing the relationship between features, logical decisions, and predictions. It utilizes concepts from graph theory such as centrality and communities to provide quantitative insights, enhancing model visualization and understanding. The paper also presents the algorithm for constructing DPG and its complexity, and demonstrates the potential of DPG in traditional benchmarks and complex classification scenarios through experiments.