Abstract:We present DeforestVis, a visual analytics tool that offers summarization of the behaviour of complex ML models by providing AdaBoost‐based surrogate decision stumps. Our proposed tool helps users explore the complexity versus fidelity trade‐off, create attribute‐based explanations with weighted stumps, and analyse the impact of rule overriding. As the complexity of machine learning (ML) models increases and their application in different (and critical) domains grows, there is a strong demand for more interpretable and trustworthy ML. A direct, model‐agnostic, way to interpret such models is to train surrogate models—such as rule sets and decision trees—that sufficiently approximate the original ones while being simpler and easier‐to‐explain. Yet, rule sets can become very lengthy, with many if–else statements, and decision tree depth grows rapidly when accurately emulating complex ML models. In such cases, both approaches can fail to meet their core goal—providing users with model interpretability. To tackle this, we propose DeforestVis, a visual analytics tool that offers summarization of the behaviour of complex ML models by providing surrogate decision stumps (one‐level decision trees) generated with the Adaptive Boosting (AdaBoost) technique. DeforestVis helps users to explore the complexity versus fidelity trade‐off by incrementally generating more stumps, creating attribute‐based explanations with weighted stumps to justify decision making, and analysing the impact of rule overriding on training instance allocation between one or more stumps. An independent test set allows users to monitor the effectiveness of manual rule changes and form hypotheses based on case‐by‐case analyses. We show the applicability and usefulness of DeforestVis with two use cases and expert interviews with data analysts and model developers.

Improved prediction rule ensembling through model-based data generation

Interpretable Prediction Rule Ensembles in the Presence of Missing Data

Fitting Prediction Rule Ensembles to Psychological Research Data: An Introduction and Tutorial

Ranking and Combining Latent Structured Predictive Scores without Labeled Data

Better Short than Greedy: Interpretable Models through Optimal Rule Boosting

Predictive analytics with ensemble modeling in laparoscopic surgery: A technical note

Developing parsimonious ensembles using ensemble diversity within a reinforcement learning framework

Predictive Ensemble Pruning by Expectation Propagation

Ensemble Subset Regression (ENSURE): Efficient High-dimensional Prediction

An ensemble penalized regression method for multi-ancestry polygenic risk prediction

Bayesian Regression Trees for High-Dimensional Prediction and Variable Selection

Tree Ensembles with Rule Structured Horseshoe Regularization

Multi-Model Subset Selection

Beyond Discriminant Patterns: On the Robustness of Decision Rule Ensembles

ASPEST: Bridging the Gap Between Active Learning and Selective Prediction

Developing parsimonious ensembles using predictor diversity within a reinforcement learning framework

Building Trees for Probabilistic Prediction via Scoring Rules

Predicting class-imbalanced business risk using resampling, regularization, and model ensembling algorithms

An ensemble approach to improved prediction from multitype data

DeforestVis: Behaviour Analysis of Machine Learning Models with Surrogate Decision Stumps