Abstract:We developed a novel machine learning (ML) algorithm with the goal of producing transparent models (i.e., understandable by humans) while also flexibly accounting for nonlinearity and interactions. Our method is based on ranked sparsity, and it allows for flexibility and user control in varying the shade of the opacity of black box machine learning methods. The main tenet of ranked sparsity is that an algorithm should be more skeptical of higher-order polynomials and interactions a priori compared to main effects, and hence, the inclusion of these more complex terms should require a higher level of evidence. In this work, we put our new ranked sparsity algorithm (as implemented in the open source R package, sparseR) to the test in a predictive model "bakeoff" (i.e., a benchmarking study of ML algorithms applied "out of the box", that is, with no special tuning). Algorithms were trained on a large set of simulated and real-world data sets from the Penn Machine Learning Benchmarks database, addressing both regression and binary classification problems. We evaluated the extent to which our human-centered algorithm can attain predictive accuracy that rivals popular black box approaches such as neural networks, random forests, and support vector machines, while also producing more interpretable models. Using out-of-bag error as a meta-outcome, we describe the properties of data sets in which human-centered approaches can perform as well as or better than black box approaches. We found that interpretable approaches predicted optimally or within 5% of the optimal method in most real-world data sets. We provide a more in-depth comparison of the performances of random forests to interpretable methods for several case studies, including exemplars in which algorithms performed similarly, and several cases when interpretable methods underperformed. This work provides a strong rationale for including human-centered transparent algorithms such as ours in predictive modeling applications.

Lifting Interpretability-Performance Trade-off via Automated Feature Engineering

Dynamic Feature Engineering for Transparent Machine Learning: a Framework for Interpretable Model Explanations

Algorithms for interpretable machine learning

Interpretable Model-Agnostic Explanations Based on Feature Relationships for High-Performance Computing

A Grey-Box Ensemble Model Exploiting Black-Box Accuracy and White-Box Intrinsic Interpretability

Achieving interpretable machine learning by functional decomposition of black-box models into explainable predictor effects

Learning outside the Black-Box: The pursuit of interpretable models

Can a Transparent Machine Learning Algorithm Predict Better than Its Black Box Counterparts? A Benchmarking Study Using 110 Data Sets

Explanatory Model Monitoring to Understand the Effects of Feature Shifts on Performance

Cracking black-box models: Revealing hidden machine learning techniques behind their predictions

Challenging the Performance-Interpretability Trade-off: An Evaluation of Interpretable Machine Learning Models

An Interpretable Probabilistic Approach for Demystifying Black-box Predictive Models

Model-Agnostic Interpretation Framework in Machine Learning: A Comparative Study in NBA Sports

Model-Agnostic Interpretability of Machine Learning

Discriminative Feature Attributions: Bridging Post Hoc Explainability and Inherent Interpretability

Interpretable models for extrapolation in scientific machine learning

Interpreting Black-box Machine Learning Models for High Dimensional Datasets

An Empirical Comparison of Interpretable Models to Post-Hoc Explanations

Interpretability in Safety-Critical FinancialTrading Systems

LCEN: A Novel Feature Selection Algorithm for Nonlinear, Interpretable Machine Learning Models

A Double Penalty Model for Interpretability