VINE: Visualizing Statistical Interactions in Black Box Models

Matthew Britton
DOI: https://doi.org/10.48550/arXiv.1904.00561
2019-04-01
Abstract:As machine learning becomes more pervasive, there is an urgent need for interpretable explanations of predictive models. Prior work has developed effective methods for visualizing global model behavior, as well as generating local (instance-specific) explanations. However, relatively little work has addressed regional explanations - how groups of similar instances behave in a complex model, and the related issue of visualizing statistical feature interactions. The lack of utilities available for these analytical needs hinders the development of models that are mission-critical, transparent, and align with social goals. We present VINE (Visual INteraction Effects), a novel algorithm to extract and visualize statistical interaction effects in black box models. We also present a novel evaluation metric for visualizations in the interpretable ML space.
Machine Learning
What problem does this paper attempt to address?
The main problems that this paper attempts to solve are the limitations of existing interpretive methods in explaining complex machine - learning models (especially black - box models), especially the deficiencies in regional explanation and feature - interaction visualization. Specifically: 1. **Limitations of Global and Local Explanations**: - Although global explanations can summarize the overall behavior of the model, they are usually a simplification of the actual behavior and ignore the special behaviors in different subsets. - Although local explanations can provide specific explanations for a single data point, their generalization ability is limited and it is difficult to generalize them to other data subsets. 2. **Lack of Support for Regional Explanation**: - Regional explanation aims to describe the behavior of a group of similar instances, which is helpful for understanding the performance of the model on a specific subset, but existing tools provide little support for this. 3. **Visualization Challenges of Feature Interaction**: - Feature interaction refers to the non - additive relationship between two or more features, which has an important impact on prediction. However, existing methods have deficiencies in visualizing these interactions, especially in identifying which regions have particularly strong or weak interactions. To solve these problems, the paper proposes VINE (Visual INteraction Effects), a new algorithm and visualization tool for extracting and presenting statistical interaction effects in black - box models. VINE solves the above problems in the following ways: - **Generating Regional Explanations**: VINE generates regional explanations based on clustered partial dependence curves, which describe the behavior of the model on specific data subsets. - **Interactive Visualization**: VINE provides an interactive visualization interface, and users can click on the chart to view detailed regional model explanations. - **Evaluation Framework**: The paper also introduces the Information Ceiling framework for quantitatively evaluating the fidelity of visualization explanations to ensure that they are faithful to the underlying model. Through these innovations, VINE can not only better explain the behavior of complex models, but also help users understand the feature interactions in the model more deeply, thereby improving the transparency and interpretability of the model.