An Interpretable Rule Creation Method for Black-Box Models based on Surrogate Trees -- SRules

Mario Parrón Verdasco,Esteban García-Cuesta
2024-07-29
Abstract:As artificial intelligence (AI) systems become increasingly integrated into critical decision-making processes, the need for transparent and interpretable models has become paramount. In this article we present a new ruleset creation method based on surrogate decision trees (SRules), designed to improve the interpretability of black-box machine learning models. SRules balances the accuracy, coverage, and interpretability of machine learning models by recursively creating surrogate interpretable decision tree models that approximate the decision boundaries of a complex model. We propose a systematic framework for generating concise and meaningful rules from these surrogate models, allowing stakeholders to understand and trust the AI system's decision-making process. Our approach not only provides interpretable rules, but also quantifies the confidence and coverage of these rules. The proposed model allows to adjust its parameters to counteract the lack of interpretability by precision and coverage by allowing a near perfect fit and high interpretability of some parts of the model . The results show that SRules improves on other state-of-the-art techniques and introduces the possibility of creating highly interpretable specific rules for specific sub-parts of the model.
Machine Learning
What problem does this paper attempt to address?
The main problem that this paper attempts to solve is to improve the interpretability of black - box machine - learning models. Specifically, the paper proposes a new rule - set creation method based on surrogate decision trees - SRules, aiming to improve the interpretability of black - box models while maintaining the accuracy and coverage of the models. The following are the key points mentioned in the paper: 1. **Background and Motivation**: - With the increasing complexity of artificial intelligence systems, transparent and interpretable models have become particularly important, especially in fields such as medicine, law, and national defense, where decisions directly affect human lives. - Black - box techniques (such as deep - learning neural networks and ensemble methods) perform well in solving complex problems but lack interpretability. 2. **Problem Description**: - The paper focuses on how to extract concise and meaningful rules from complex black - box models so that stakeholders can understand and trust the decision - making process of AI systems. - The goal is to balance the accuracy, coverage, and interpretability of the model by recursively generating surrogate decision trees to approximate the decision boundaries of complex models. 3. **Method Overview**: - A system framework is proposed for generating rules from surrogate models. These rules are not only interpretable but also quantify the confidence and coverage of the rules. - The method allows for parameter adjustment to achieve near - perfect fitting and high interpretability in certain parts. 4. **Technical Details**: - Feature importance (such as Gini impurity or permutation tests) is used to select the most important features and build surrogate decision trees based on these features. - Relevant branch patterns are selected by conditional information gain and converted into rules. - The generated rules are classified and pruned to improve the quality and interpretability of the rule set. 5. **Experimental Verification**: - The effectiveness of the SRules method is verified through multiple public data sets (such as Divorce, Chess, Wisconsin Breast Cancer, SPECT, Tic - Tac - Toe, etc.). - The experimental results show that SRules is superior to other state - of - the - art methods (such as RuleFit and RuleCosi) in interpretability and performance. In conclusion, this paper solves the key problem of improving the interpretability of black - box models while maintaining high performance by proposing the SRules method.