Abstract:The use of machine learning (ML) in critical domains such as medicine poses risks and requires regulation. One requirement is that decisions of ML systems in high-risk applications should be human-understandable. The field of "explainable artificial intelligence" (XAI) seemingly addresses this need. However, in its current form, XAI is unfit to provide quality control for ML; it itself needs scrutiny. Popular XAI methods cannot reliably answer important questions about ML models, their training data, or a given test input. We recapitulate results demonstrating that popular XAI methods systematically attribute importance to input features that are independent of the prediction target. This limits their utility for purposes such as model and data (in)validation, model improvement, and scientific discovery. We argue that the fundamental reason for this limitation is that current XAI methods do not address well-defined problems and are not evaluated against objective criteria of explanation correctness. Researchers should formally define the problems they intend to solve first and then design methods accordingly. This will lead to notions of explanation correctness that can be theoretically verified and objective metrics of explanation performance that can be assessed using ground-truth data.

What problem does this paper attempt to address?

The problem that this paper attempts to solve is the deficiencies of current Explainable Artificial Intelligence (XAI) methods in terms of quality control and explanation correctness. Specifically: 1. **Limitations of XAI methods**: Existing XAI methods are unable to reliably answer important questions about machine - learning models, training data, or specific test inputs. For example, popular methods often assign importance scores to features that are irrelevant to the prediction target, which limits their application in model validation, data validation, model improvement, and scientific discovery. 2. **Lack of formal definitions**: Current XAI methods are not designed for well - defined problems, nor are they evaluated for the correctness of explanations according to objective criteria. This means that these methods lack theoretical verification and are difficult to evaluate their performance with real - data. 3. **Potential risks in key areas**: When using machine - learning systems in high - risk fields such as medicine, it is required that the decisions of these systems must be understandable to humans. However, current XAI methods are not sufficient to provide such assurance, so more stringent quality control is required. ### Main contributions of the paper To address these problems, the author makes the following suggestions: - **Formalize problem definitions**: Researchers should first clearly define the problems they want to solve and then design corresponding methods. This will help to establish the concept of explanation correctness that can be theoretically verified and provide objective indicators that can be evaluated with real - data. - **Introduce the concept of causal variables**: The paper shows through two minimal examples how suppressor variables affect the performance of existing feature - attribution methods. Suppressor variables are those that do not have predictive power themselves but can improve the prediction effect. Research shows that many popular XAI methods will wrongly assign importance to these suppressor variables. - **Propose improvement directions**: To solve the above problems, the author suggests developing new XAI methods, which should follow six steps: 1. Evaluate the information needs of users and stakeholders in specific application scenarios. 2. Define the formal requirements and XAI problems that meet these needs. 3. Design methods suitable for solving specific XAI problems. 4. Conduct theoretical analysis to ensure compliance with formal requirements. 5. Conduct empirical verification using appropriate benchmark datasets. 6. Improve the method according to further requirements. Through these measures, the paper aims to promote the development of the XAI field so that it can better serve the quality assurance needs in practical applications. ### Summary This paper emphasizes the limitations of current XAI methods in terms of explanation correctness and quality control and proposes directions for improvement. By formalizing problem definitions and introducing the concept of causal variables, researchers believe that more reliable and effective XAI tools can be developed, thereby increasing the credibility and transparency of machine - learning systems in high - risk applications.

Explainable AI needs formal notions of explanation correctness

Explainable Artificial Intelligence (XAI): What we know and what is left to attain Trustworthy Artificial Intelligence

Explainable AI (XAI): History, Basic Ideas and Methods

Explainable Artificial Intelligence: Concepts, Applications, Research Challenges and Visions

Explainable Artificial Intelligence (XAI): Concepts, Taxonomies, Opportunities and Challenges toward Responsible AI

Demystifying XAI: Requirements for Understandable XAI Explanations

False Sense of Security in Explainable Artificial Intelligence (XAI)

Explaining Any ML Model? -- On Goals and Capabilities of XAI

"Explanation" is Not a Technical Term: The Problem of Ambiguity in XAI

Explainable Artificial Intelligence (XAI) in Biomedicine: Making AI Decisions Trustworthy for Physicians and Patients

EXplainable Artificial Intelligence (XAI)—From Theory to Methods and Applications

Explainable Artificial Intelligence: A Survey of Needs, Techniques, Applications, and Future Direction

Logic-Based Explainability: Past, Present & Future

Explainable AI: Definition and attributes of a good explanation for health AI

Explainable artificial intelligence: a comprehensive review

Explainability of Artificial Intelligence Methods, Applications and Challenges: A Comprehensive Survey

Explainable AI (XAI): Core Ideas, Techniques, and Solutions

It is not "accuracy vs. explainability" -- we need both for trustworthy AI systems

Explaining Explanations: An Overview of Interpretability of Machine Learning

Explainable AI does not provide the explanations end-users are asking for