Causal discovery using dynamically requested knowledge

Neville K Kitson, Anthony C Constantinou
2023-10-17
Abstract: Causal Bayesian Networks (CBNs) are an important tool for reasoning under uncertainty in complex real-world systems. Determining the graphical structure of a CBN remains a key challenge and is undertaken either by eliciting it from humans, using machine learning to learn it from data, or using a combination of these two approaches. In the latter case, human knowledge is generally provided to the algorithm before it starts, but here we investigate a novel approach where the structure learning algorithm itself dynamically identifies and requests knowledge for relationships that the algorithm identifies as uncertain during structure learning. We integrate this approach into the Tabu structure learning algorithm and show that it offers considerable gains in structural accuracy, which are generally larger than those offered by existing approaches for integrating knowledge. We suggest that a variant which requests only arc orientation information may be particularly useful where the practitioner has little preexisting knowledge of the causal relationships. As well as offering improved accuracy, the approach can use human expertise more effectively and contributes to making the structure learning process more transparent.
Artificial Intelligence
What problem does this paper attempt to address?
The core problem this paper addresses is how to dynamically identify and request human knowledge during the structure learning of Causal Bayesian Networks (CBNs) in order to improve structural accuracy. Specifically, the authors propose a method in which the structure learning algorithm itself determines which relationships are "uncertain" and actively requests knowledge about those relationships from human experts.

### Background and Problem Description of the Paper

Causal Bayesian Networks (CBNs) are a powerful tool for reasoning under uncertainty in complex real-world systems. However, accurately determining the graphical structure of a CBN remains a key challenge. Traditional approaches to obtaining the structure include:

1. **Knowledge Elicitation from Humans**: Obtain knowledge from human experts through a formal framework such as Knowledge Engineering for Bayesian Networks (KEBN).
2. **Learning Structures from Data**: Use structure learning algorithms to automatically learn causal structures from data.
3. **Combining Machine Learning and Human Knowledge**: Combine human knowledge with machine learning, usually by supplying predefined knowledge before the learning process begins.

### Solution Proposed in the Paper

The paper proposes a novel method in which the structure learning algorithm itself dynamically identifies and requests human knowledge during the learning process. The main features of this method are as follows:

- **Dynamically Requesting Knowledge**: During structure learning, the algorithm requests knowledge about specific relationships as it needs them, rather than receiving all knowledge at once before learning starts.
- **Improving Accuracy**: Dynamically requesting knowledge yields considerable gains in structural accuracy, generally larger than those offered by existing approaches for integrating knowledge. A variant that requests only arc orientation information may be particularly useful where the practitioner has little preexisting knowledge of the causal relationships.
- **Transparency and Efficiency**: Beyond improving accuracy, the method makes the structure learning process more transparent and uses human expertise more effectively.

### Experimental Verification

The authors integrated this method into the Tabu structure learning algorithm, creating a new algorithm called Tabu-AL (Tabu with Active Learning). Experimental results show that Tabu-AL significantly improves structural accuracy compared to existing methods, especially when dealing with inaccurate human knowledge.

### Presentation of Formulas in Markdown Format

The formulas involved in the paper are presented below, where \(Pa(X_i)\) denotes the parents of \(X_i\) in the graph:

1. **Standard Chain Rule of Global Probability Distribution**:

   \[ P(X_1, X_2, \ldots, X_n) = \prod_{i=1}^{n} P(X_i \mid X_1, X_2, \ldots, X_{i-1}) \]

2. **Probability Distribution Based on Parent Nodes**:

   \[ P(X_1, X_2, \ldots, X_n) = \prod_{i=1}^{n} P(X_i \mid Pa(X_i)) \]

Through this method of dynamically requesting knowledge, the authors hope to better address uncertainty in the structure learning of Causal Bayesian Networks and to provide new ideas and directions for future research.
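
The parent-based factorization above can be illustrated with a minimal Python sketch. It is not taken from the paper: the three-node network (`Rain`, `Sprinkler`, `WetGrass`), its arcs, and every probability value are hypothetical and chosen only for illustration. The sketch multiplies one conditional probability per node, each conditioned only on that node's parents, to obtain the joint probability of a full assignment.

```python
from itertools import product

# Hypothetical graph (an assumption for illustration, not from the paper):
# Rain -> Sprinkler, Rain -> WetGrass, Sprinkler -> WetGrass.
# Each node maps to the ordered tuple of its parents, Pa(X_i).
PARENTS = {
    "Rain": (),
    "Sprinkler": ("Rain",),
    "WetGrass": ("Sprinkler", "Rain"),
}

# Hypothetical conditional probability tables: P(node = True | parent values),
# keyed by the tuple of parent values in the order given in PARENTS.
P_TRUE = {
    "Rain": {(): 0.2},
    "Sprinkler": {(True,): 0.01, (False,): 0.4},
    "WetGrass": {
        (True, True): 0.99,
        (True, False): 0.9,
        (False, True): 0.8,
        (False, False): 0.0,
    },
}


def joint_probability(assignment):
    """Return P(X_1, ..., X_n) as the product of P(X_i | Pa(X_i))."""
    prob = 1.0
    for node, parents in PARENTS.items():
        parent_values = tuple(assignment[p] for p in parents)
        p_true = P_TRUE[node][parent_values]
        # Use P(node = True | parents) or its complement, per the assignment.
        prob *= p_true if assignment[node] else 1.0 - p_true
    return prob


def all_assignments():
    """Yield every True/False assignment over the nodes of the network."""
    nodes = list(PARENTS)
    for values in product([True, False], repeat=len(nodes)):
        yield dict(zip(nodes, values))


# Sanity check: a valid factorization sums to 1 over all assignments.
total_probability = sum(joint_probability(a) for a in all_assignments())
```

For example, `joint_probability({"Rain": True, "Sprinkler": False, "WetGrass": True})` multiplies P(Rain), P(not Sprinkler | Rain) and P(WetGrass | not Sprinkler, Rain). The structure stored in `PARENTS` is exactly what structure learning has to recover, which is why a wrong arc or orientation changes which conditional distributions enter the product.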