Abstract:Identifying causal relations is crucial for a variety of downstream tasks. In additional to observational data, background knowledge (BK), which could be attained from human expertise or experiments, is usually introduced for uncovering causal relations. This raises an open problem that in the presence of latent variables, what causal relations are identifiable from observational data and BK. In this paper, we propose two novel rules for incorporating BK, which offer a new perspective to the open problem. In addition, we show that these rules are applicable in some typical causality tasks, such as determining the set of possible causal effects with observational data. Our rule-based approach enhances the state-of-the-art method by circumventing a process of enumerating block sets that would otherwise take exponential complexity.
What problem does this paper attempt to address?
The paper attempts to address the problem of identifying causal relationships from observational data and background knowledge (BK) in the presence of latent variables. Specifically, the authors propose two new rules to incorporate background knowledge, which can help determine which causal relationships can be identified given the observational data and background knowledge. These issues are crucial for downstream tasks as they involve the extent to which causal relationships can be inferred from available information.
### Background and Motivation
In many machine learning fields, such as semi-supervised learning, reinforcement learning, transfer learning, etc., the application of causal thinking has opened up new research directions. Causal relationships are usually represented by causal graphs, and the structure of these graphs is often unknown, making it essential to uncover causal relationships to solve downstream tasks. Existing theoretical results indicate that only a Markov equivalence class (MEC) can be identified from observational data alone, which contains some uncertain causal relationships. To further uncover these relationships, additional structural knowledge is usually introduced, which can be obtained from experiments or human expertise, i.e., background knowledge (BK).
### Main Contributions
1. **Propose Two New Rules**: The authors propose two new rules (R12 and R13) for incorporating background knowledge in the presence of latent variables. These two rules not only identify causal relationships but also improve the efficiency of existing methods in some typical causal tasks, such as determining the possible causal effect set.
2. **Application in Set Determination Tasks**: Even without background knowledge, these two new rules can be applied to typical causal tasks that rely solely on observational data. The authors demonstrate how to reduce the exponential computational complexity by improving the existing state-of-the-art method (PAGcauses).
### Method Overview
- **Definition of Unbridged Path**: The authors first define an important concept—an unbridged path relative to the vertex set \( V' \). This path has an interesting property: if each vertex in the path is not an ancestor of \( V' \) in \( H \), then in any MAG consistent with \( H \), these vertices must be ancestors of \( V' \).
- **Rules R12 and R13**:
- **R12**: If there is an edge \( A \circ - * B \) in a PMG \( H \), and there exists an unbridged path \( \langle K_1, \ldots, K_m \rangle \) relative to \( S_A \), and for each vertex \( K_i \) there is an uncovered possible directed path \( \langle A, B, \ldots, K_i \rangle \), then convert \( A \circ - * B \) to \( A \leftarrow * B \).
- **R13**: If there is an edge \( A \circ - * B \) in a PMG \( H \), and there exists an uncovered possible directed path \( \langle A, B, \ldots, K \rangle \), where \( K \) is an ancestor of \( S_A \), then convert \( A \circ - * B \) to \( A \leftarrow * B \).
- **Algorithm Implementation**: The authors provide Algorithm 1 to implement R12 and R13 and prove its correctness and complexity.
### Application Example
- **Set Determination Task**: The authors demonstrate how to apply the new rules to the set determination task, particularly in the presence of latent variables, to determine the possible causal effect set of vertex \( X \) on vertex \( Y \). By introducing R12, the enumeration of block sets can be avoided, significantly reducing computational complexity.
### Conclusion
The paper makes significant progress in identifying causal relationships by incorporating background knowledge in the presence of latent variables through the proposal of two new rules (R12 and R13). These rules not only improve the efficiency of existing methods but also provide new perspectives for future research.