Causal Walk: Debiasing Multi-Hop Fact Verification with Front-Door Adjustment

Congzhi Zhang,Linhai Zhang,Deyu Zhou
2024-03-05
Abstract:Conventional multi-hop fact verification models are prone to rely on spurious correlations from the annotation artifacts, leading to an obvious performance decline on unbiased datasets. Among the various debiasing works, the causal inference-based methods become popular by performing theoretically guaranteed debiasing such as casual intervention or counterfactual reasoning. However, existing causal inference-based debiasing methods, which mainly formulate fact verification as a single-hop reasoning task to tackle shallow bias patterns, cannot deal with the complicated bias patterns hidden in multiple hops of evidence. To address the challenge, we propose Causal Walk, a novel method for debiasing multi-hop fact verification from a causal perspective with front-door adjustment. Specifically, in the structural causal model, the reasoning path between the treatment (the input claim-evidence graph) and the outcome (the veracity label) is introduced as the mediator to block the confounder. With the front-door adjustment, the causal effect between the treatment and the outcome is decomposed into the causal effect between the treatment and the mediator, which is estimated by applying the idea of random walk, and the causal effect between the mediator and the outcome, which is estimated with normalized weighted geometric mean approximation. To investigate the effectiveness of the proposed method, an adversarial multi-hop fact verification dataset and a symmetric multi-hop fact verification dataset are proposed with the help of the large language model. Experimental results show that Causal Walk outperforms some previous debiasing methods on both existing datasets and the newly constructed datasets. Code and data will be released at
Computation and Language
What problem does this paper attempt to address?
The problem this paper attempts to address is the issue of data bias in multi-hop fact verification. Specifically, traditional multi-hop fact verification models tend to rely on incidental associations during the annotation process, leading to a significant drop in performance on unbiased datasets. Existing causal inference debiasing methods mainly target single-hop reasoning tasks and cannot handle the complex bias patterns in multi-hop evidence. Therefore, this paper proposes a novel method based on Front-Door Adjustment—Causal Walk, aiming to reduce bias in multi-hop fact verification from a causal perspective. ### Main Contributions of the Paper: 1. **First Use of Front-Door Adjustment for Debiasing**: To the best of the authors' knowledge, this is the first time Front-Door Adjustment has been used to reduce bias in multi-hop fact verification tasks. 2. **Proposing the Causal Walk Method**: By introducing the reasoning path as a mediator variable, causal intervention is achieved, thereby reducing bias in multi-hop fact verification. 3. **Experimental Results Validate Effectiveness**: Experimental results show that Causal Walk outperforms previous debiasing methods on both existing and newly constructed datasets. ### Key Points of the Solution: - **Structural Causal Model (SCM)**: The causal relationships in multi-hop fact verification are represented as a structural causal model, where G is the graph containing claims and evidence, L is the veracity label, U is the unobserved confounders, and R is the reasoning path. - **Front-Door Adjustment**: By measuring the causal effect between G and L, it is decomposed into the causal effect between G and R and the causal effect between R and L. Specifically, the causal effect between G and R is estimated using a random walk method, and the causal effect between R and L is approximated using Normalized Weighted Geometric Mean (NWGM). - **Dataset Construction**: To validate the effectiveness of the method, adversarial multi-hop fact verification datasets and symmetric multi-hop fact verification datasets were constructed. ### Experimental Results: - **On the PolitiHop Dataset and Its Variants**: Causal Walk achieved the best performance on various variant datasets, particularly excelling on the Hard PolitiHop dataset, demonstrating its robustness to misleading evidence. - **On the FEVER Dataset and Its Variants**: Causal Walk significantly outperformed other models on the multi-hop dataset (FEVER-MH), further validating its multi-hop reasoning capability. In summary, this paper effectively reduces data bias in multi-hop fact verification by introducing Front-Door Adjustment and causal intervention, thereby improving the robustness and generalization ability of the model.