RealTCD: Temporal Causal Discovery from Interventional Data with Large Language Model

Peiwen Li, Xin Wang, Zeyang Zhang, Yuan Meng, Fang Shen, Yue Li, Jialong Wang, Yang Li, Wenwu Zhu
DOI: https://doi.org/10.1145/3627673.3680042
2024-01-01
Abstract: In the field of Artificial Intelligence for Information Technology Operations, causal discovery is pivotal for operation and maintenance of graph construction, facilitating downstream industrial tasks such as root cause analysis. Temporal causal discovery, as an emerging method, aims to identify temporal causal relationships between variables directly from observations by utilizing interventional data. However, existing methods mainly focus on synthetic datasets with heavy reliance on intervention targets and ignore the textual information hidden in real-world systems, failing to conduct causal discovery for real industrial scenarios. To tackle this problem, in this paper we propose to investigate temporal causal discovery in industrial scenarios, which faces two critical challenges: 1) how to discover causal relationships without the interventional targets that are costly to obtain in practice, and 2) how to discover causal relations via leveraging the textual information in systems which can be complex yet abundant in industrial contexts. To address these challenges, we propose the RealTCD framework, which is able to leverage domain knowledge to discover temporal causal relationships without interventional targets. Specifically, we first develop a score-based temporal causal discovery method capable of discovering causal relations for root cause analysis without relying on interventional targets through strategic masking and regularization. Furthermore, by employing Large Language Models (LLMs) to handle texts and integrate domain knowledge, we introduce LLM-guided meta-initialization to extract the meta-knowledge from textual information hidden in systems to boost the quality of discovery. We conduct extensive experiments on simulation and real-world datasets to show the superiority of our proposed RealTCD framework over existing baselines in discovering temporal causal structures.