Text Causal Discovery Based on Sequence Structure Information.

Yue Li, Donglin Cao, Dazhen Lin
DOI: https://doi.org/10.1007/978-981-99-8540-1_13
2024-01-01
Abstract: Causality forms the basis for reasoning and decision-making in artificial intelligence systems. To take advantage of the vast amount of textual data available today, causal discovery from text has become a significant challenge in recent years. Text data contains rich contextual semantic information. However, traditional causal discovery methods handle only structured data and do not consider the sequential relationships and semantic relevance between words in textual variables. To address this problem, we propose Text Causal Discovery Based on Sequence Structure Information (TCDSS), a causal discovery method that introduces sequence-structure information into the causal discovery algorithm. TCDSS discovers strongly correlated word pairs with semantic relevance and statistical causality, and finally constructs lexical causal graphs. We tested TCDSS on the DXY-COVID-19-Data and the Chinese Emergency Corpus (CEC) and compared it with other existing causal discovery methods. The experimental results show that PC, IGCI, RECI, and other methods improve in precision, recall, and structural Hamming distance (SHD) after the introduction of TCDSS.