Zero-Shot Cross-Lingual Document-Level Event Causality Identification with Heterogeneous Graph Contrastive Transfer Learning

Zhitao He, Pengfei Cao, Zhuoran Jin, Yubo Chen, Kang Liu, Zhiqiang Zhang, Mengshu Sun, Jun Zhao
2024-03-22
Abstract: Event Causality Identification (ECI) refers to the detection of causal relations between events in texts. However, most existing studies focus on sentence-level ECI with high-resource languages, leaving more challenging document-level ECI (DECI) with low-resource languages under-explored. In this paper, we propose a Heterogeneous Graph Interaction Model with Multi-granularity Contrastive Transfer Learning (GIMC) for zero-shot cross-lingual document-level ECI. Specifically, we introduce a heterogeneous graph interaction network to model the long-distance dependencies between events that are scattered over a document. Then, to improve cross-lingual transferability of causal knowledge learned from the source language, we propose a multi-granularity contrastive transfer learning module to align the causal representations across languages. Extensive experiments show our framework outperforms the previous state-of-the-art model by 9.4% and 8.2% of average F1 score on monolingual and multilingual scenarios respectively. Notably, in the multilingual scenario, our zero-shot framework even exceeds GPT-3.5 with few-shot learning by 24.3% in overall performance.
Computation and Language, Artificial Intelligence
What problem does this paper attempt to address?
The paper addresses zero-shot cross-lingual document-level event causality identification (DECI). Its research objectives are:

- **Cross-lingual Zero-shot Learning**: Transfer causal knowledge learned from a resource-rich source language to low-resource target languages, so that document-level event causality can be identified in the target language without target-language training data.
- **Document-level Event Causality Identification**: Most existing work focuses on sentence-level event causality identification and uses English corpora. The document-level task is more challenging because many causal relations span multiple sentences.

To address these issues, the authors propose a Heterogeneous Graph Interaction Model with Multi-granularity Contrastive Transfer Learning (GIMC). The model has two key components:

1. **Multi-granularity Contrastive Transfer Learning Module**: Aligns causal representations across languages, which supports the transfer of language-independent causal knowledge.
2. **Heterogeneous Graph Interaction Network**: Comprises information phrase nodes, sentence nodes, statement nodes, and event pair nodes, and models long-distance dependencies between events scattered over a document.

In experiments on widely used multilingual datasets, GIMC outperforms the previous state-of-the-art model by 9.4% and 8.2% in average F1 score in the monolingual and multilingual scenarios, respectively. In the multilingual scenario, the zero-shot framework also exceeds GPT-3.5 with few-shot learning by 24.3% in overall performance.
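The idea of aligning representations across languages with a contrastive objective can be illustrated with a small sketch. The summary above does not give the paper's actual loss, so this example substitutes a generic InfoNCE-style loss at a single granularity. The function names `cosine` and `info_nce`, the temperature value, and the toy vectors are all assumptions made for illustration, not the authors' implementation.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def info_nce(source, target, temperature=0.1):
    """Mean InfoNCE loss (hypothetical helper, not from the paper).

    source[i] and target[i] are treated as a positive pair, for example
    the same event pair encoded in the source and target language.
    Every other target[j] in the batch acts as a negative.
    """
    losses = []
    for i, s in enumerate(source):
        logits = [cosine(s, t) / temperature for t in target]
        # Numerically stable log-sum-exp over all candidates.
        max_logit = max(logits)
        log_denominator = max_logit + math.log(
            sum(math.exp(x - max_logit) for x in logits)
        )
        # Negative log-probability assigned to the positive pair.
        losses.append(log_denominator - logits[i])
    return sum(losses) / len(losses)


# Toy 2-dimensional representations (assumed values, not real data).
src = [[1.0, 0.0], [0.0, 1.0]]
aligned = [[0.9, 0.1], [0.1, 0.9]]      # each target is close to its source
misaligned = [[0.1, 0.9], [0.9, 0.1]]   # targets are swapped

loss_aligned = info_nce(src, aligned)
loss_misaligned = info_nce(src, misaligned)
```

In this toy setup the loss is lower when each source vector sits close to its paired target, and minimizing it is what pulls paired representations together. A multi-granularity variant would presumably apply a loss of this kind at more than one level of representation, but the levels and weighting used in GIMC are not stated in the text above.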