A Deep Learning Approach for Document-level Chinese Financial Event Extraction

Sapae Phyu,Wenyue Li,Qin Liu,Hongming Zhu
DOI: https://doi.org/10.1145/3689299.3689316
2024-01-01
Abstract:Driven by advancements in Artificial Intelligence (AI) techniques, significant progress has been made in the improvement of Natural Language Processing (NLP). One area of particular interest within NLP is the extraction of events from textual data. While studies on the event extraction has traditionally concentrated on the sentence level, the increasing real-world implementation of event extraction has significantly raised the demand for document-level event extraction. Two common challenges in the research of document-level event extraction are event arguments scattering across different sentences and more than one event existing in one input document. Additionally, we recognized that the class imbalance problem exists in the benchmark dataset. A novel deep-learning approach for document-level Chinese financial event extraction by using axial attention and adaptive focal loss is proposed in this paper. Previous studies showed that relation information among entities is critical to solving the argument scattering issue and multi-event issue since both of the issues are due to a complex semantic understanding of input text. Our method leverages up-to-date axial attention mechanisms to capture relations among entities and found that the benchmark dataset does not include enough argument-scattering data to cover all of the five event types. To solve the problem of class imbalance, we leverage an adaptive focal loss function that adjusts the important weights of samples during training dynamically, enabling the model to focus more on minority class instances and improve overall performance. In conclusion, our methodology leverages axial attention and adaptive focal loss and solves the problems of multi-event occurrences, cross-sentence relationships among event arguments, and class imbalance. We conducted extensive experiments on a large-scale Chinese financial event extraction dataset, demonstrating that our proposed approach outperforms baseline methods.
What problem does this paper attempt to address?