Automatic Knowledge Graph Construction for Judicial Cases

Jie Zhou,Xin Chen,Hang Zhang,Zhe Li
2024-04-15
Abstract:In this paper, we explore the application of cognitive intelligence in legal knowledge, focusing on the development of judicial artificial intelligence. Utilizing natural language processing (NLP) as the core technology, we propose a method for the automatic construction of case knowledge graphs for judicial cases. Our approach centers on two fundamental NLP tasks: entity recognition and relationship extraction. We compare two pre-trained models for entity recognition to establish their efficacy. Additionally, we introduce a multi-task semantic relationship extraction model that incorporates translational embedding, leading to a nuanced contextualized case knowledge representation. Specifically, in a case study involving a "Motor Vehicle Traffic Accident Liability Dispute," our approach significantly outperforms the baseline model. The entity recognition F1 score improved by 0.36, while the relationship extraction F1 score increased by 2.37. Building on these results, we detail the automatic construction process of case knowledge graphs for judicial cases, enabling the assembly of knowledge graphs for hundreds of thousands of judgments. This framework provides robust semantic support for applications of judicial AI, including the precise categorization and recommendation of related cases.
Computation and Language
What problem does this paper attempt to address?
This paper aims to solve the problem of automatic construction of judicial case knowledge graphs. Specifically, the research objective is to realize the automatic construction of knowledge graphs for the judgment documents of "motor vehicle traffic accident liability disputes" cases through natural language processing technologies, especially the two core tasks of entity recognition and relationship extraction. The main contributions of the paper are as follows: 1. **Improvement of Entity Recognition Model**: Two BERT - based entity recognition models were compared, and experiments showed that using Conditional Random Field (CRF) in the decoding output layer can further improve the effect of entity recognition, with the F1 score increased by 0.36. In addition, a fusion model suitable for entity recognition in traffic accident liability dispute cases was proposed. 2. **Multi - task Semantic Relationship Extraction Model**: A BERT multi - task semantic relationship extraction model (BERT - Multitask) combined with translation embedding was proposed. Compared with the baseline model, the F1 value of the relationship extraction results was increased by 2.37. 3. **Process Design for Automatic Construction of Case Knowledge Graphs**: A process for automatic construction of case knowledge graphs integrating structured and unstructured texts was designed. The feasibility and effectiveness of this process were verified, and a large - scale judicial case knowledge graph was constructed, providing semantic support for downstream tasks such as accurate push of similar cases. Through these contributions, the paper not only promotes the development of judicial artificial intelligence technically, but also provides strong support for tasks such as case classification and related case recommendation in practical applications.