Abstract:Latent semantic analysis (LSA) is a natural language statistical model, which is considered as a method to acquire, generalize, and represent knowledge. Compared with other retrieval models based on concept dictionaries or concept networks, the retrieval model based on LSA has the advantages of strong computability and less human participation. LSA establishes a latent semantic space through truncated singular value decomposition. Words and documents in the latent semantic space are projected onto the dimension representing the latent concept, and then the semantic relationship between words can be extracted to present the semantic structure in natural language. This paper designs the system architecture of the public prosecutorial knowledge graph. Combining the graph data storage technology and the characteristics of the public domain ontology, a knowledge graph storage method is designed. By building a prototype system, the functions of knowledge management, knowledge query, and knowledge push are realized. A named entity recognition method based on bidirectional long-short-term memory (bi-LSTM) combined with conditional random field (CRF) is proposed. Bi-LSTM-CRF performs named entity recognition based on character-level features. CRF can use the transition matrix to further obtain the relationship between each position label, so that bi-LSTM-CRF not only retains the context information but also considers the influence between the current position and the previous position. The experimental results show that the LSTM-entity-context method proposed in this paper improves the representation ability of text semantics compared with other algorithms. However, this method only introduces relevant entity information to supplement the semantic representation of the text. The order in the case is often ignored, especially when it comes to the time series of the case characteristics, and the "order problem" may eventually affect the final prediction result. The knowledge graph of legal documents of theft cases based on ontology can be updated and maintained in real time. The knowledge graph can conceptualize, share, and perpetuate knowledge related to procuratorial organs and can also reasonably utilize and mine many useful experiences and knowledge to assist in decision-making.

Learning Heterogeneous Graph Embedding for Chinese Legal Document Similarity

Legal Judgment Prediction via Heterogeneous Graphs and Knowledge of Law Articles

Legal Document Similarity Matching Based on Ensemble Learning

Design of Knowledge Graph Retrieval System for Legal and Regulatory Framework of Multilevel Latent Semantic Indexing

Similarity Analysis of Law Documents Based on Word2vec

BERT_LF: A Similar Case Retrieval Method Based on Legal Facts

Iterative Self-Supervised Learning for Legal Similar Case Retrieval

CaseLink: Inductive Graph Learning for Legal Case Retrieval

An Evaluation Dataset for Legal Word Embedding: A Case Study On Chinese Codex

LA-MGFM: A legal judgment prediction method via sememe-enhanced graph neural networks and multi-graph fusion mechanism

Discovering Hypernymy Relationships in Chinese Traffic Legal Texts

Siamese Capsule Network with Position Correlation and Integrating Articles of Law for Chinese Similar Case Matching.

Joint Learning-based Multiple Documents Heterogeneous Graph Inference for Biomedical Entity Linking.

Leverage Knowledge Graph and Large Language Model for Law Article Recommendation: A Case Study of Chinese Criminal Law

Hier-SPCNet: A Legal Statute Hierarchy-based Heterogeneous Network for Computing Legal Case Document Similarity

Event is More Valuable Than You Think: Improving the Similar Legal Case Retrieval Via Event Knowledge

Legal Element-oriented Modeling with Multi-view Contrastive Learning for Legal Case Retrieval

ChatLaw: Open-Source Legal Large Language Model with Integrated External Knowledge Bases

LegalGNN: Legal Information Enhanced Graph Neural Network for Recommendation

Word Embedding Based Document Similarity for the Inferring of Penalty.

A RoBERTa-GlobalPointer-Based Method for Named Entity Recognition of Legal Documents.