Abstract:Legal case retrieval is an information retrieval task in the legal domain, which aims to retrieve relevant cases with a given query case. Recent research of legal case retrieval mainly relies on traditional bag-of-words models and language models. Although these methods have achieved significant improvement in retrieval accuracy, there are still two challenges: (1) Legal structural information neglect. Previous neural legal case retrieval models mostly encode the unstructured raw text of case into a case representation, which causes the lack of important legal structural information in a case and leads to poor case representation; (2) Lengthy legal text limitation. When using the powerful BERT-based models, there is a limit of input text lengths, which inevitably requires to shorten the input via truncation or division with a loss of legal context information. In this paper, a graph neural networks-based legal case retrieval model, CaseGNN, is developed to tackle these challenges. To effectively utilise the legal structural information during encoding, a case is firstly converted into a Text-Attributed Case Graph (TACG), followed by a designed Edge Graph Attention Layer and a readout function to obtain the case graph representation. The CaseGNN model is optimised with a carefully designed contrastive loss with easy and hard negative sampling. Since the text attributes in the case graph come from individual sentences, the restriction of using language models is further avoided without losing the legal context. Extensive experiments have been conducted on two benchmarks from COLIEE 2022 and COLIEE 2023, which demonstrate that CaseGNN outperforms other state-of-the-art legal case retrieval methods. The code has been released on <a class="link-external link-https" href="https://github.com/yanran-tang/CaseGNN" rel="external noopener nofollow">this https URL</a>.

Named Entity Recognition in the Legal Domain using a Pointer Generator Network

E-NER -- An Annotated Named Entity Recognition Corpus of Legal Text

BTPK-based learning: An Interpretable Method for Named Entity Recognition

Named entity recognition for Chinese based on global pointer and adversarial training

Combining prompt-based language models and weak supervision for labeling named entity recognition on legal documents

Fine-grained Contract NER using instruction based model

Bottom-Up Constituency Parsing and Nested Named Entity Recognition with Pointer Networks

BTPK-based interpretable method for NER tasks based on Talmudic Public Announcement Logic

Automated Refugee Case Analysis: An NLP Pipeline for Supporting Legal Practitioners

Bonafide at LegalLens 2024 Shared Task: Using Lightweight DeBERTa Based Encoder For Legal Violation Detection and Resolution

Named Entity Recognition for English Language Using Deep Learning Based Bi Directional LSTM-RNN

Incorporating Entity Type-Aware and Word–Word Relation-Aware Attention in Generative Named Entity Recognition

CaseGNN: Graph Neural Networks for Legal Case Retrieval with Text-Attributed Graphs

NERetrieve: Dataset for Next Generation Named Entity Recognition and Retrieval

Named Entity Recognition Method of Legal Instruments Based on Improved Few-Shot Learning

Locate and Label: A Two-stage Identifier for Nested Named Entity Recognition

Boosting court judgment prediction and explanation using legal entities

Legal Text Recognition Using LSTM-CRF Deep Learning Model

Conditional random fields with semantic enhancement for named-entity recognition

ToNER: Type-oriented Named Entity Recognition with Generative Language Model

GPT-NER: Named Entity Recognition via Large Language Models