Abstract:Automatic legal judgment prediction and its explanation suffer from the problem of long case documents exceeding tens of thousands of words, in general, and having a non-uniform structure. Predicting judgments from such documents and extracting their explanation becomes a challenging task, more so on documents with no structural annotation. We define this problem as "scarce annotated legal documents" and explore their lack of structural information and their long lengths with a deep-learning-based classification framework which we call MESc; "Multi-stage Encoder-based Supervised with-clustering"; for judgment prediction. We explore the adaptability of LLMs with multi-billion parameters (GPT-Neo, and GPT-J) to legal texts and their intra-domain(legal) transfer learning capacity. Alongside this, we compare their performance and adaptability with MESc and the impact of combining embeddings from their last layers. For such hierarchical models, we also propose an explanation extraction algorithm named ORSE; Occlusion sensitivity-based Relevant Sentence Extractor; based on the input-occlusion sensitivity of the model, to explain the predictions with the most relevant sentences from the document. We explore these methods and test their effectiveness with extensive experiments and ablation studies on legal documents from India, the European Union, and the United States with the ILDC dataset and a subset of the LexGLUE dataset. MESc achieves a minimum total performance gain of approximately 2 points over previous state-of-the-art proposed methods, while ORSE applied on MESc achieves a total average gain of 50% over the baseline explainability scores.

Semantic Segmentation of Legal Documents via Rhetorical Roles

Rhetorical Role Labeling of Legal Documents using Transformers and Graph Neural Networks

Corpus for Automatic Structuring of Legal Documents

Mind Your Neighbours: Leveraging Analogous Instances for Rhetorical Role Labeling for Legal Documents

Identification of Rhetorical Roles of Sentences in Indian Legal Judgments

Enhancing Pre-Trained Language Models with Sentence Position Embeddings for Rhetorical Roles Recognition in Legal Opinions

An MRC Framework for Semantic Role Labeling

Understand Legal Documents with Contextualized Large Language Models

Understanding the Logical and Semantic Structure of Large Documents

A Hierarchical Neural Framework for Classification and its Explanation in Large Unstructured Legal Documents

SLJP: Semantic Extraction based Legal Judgment Prediction

Towards Unsupervised Question Answering System with Multi-level Summarization for Legal Text

The Law of Large Documents: Understanding the Structure of Legal Contracts Using Visual Cues

Modeling Legal Reasoning: LM Annotation at the Edge of Human Agreement

A case study for automated attribute extraction from legal documents using large language models

Modeling of automated glowworm swarm optimization based deep learning model for legal text summarization

Learning from syntax generalizations for automatic semantic annotation

Capturing Logical Structure of Visually Structured Documents with Multimodal Transition Parser

Natural Language Processing for the Legal Domain: A Survey of Tasks, Datasets, Models, and Challenges

TransformLLM: Adapting Large Language Models via LLM-Transformed Reading Comprehension Text

Nonet at SemEval-2023 Task 6: Methodologies for Legal Evaluation