Judgement Citation Retrieval using Contextual Similarity

Akshat Mohan Dasula,Hrushitha Tigulla,Preethika Bhukya
2024-08-15
Abstract:Traditionally in the domain of legal research, the retrieval of pertinent citations from intricate case descriptions has demanded manual effort and keyword-based search applications that mandate expertise in understanding legal jargon. Legal case descriptions hold pivotal information for legal professionals and researchers, necessitating more efficient and automated approaches. We propose a methodology that combines natural language processing (NLP) and machine learning techniques to enhance the organization and utilization of legal case descriptions. This approach revolves around the creation of textual embeddings with the help of state-of-art embedding models. Our methodology addresses two primary objectives: unsupervised clustering and supervised citation retrieval, both designed to automate the citation extraction process. Although the proposed methodology can be used for any dataset, we employed the Supreme Court of The United States (SCOTUS) dataset, yielding remarkable results. Our methodology achieved an impressive accuracy rate of 90.9%. By automating labor-intensive processes, we pave the way for a more efficient, time-saving, and accessible landscape in legal research, benefiting legal professionals, academics, and researchers.
Information Retrieval,Computation and Language,Machine Learning
What problem does this paper attempt to address?
The paper attempts to address the need for efficiently and accurately retrieving relevant citations from complex case descriptions in legal research. Traditionally, this retrieval task requires manual effort and relies on keyword search applications that understand legal terminology. However, with the surge in the number of legal cases and the complexity of legal texts, traditional manual methods have become inefficient and prone to errors. Therefore, the paper proposes a method that combines Natural Language Processing (NLP) and machine learning techniques, aiming to automate the citation extraction process through unsupervised clustering and supervised citation retrieval. This approach not only improves the efficiency and accuracy of legal research but also enables legal professionals to quickly and precisely access relevant information, thereby saving time and resources. Moreover, the method has shown significant results in practical applications, particularly on the United States Supreme Court (SCOTUS) dataset, achieving an accuracy rate of 90.9%. By automating these time-consuming processes, the method provides a more efficient, time-saving, and accessible solution for legal research.