Domain knowledge-enriched summarization of legal judgment documents via grey wolf optimization

Deepali Jain,Malaya Dutta Borah,Anupam Biswas
DOI: https://doi.org/10.1016/bs.adcom.2023.11.005
IF: 3.067
2024-01-11
Advances in Computers
Abstract:Extractive summarization of legal documents involves extracting important sentences from documents. In this work, we model the extractive summarization task as an optimization problem in the complete output space, where the goal is to select a subset of important sentences from the document. An effective objective function is proposed that is infused with domain-specific knowledge along with the exploration of pretrained embeddings for better scoring of candidate summaries. In this work, we have considered a grey wolf optimization-based approach whose objective function formulation contains the legal-specific knowledge along with pretrained embeddings as one of the features of this objective function. The experimental evaluation of the proposed nature-inspired summarization approach is carried out on an annotated Indian Legal Judgment document summarization dataset with the help of ROUGE metrics. From the experimental analysis, it has been observed that the best ROUGE-1 score (0.56034) is achieved by GWO setting with 50 population size and 300 iterations which used the Mini-LM model for finding the pretrained embeddings, whereas the best ROUGE-2 and ROUGE-L scores are 0.30583 and 0.27621, respectively, which have been achieved by GWO setting with 10 population sizes and 300 iterations using the general Legal Bert model. From the experimental results, the improved performances of Legal Bert-based embeddings with a higher number of iterations are observed. Such an approach can have very high practical utility in getting the gist out of lengthy legal judgment documents.
computer science, software engineering, hardware & architecture
What problem does this paper attempt to address?