SM-BERT-CR: a deep learning approach for case law retrieval with supporting model

Yen Thi-Hai Vuong,Quan Minh Bui,Ha-Thanh Nguyen,Thi-Thu-Trang Nguyen,Vu Tran,Xuan-Hieu Phan,Ken Satoh,Le-Minh Nguyen
DOI: https://doi.org/10.1007/s10506-022-09319-6
IF: 4.1
2022-08-11
Artificial Intelligence and Law
Abstract:Case law retrieval is the task of locating truly relevant legal cases given an input query case. Unlike information retrieval for general texts, this task is more complex with two phases ( legal case retrieval and legal case entailment ) and much harder due to a number of reasons. First, both the query and candidate cases are long documents consisting of several paragraphs. This makes it difficult to model with representation learning that usually has restriction on input length. Second, the concept of relevancy in this domain is defined based on the legal relation that goes beyond the lexical or topical relevance. This is a real challenge because normal text matching will not work. Third, building a large and accurate legal case dataset requires a lot of effort and expertise. This is obviously an obstacle to creating enough data for training deep retrieval models. In this paper, we propose a novel approach called supporting model that can deal with both phases. The underlying idea is the case–case supporting relation and the paragraph–paragraph as well as the decision-paragraph matching strategy. In addition, we propose a method to automatically create a large weak-labeling dataset to overcome the lack of data. The experiments showed that our solution has achieved the state-of-the-art results for both case retrieval and case entailment phases.
computer science, artificial intelligence, interdisciplinary applications,law
What problem does this paper attempt to address?