DISC-LawLLM: Fine-tuning Large Language Models for Intelligent Legal Services

Shengbin Yue,Wei Chen,Siyuan Wang,Bingxuan Li,Chenchen Shen,Shujun Liu,Yuxuan Zhou,Yao Xiao,Song Yun,Xuanjing Huang,Zhongyu Wei

2023-09-24

Abstract:We propose DISC-LawLLM, an intelligent legal system utilizing large language models (LLMs) to provide a wide range of legal services. We adopt legal syllogism prompting strategies to construct supervised fine-tuning datasets in the Chinese Judicial domain and fine-tune LLMs with legal reasoning capability. We augment LLMs with a retrieval module to enhance models' ability to access and utilize external legal knowledge. A comprehensive legal benchmark, DISC-Law-Eval, is presented to evaluate intelligent legal systems from both objective and subjective dimensions. Quantitative and qualitative results on DISC-Law-Eval demonstrate the effectiveness of our system in serving various users across diverse legal scenarios. The detailed resources are available at <a class="link-external link-https" href="https://github.com/FudanDISC/DISC-LawLLM" rel="external noopener nofollow">this https URL</a>.

Computation and Language

What problem does this paper attempt to address?

The paper aims to address the need for intelligent legal systems in the legal field, particularly by leveraging large language models (LLMs) to provide a wide range of legal services. Specifically, the paper proposes DISC-LawLLM, a large language model optimized for legal reasoning and knowledge retrieval capabilities. The research team attempts to solve the problem through the following points: 1. **Constructing a high-quality supervised fine-tuning dataset**: By adopting a legal syllogism prompting strategy, a supervised fine-tuning dataset, DISC-Law-SFT, was constructed within the Chinese judicial domain to train the model with legal reasoning capabilities. 2. **Enhancing retrieval and reasoning capabilities**: A retrieval module was introduced, enabling the model to access and utilize external legal knowledge, thereby improving its reliability and accuracy in practical applications. 3. **Designing a comprehensive evaluation benchmark**: A comprehensive legal benchmark test, DISC-Law-Eval, was proposed to evaluate intelligent legal systems from both objective and subjective dimensions. Through these methods, the paper demonstrates the effectiveness of DISC-LawLLM in handling legal issues of varying difficulty levels and significantly outperforms existing large legal language models in multiple tasks.

DISC-LawLLM: Fine-tuning Large Language Models for Intelligent Legal Services

Fine-tuning and Application of Large Language Model in Law Domain

LawLLM: Law Large Language Model for the US Legal System

InternLM-Law: An Open Source Chinese Legal Large Language Model

A Comprehensive Evaluation of Large Language Models on Legal Judgment Prediction

Optimizing Numerical Estimation and Operational Efficiency in the Legal Domain through Large Language Models

LAiW: A Chinese Legal Large Language Models Benchmark

LexEval: A Comprehensive Chinese Legal Benchmark for Evaluating Large Language Models

LawBench: Benchmarking Legal Knowledge of Large Language Models

DISC-MedLLM: Bridging General Large Language Models and Real-World Medical Consultation

DeliLaw: A Chinese Legal Counselling System Based on a Large Language Model

DISC-FinLLM: A Chinese Financial Large Language Model based on Multiple Experts Fine-tuning

FedJudge: Federated Legal Large Language Model

Enabling Discriminative Reasoning in LLMs for Legal Judgment Prediction

Knowledge-Infused Legal Wisdom: Navigating LLM Consultation through the Lens of Diagnostics and Positive-Unlabeled Reinforcement Learning

Large language models for automated Q&A involving legal documents: a survey on algorithms, frameworks and applications

Large Language Models are legal but they are not: Making the case for a powerful LegalLLM

Large Language Models in Law: A Survey

Enhancing Logical Reasoning in Large Language Models to Facilitate Legal Applications

Improving Clinical Expertise in Large Language Models Using Electronic Medical Records

ChatLaw: Open-Source Legal Large Language Model with Integrated External Knowledge Bases