CRE-LLM: A Domain-Specific Chinese Relation Extraction Framework with Fine-tuned Large Language Model

Zhengpeng Shi,Haoran Luo

2024-04-28

Abstract:Domain-Specific Chinese Relation Extraction (DSCRE) aims to extract relations between entities from domain-specific Chinese text. Despite the rapid development of PLMs in recent years, especially LLMs, DSCRE still faces three core challenges: complex network structure design, poor awareness, and high consumption of fine-tuning. Given the impressive performance of large language models (LLMs) in natural language processing, we propose a new framework called CRE-LLM. This framework is based on fine-tuning open-source LLMs, such as Llama-2, ChatGLM2, and Baichuan2. CRE-LLM enhances the logic-awareness and generative capabilities of the model by constructing an appropriate prompt and utilizing open-source LLMs for instruction-supervised fine-tuning. And then it directly extracts the relations of the given entities in the input textual data, which improving the CRE approach. To demonstrate the effectiveness of the proposed framework, we conducted extensive experiments on two domain-specific CRE datasets, FinRE and SanWen. The experimental results show that CRE-LLM is significantly superior and robust, achieving state-of-the-art (SOTA) performance on the FinRE dataset. This paper introduces a novel approach to domain-specific relation extraction (DSCRE) tasks that are semantically more complex by combining LLMs with triples. Our code is publicly available.

Computation and Language

What problem does this paper attempt to address?

This paper mainly discusses the issue of Domain-Specific Chinese Relation Extraction (DS-CRE). DS-CRE is a task of extracting relationships between entities from domain-specific Chinese text. Due to limited data, language diversity, and high fine-tuning cost, there are still challenges in this field. The paper proposes a new framework called CRE-LLM, which is based on fine-tuning of Large Language Models (LLMs) such as Llama-2, ChatGLM2, and Baichuan2. CRE-LLM enhances the model's logical awareness and generation capability by constructing appropriate prompts and instruction supervisions for fine-tuning, directly extracting relationships from input text data. This approach simplifies the network architecture design, improves the understanding of internal relationships, and reduces the resource consumption of fine-tuning. The paper experiments on two domain-specific relation extraction datasets, FinRE and SanWen, and the results show that CRE-LLM achieves state-of-the-art performance on FinRE. In summary, the paper attempts to address how to effectively perform domain-specific Chinese relation extraction using large language models and overcome the problems of existing methods in terms of network design complexity, inadequate understanding, and high fine-tuning cost.

CRE-LLM: A Domain-Specific Chinese Relation Extraction Framework with Fine-tuned Large Language Model

Making LLMs As Fine-Grained Relation Extraction Data Augmentor

Empirical Analysis of Dialogue Relation Extraction with Large Language Models

Empowering Few-Shot Relation Extraction with The Integration of Traditional RE Methods and Large Language Models

Relation Extraction with Fine-Tuned Large Language Models in Retrieval Augmented Generation Frameworks

LLM with Relation Classifier for Document-Level Relation Extraction

Chinese Relation Extraction With Multi-Grained Information And External Linguistic Knowledge

Recall, Retrieve and Reason: Towards Better In-Context Relation Extraction

Towards Realistic Low-resource Relation Extraction: A Benchmark with Empirical Baseline Study

Evidence Reasoning and Curriculum Learning for Document-Level Relation Extraction

Grasping the Essentials: Tailoring Large Language Models for Zero-Shot Relation Extraction

Document-level Relation Extraction with Entity Interaction and Commonsense Knowledge

SelfLRE: Self-refining Representation Learning for Low-resource Relation Extraction

AgentRE: An Agent-Based Framework for Navigating Complex Information Landscapes in Relation Extraction

AutoRE: Document-Level Relation Extraction with Large Language Models

Not Just Plain Text! Fuel Document-Level Relation Extraction with Explicit Syntax Refinement and Subsentence Modeling

How Good are LLMs at Relation Extraction under Low-Resource Scenario? Comprehensive Evaluation

Not Just Plain Text! Fuel Document-Level Relation Extraction with Explicit Syntax Refinement and Subsentence Modeling

GPT-RE: In-context Learning for Relation Extraction using Large Language Models

Document-Level Relation Extraction with Entity Enhancement and Context Refinement