Abstract:To tackle the problem of domain-specific knowledge scarcity within large language models (LLMs), knowledge graph-retrievalaugmented method has been proven to be an effective and efficient technique for knowledge infusion. However, existing approaches face two primary challenges: knowledge mismatch between public available knowledge graphs and the specific domain of the task at hand, and poor information compliance of LLMs with knowledge graphs. In this paper, we leverage a small set of labeled samples and a large-scale corpus to efficiently construct domain-specific knowledge graphs by an LLM, addressing the issue of knowledge mismatch. Additionally, we propose a three-stage KG-LLM alignment strategyto enhance the LLM's capability to utilize information from knowledge graphs. We conduct experiments with a limited-sample setting on two biomedical question-answering datasets, and the results demonstrate that our approach outperforms existing baselines.

What problem does this paper attempt to address?

### The Problem Addressed by This Paper This paper aims to address the issue of knowledge insufficiency in the application of large language models (LLMs) in specific domains, particularly in the medical field. Specifically, the paper identifies the following two main challenges: 1. **Knowledge Mismatch**: Existing methods typically utilize publicly available knowledge graphs for knowledge injection, but these graphs often fail to fully cover the highly specialized knowledge required for specific tasks. 2. **Poor Information Consistency**: The structured triplet form in knowledge graphs differs from natural language text, making it difficult for LLMs to effectively utilize this information, especially in scenarios with scarce supervised samples. To address these issues, the authors propose a modular knowledge injection framework, which mainly includes the following steps: 1. **Efficient Construction of Domain Knowledge Graphs**: Train a knowledge extraction model based on LLMs using a small amount of annotated data, and extract knowledge triplets from a large domain-specific corpus to construct a domain-specific knowledge graph. 2. **Pre-learning Stage**: Use the K-LoRA method to enable the model to understand and process domain knowledge by generating triplets into text. 3. **Supervised Fine-tuning Stage**: Further fine-tune the model by combining the retrieval results from the knowledge graph to improve the quality of its output. 4. **Automatic Knowledge Graph Feedback Stage (AKGF)**: Use the knowledge graph as an automatic evaluation tool to provide feedback on the knowledge correctness of the generated content and further optimize the model. Through these steps, the proposed method significantly improves the performance of LLMs in specific domains, such as biomedical question answering.

Efficient Knowledge Infusion via KG-LLM Alignment

Enhancing Large Language Models with Knowledge Graphs for Robust Question Answering

Knowledge Solver: Teaching LLMs to Search for Domain Knowledge from Knowledge Graphs

Knowledge Graph-Enhanced Large Language Models via Path Selection

InfuserKI: Enhancing Large Language Models with Knowledge Graphs via Infuser-Guided Knowledge Integration

KGLens: Towards Efficient and Effective Knowledge Probing of Large Language Models with Knowledge Graphs

Integrated Application of LLM Model and Knowledge Graph in Medical Text Mining and Knowledge Extraction

Fact Finder -- Enhancing Domain Expertise of Large Language Models by Incorporating Knowledge Graphs

Combining Knowledge Graphs and Large Language Models

CogMG: Collaborative Augmentation Between Large Language Model and Knowledge Graph

LLM-Align: Utilizing Large Language Models for Entity Alignment in Knowledge Graphs

Enhancing Large Language Models with Pseudo- and Multisource- Knowledge Graphs for Open-ended Question Answering

LLMs for Knowledge Graph Construction and Reasoning: Recent Capabilities and Future Opportunities

KG-Rank: Enhancing Large Language Models for Medical QA with Knowledge Graphs and Ranking Techniques

Learning to Plan for Retrieval-Augmented Large Language Models from Knowledge Graphs

KaLM: Knowledge-aligned Autoregressive Language Modeling via Dual-view Knowledge Graph Contrastive Learning

KnowledgeNavigator: Leveraging Large Language Models for Enhanced Reasoning over Knowledge Graph

Retrieve-Rewrite-Answer: A KG-to-Text Enhanced LLMs Framework for Knowledge Graph Question Answering

SAC-KG: Exploiting Large Language Models as Skilled Automatic Constructors for Domain Knowledge Graphs