Efficient Knowledge Infusion via KG-LLM Alignment

Zhouyu Jiang,Ling Zhong,Mengshu Sun,Jun Xu,Rui Sun,Hui Cai,Shuhan Luo,Zhiqiang Zhang
2024-06-06
Abstract:To tackle the problem of domain-specific knowledge scarcity within large language models (LLMs), knowledge graph-retrievalaugmented method has been proven to be an effective and efficient technique for knowledge infusion. However, existing approaches face two primary challenges: knowledge mismatch between public available knowledge graphs and the specific domain of the task at hand, and poor information compliance of LLMs with knowledge graphs. In this paper, we leverage a small set of labeled samples and a large-scale corpus to efficiently construct domain-specific knowledge graphs by an LLM, addressing the issue of knowledge mismatch. Additionally, we propose a three-stage KG-LLM alignment strategyto enhance the LLM's capability to utilize information from knowledge graphs. We conduct experiments with a limited-sample setting on two biomedical question-answering datasets, and the results demonstrate that our approach outperforms existing baselines.
Computation and Language,Artificial Intelligence
What problem does this paper attempt to address?
### The Problem Addressed by This Paper This paper aims to address the issue of knowledge insufficiency in the application of large language models (LLMs) in specific domains, particularly in the medical field. Specifically, the paper identifies the following two main challenges: 1. **Knowledge Mismatch**: Existing methods typically utilize publicly available knowledge graphs for knowledge injection, but these graphs often fail to fully cover the highly specialized knowledge required for specific tasks. 2. **Poor Information Consistency**: The structured triplet form in knowledge graphs differs from natural language text, making it difficult for LLMs to effectively utilize this information, especially in scenarios with scarce supervised samples. To address these issues, the authors propose a modular knowledge injection framework, which mainly includes the following steps: 1. **Efficient Construction of Domain Knowledge Graphs**: Train a knowledge extraction model based on LLMs using a small amount of annotated data, and extract knowledge triplets from a large domain-specific corpus to construct a domain-specific knowledge graph. 2. **Pre-learning Stage**: Use the K-LoRA method to enable the model to understand and process domain knowledge by generating triplets into text. 3. **Supervised Fine-tuning Stage**: Further fine-tune the model by combining the retrieval results from the knowledge graph to improve the quality of its output. 4. **Automatic Knowledge Graph Feedback Stage (AKGF)**: Use the knowledge graph as an automatic evaluation tool to provide feedback on the knowledge correctness of the generated content and further optimize the model. Through these steps, the proposed method significantly improves the performance of LLMs in specific domains, such as biomedical question answering.