Retrieval-Augmented Language Model for Extreme Multi-Label Knowledge Graph Link Prediction

Yu-Hsiang Lin, Huang-Ting Shieh, Chih-Yu Liu, Kuang-Ting Lee, Hsiao-Cheng Chang, Jing-Lun Yang, Yu-Sheng Lin
2024-05-21
Abstract: Extrapolation in large language models (LLMs) for open-ended inquiry encounters two pivotal issues: (1) hallucination and (2) expensive training costs. These issues present challenges for LLMs in specialized domains and personalized data, requiring truthful responses and low fine-tuning costs. Existing works attempt to tackle the problem by augmenting the input of a smaller language model with information from a knowledge graph (KG). However, they have two limitations: (1) failing to extract relevant information from a large one-hop neighborhood in KG and (2) applying the same augmentation strategy for KGs with different characteristics that may result in low performance. Moreover, open-ended inquiry typically yields multiple responses, further complicating extrapolation. We propose a new task, the extreme multi-label KG link prediction task, to enable a model to perform extrapolation with multiple responses using structured real-world knowledge. Our retriever identifies relevant one-hop neighbors by considering entity, relation, and textual data together. Our experiments demonstrate that (1) KGs with different characteristics require different augmenting strategies, and (2) augmenting the language model's input with textual data improves task performance significantly. By incorporating the retrieval-augmented framework with KG, our framework, with a small parameter size, is able to extrapolate based on a given KG. The code can be obtained on GitHub:
Computation and Language, Artificial Intelligence
What problem does this paper attempt to address?
The paper addresses two core issues that large language models (LLMs) face in open-ended queries: hallucination and high training cost. Both are especially pronounced in specialized domains and in applications involving personalized data, which require truthful responses and low fine-tuning costs. Existing work tries to mitigate these problems by augmenting the input of a smaller language model with knowledge graph (KG) information, but it has two limitations: it fails to extract relevant information from a large one-hop neighborhood in the KG, and it applies the same augmentation strategy to KGs with different characteristics, which can lead to poor performance.

The paper introduces a new task, extreme multi-label knowledge graph link prediction, which aims to enable a model to extrapolate with multiple responses using structured real-world knowledge. The authors design a retrieval-augmented framework whose retriever identifies relevant one-hop neighbors by jointly considering entities, relations, and textual data. Experiments show that KGs with different characteristics require different augmentation strategies, and that adding textual data to the language model's input significantly improves task performance. By combining the retrieval-augmented framework with a KG, a model with a small number of parameters can extrapolate based on the given KG.

Specifically, the paper addresses the following key challenges:
1. Handling nodes whose one-hop neighborhood in the KG may be very large, so that the model still extracts the most relevant information.
2. Adopting different augmentation strategies for KGs with different characteristics to optimize model performance.
3. Handling open-ended queries that typically yield multiple responses, which makes extrapolation more complex.
4. Fitting as many triples as possible into each fixed-length raw text input by truncating descriptions without changing their original meaning.
5. Mitigating the low precision that can occur in a high-dimensional classification layer optimized with binary cross-entropy loss, by proposing a new loss function and training strategy.

According to the reported experiments, the proposed framework significantly improves the Precision at k metric over other models on large knowledge graphs such as WN18RR and FB15k-237. A series of ablation experiments supports the importance of descriptive data, the SimCSE model, and the filtered 1-hop neighbor selection strategy for model performance. Lastly, the paper examines different values of the positive coefficient α in the loss function and finds that setting it appropriately at different stages is crucial for balancing the model's recall and precision.
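
To make the Precision at k metric and the role of a positive coefficient α more concrete, here is a minimal Python sketch. It is not the paper's implementation. `precision_at_k` follows the standard definition of the metric. `weighted_bce` is only an assumed form of a binary cross-entropy loss in which the positive-label term is scaled by α; the paper's actual loss function and training schedule may differ. All function and parameter names are hypothetical.

```python
import math
from typing import Sequence, Set


def precision_at_k(scores: Sequence[float], relevant: Set[int], k: int) -> float:
    """Fraction of the k highest-scored labels that are true labels."""
    if k <= 0:
        raise ValueError("k must be positive")
    # Rank label indices by descending score; ties keep index order (stable sort).
    ranked = sorted(range(len(scores)), key=lambda i: -scores[i])
    hits = sum(1 for i in ranked[:k] if i in relevant)
    return hits / k


def weighted_bce(
    probs: Sequence[float],
    targets: Sequence[int],
    alpha: float = 1.0,
    eps: float = 1e-12,
) -> float:
    """Mean binary cross-entropy with the positive term scaled by alpha.

    Assumed form, for illustration only:
        -(alpha * y * log(p) + (1 - y) * log(1 - p))
    """
    total = 0.0
    for p, y in zip(probs, targets):
        # Clamp probabilities so log() never receives 0.
        p = min(max(p, eps), 1.0 - eps)
        total += -(alpha * y * math.log(p) + (1 - y) * math.log(1.0 - p))
    return total / len(probs)


if __name__ == "__main__":
    # Toy example: four candidate tail entities, two of which are correct.
    scores = [0.9, 0.1, 0.8, 0.3]
    relevant = {0, 3}
    for k in (1, 2, 3):
        print(f"P@{k} = {precision_at_k(scores, relevant, k):.3f}")
    print(weighted_bce([0.5, 0.5], [1, 0], alpha=2.0))
```

Under this assumed form, a larger α penalizes missed positive labels more heavily, which tends to favor recall, while a smaller α shifts the balance toward precision. This matches the trade-off described above only in spirit, not in detail.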