Abstract: Knowledge-based question answering (KBQA) is a key task in NLP research and a way to access web data and knowledge; it requires exploiting knowledge graphs (KGs) for reasoning. In the literature, one promising solution for KBQA is to combine pretrained language models (LMs) with KGs by generating a KG-centered pretraining corpus, an approach that has shown its superiority. However, these methods often depend on specific techniques and resources that may not always be available, which restricts their application. Moreover, existing methods focus on improving language understanding with KGs while neglecting the more important human-like complex reasoning. To this end, we propose a general Knowledge-Injected Curriculum Pretraining framework (KICP) to achieve comprehensive KG learning and exploitation for KBQA tasks. KICP is composed of knowledge injection (KI), knowledge adaptation (KA), and curriculum reasoning (CR). Specifically, the KI module first injects knowledge into the LM by generating a KG-centered pretraining corpus, and generalizes the process into three key steps that can work with different implementations for flexible application. Next, the KA module learns knowledge from the generated corpus using an LM equipped with an adapter, while preserving the LM's original natural language understanding ability, to reduce the negative impact of the differences between the generated and natural corpora. Last, to equip the LM with complex reasoning ability, the CR module follows human reasoning patterns to construct three corpora of increasing reasoning difficulty, and further trains the LM from easy to hard in a curriculum manner. We provide an implementation of the general framework and evaluate the proposed KICP on four real-world datasets. The results demonstrate that our framework achieves higher performance.
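The easy-to-hard idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the template-based verbalizer, the three difficulty levels (`single_fact`, `two_hop`, `two_hop_masked`), and the `train_step` callback are all hypothetical stand-ins chosen for illustration, since the abstract does not specify how the corpora are built.

```python
from typing import Callable

Triple = tuple[str, str, str]  # (head, relation, tail)


def verbalize(triple: Triple) -> str:
    """Turn one KG triple into a sentence with a naive template (assumption)."""
    head, relation, tail = triple
    return f"{head} {relation.replace('_', ' ')} {tail}."


def two_hop(triples: list[Triple], mask_bridge: bool = False) -> list[str]:
    """Chain triples that share a bridge entity into a two-hop sentence.

    With mask_bridge=True the bridge entity is hidden, a hypothetical way
    to make the example harder.
    """
    sentences = []
    for h1, r1, t1 in triples:
        for h2, r2, t2 in triples:
            if t1 == h2:
                bridge = "[MASK]" if mask_bridge else t1
                sentences.append(
                    f"{h1} {r1.replace('_', ' ')} {bridge}, "
                    f"which {r2.replace('_', ' ')} {t2}."
                )
    return sentences


def run_curriculum(
    stages: list[tuple[str, list[str]]],
    train_step: Callable[[str, str], None],
) -> None:
    """Visit stages in the given order, assumed to be easy to hard."""
    for stage_name, corpus in stages:
        for sentence in corpus:
            train_step(stage_name, sentence)


triples = [
    ("Paris", "is_capital_of", "France"),
    ("France", "is_located_in", "Europe"),
]
stages = [
    ("single_fact", [verbalize(t) for t in triples]),
    ("two_hop", two_hop(triples)),
    ("two_hop_masked", two_hop(triples, mask_bridge=True)),
]

# Stand-in for a real pretraining step: just record what was seen, in order.
seen: list[tuple[str, str]] = []
run_curriculum(stages, lambda stage, sentence: seen.append((stage, sentence)))
```

In a real setting, `train_step` would run a pretraining update on an adapter-equipped LM; here it only records the order in which the sentences are presented.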

A Knowledge-Injected Curriculum Pretraining Framework for Question Answering