Abstract:Knowledge-based question answering (KBQA) is a key task in NLP research, and also an approach to access the web data and knowledge, which requires exploiting knowledge graphs (KGs) for reasoning. In the literature, one promising solution for KBQA is to incorporate the pretrained language model (LM) with KGs by generating KG-centered pretraining corpus, which has shown its superiority. However, these methods often depend on specific techniques and resources to work, which may not always be available and restrict its application. Moreover, existing methods focus more on improving language understanding with KGs, while neglect the more important human-like complex reasoning. To this end, in this paper, we propose a general Knowledge-Injected Curriculum Pretraining framework (KICP) to achieve comprehensive KG learning and exploitation for KBQA tasks, which is composed of knowledge injection (KI), knowledge adaptation (KA) and curriculum reasoning (CR). Specifically, the KI module first injects knowledge into the LM by generating KG-centered pretraining corpus, and generalizes the process into three key steps that could work with different implementations for flexible application. Next, the KA module learns knowledge from the generated corpus with LM equipped with an adapter as well as keeps its original natural language understanding ability to reduce the negative impacts of the difference between the generated and natural corpus. Last, to enable the LM with complex reasoning, the CR module follows human reasoning patterns to construct three corpora with increasing difficulties of reasoning, and further trains the LM from easy to hard in a curriculum manner. We provide an implementation of the general framework, and evaluate the proposed KICP on four real-word datasets. The results demonstrate that our framework can achieve higher performances.

Tabular reasoning via two-stage knowledge injection

A Knowledge-Injected Curriculum Pretraining Framework for Question Answering

Seek and Solve Reasoning for Table Question Answering

Incorporating External Knowledge to Enhance Tabular Reasoning

Unveiling Implicit Table Knowledge with Question-Then-Pinpoint Reasoner for Insightful Table Summarization

A Confidence-Based Knowledge Integration Framework for Cross-Domain Table Question Answering

KET-QA: A Dataset for Knowledge Enhanced Table Question Answering

Chain-of-Table: Evolving Tables in the Reasoning Chain for Table Understanding

CRT-QA: A Dataset of Complex Reasoning Question Answering over Tabular Data

TabSQLify: Enhancing Reasoning Capabilities of LLMs Through Table Decomposition

Tab-CoT: Zero-shot Tabular Chain of Thought

Knowledge-Driven CoT: Exploring Faithful Reasoning in LLMs for Knowledge-intensive Question Answering

Augment before You Try: Knowledge-Enhanced Table Question Answering via Table Expansion

Improving Complex Reasoning over Knowledge Graph with Logic-Aware Curriculum Tuning

TIARA: Multi-grained Retrieval for Robust Question Answering over Large Knowledge Base

Injecting Numerical Reasoning Skills into Knowledge Base Question Answering Models

Knowledge-Aware Reasoning over Multimodal Semi-structured Tables

TIARA: Multi-grained Retrieval for Robust Question Answering over Large Knowledge Bases

CIKQA: Learning Commonsense Inference with a Unified Knowledge-in-the-loop QA Paradigm

Probabilistic Tree-of-thought Reasoning for Answering Knowledge-intensive Complex Questions

Improving Interpretability of Deep Sequential Knowledge Tracing Models with Question-centric Cognitive Representations