Large Knowledge Model: Perspectives and Challenges

Huajun Chen
DOI: https://doi.org/10.3724/2096-7004.di.2024.0001
2024-06-27
Abstract:Humankind's understanding of the world is fundamentally linked to our perception and cognition, with \emph{human languages} serving as one of the major carriers of \emph{world knowledge}. In this vein, \emph{Large Language Models} (LLMs) like ChatGPT epitomize the pre-training of extensive, sequence-based world knowledge into neural networks, facilitating the processing and manipulation of this knowledge in a parametric space. This article explores large models through the lens of "knowledge". We initially investigate the role of symbolic knowledge such as Knowledge Graphs (KGs) in enhancing LLMs, covering aspects like knowledge-augmented language model, structure-inducing pre-training, knowledgeable prompts, structured CoT, knowledge editing, semantic tools for LLM and knowledgeable AI agents. Subsequently, we examine how LLMs can boost traditional symbolic knowledge bases, encompassing aspects like using LLM as KG builder and controller, structured knowledge pretraining, and LLM-enhanced symbolic reasoning. Considering the intricate nature of human knowledge, we advocate for the creation of \emph{Large Knowledge Models} (LKM), specifically engineered to manage diversified spectrum of knowledge structures. This promising undertaking would entail several key challenges, such as disentangling knowledge base from language models, cognitive alignment with human knowledge, integration of perception and cognition, and building large commonsense models for interacting with physical world, among others. We finally propose a five-"A" principle to distinguish the concept of LKM.
Artificial Intelligence,Computation and Language
What problem does this paper attempt to address?
The paper primarily explores the concept of Large Knowledge Models (LKMs) and the challenges they face, attempting to address the following core issues: 1. **Enhancing the knowledge representation capabilities of Large Language Models (LLMs)**: By integrating Knowledge Graphs (KGs) and other structured knowledge representation methods to improve LLMs' abilities in reasoning, understanding, and generation. 2. **Enriching traditional symbolic knowledge bases using LLMs**: Investigating how to use LLMs as knowledge graph builders and controllers to enhance existing symbolic knowledge bases. 3. **Proposing the concept of Large Knowledge Models (LKM)**: Given the complexity of human knowledge, proposing the creation of LKMs specifically designed to handle diverse knowledge structures and discussing their key challenges, such as separating the knowledge base from the language model and achieving alignment with human cognition. 4. **Addressing the hallucination problem in LLMs**: Using high-precision and logically clear knowledge graphs as references to detect and correct inaccuracies in the content generated by LLMs. In summary, this paper aims to improve the performance of large models in handling complex tasks by integrating structured knowledge and natural language processing technologies, and to explore methods for constructing more reliable and powerful knowledge representation systems.