A Survey of Knowledge Enhanced Pre-trained Language Models

Jian Yang,Xinyu Hu,Gang Xiao,Yulong Shen
DOI: https://doi.org/10.1145/3631392
IF: 1.471
2024-03-01
ACM Transactions on Asian and Low-Resource Language Information Processing
Abstract:Pre-trained language models learn informative word representations on a large-scale text corpus through self-supervised learning, which has achieved promising performance in fields of natural language processing (NLP) after fine-tuning. These models, however, suffer from poor robustness and lack of interpretability. We refer to pre-trained language models with knowledge injection as knowledge-enhanced pre-trained language models (KEPLMs). These models demonstrate deep understanding and logical reasoning and introduce interpretability. In this survey, we provide a comprehensive overview of KEPLMs in NLP. We first discuss the advancements in pre-trained language models and knowledge representation learning. Then we systematically categorize existing KEPLMs from three different perspectives. Finally, we outline some potential directions of KEPLMs for future research.
computer science, artificial intelligence
What problem does this paper attempt to address?