Abstract:Knowledge graphs (KGs) are collections of real-world knowledge that is represented by a structured form of triples. Since they are manually built in their nascent stage, there is a common problem that some links (triples) are missing. Knowledge graph completion (KGC) aims to find those missing links and thereby complete the KGs. However, as knowledge increases through diverse sources, new entities have explosively emerged and they are needed to be connected to existing KGs. Thus, open-world KGC is targeted on extending KGs to those new entities. Dealing with those new entities is challenging because they do not have any connection with entities in the existing KGs. One way to handle the new ones is to embed them with their textual descriptions with pre-trained word embeddings and score them in the graph-vector space with the existing typical KGC models. These models have resulted in meaningful results but there is still a lack of studies on utilizing the latest neural networks, such as pre-trained language models which are known to be better at capturing contexts than pre-trained word embeddings. This paper proposes a novel model that effectively connects new entities and existing KGs through a pre-trained language model. To effectively handle the problem, we utilize two learning methods; one is the classification method of the masked language model (MLM) that predicts a word among a huge vocabulary set with a given context, and the other is multi-task learning based on the Multi-Task for Deep Neural Networks (MT-DNN). Based on the methods, the model first generates an embedding of a new entity using its textual description and then uses the embedding to find one of the existing entities from a KG where the new entity can be connected. The experimental results on three benchmark datasets, DBPedia50k, FB15k-237-OWE, and FB20k, show that the proposed model improves performances by 9.2%p , 4.4%p , and 11.1%p , respectively, and achieves new state-of-the-art performance for all datasets.

Graph Structure Enhanced Pre-Training Language Model for Knowledge Graph Completion

Mixed Geometry Message and Trainable Convolutional Attention Network for Knowledge Graph Completion

MEGA: Meta-Graph Augmented Pre-Training Model for Knowledge Graph Completion

Subgraph-Aware Training of Language Models for Knowledge Graph Completion Using Structure-Aware Contrastive Learning

Structure Pre-training and Prompt Tuning for Knowledge Graph Transfer

KICGPT: Large Language Model with Knowledge in Context for Knowledge Graph Completion

Hierarchical Perceptual Graph Attention Network for Knowledge Graph Completion

Do Pre-trained Models Benefit Knowledge Graph Completion? A Reliable Evaluation and a Reasonable Approach.

Knowledge graph extension with a pre-trained language model via unified learning method

Improving Knowledge Graph Completion with Structure-Aware Supervised Contrastive Learning

Knowledge Graph Completion Method of Combining Structural Information with Semantic Information

Simple knowledge graph completion model based on PU learning and prompt learning

Making Large Language Models Perform Better in Knowledge Graph Completion

Prompting Disentangled Embeddings for Knowledge Graph Completion with Pre-trained Language Model

Multi-perspective Improvement of Knowledge Graph Completion with Large Language Models

KERMIT: Knowledge Graph Completion of Enhanced Relation Modeling with Inverse Transformation

Improving Knowledge Graph Representation Learning by Structure Contextual Pre-training

Multi-level Shared Knowledge Guided Learning for Knowledge Graph Completion

Unifying Structure and Language Semantic for Efficient Contrastive Knowledge Graph Completion with Structured Entity Anchors