Convolutional Network Embedding of Text-Enhanced Representation for Knowledge Graph Completion

Feng Zhao, Tao Xu, Langjunqing Jin, Hai Jin
DOI: https://doi.org/10.1109/jiot.2020.3039750
IF: 10.6
2021-12-01
IEEE Internet of Things Journal
Abstract: In recent years, knowledge graph representation learning has prompted extensive research. Machine learning models are used to map entity and relational data in knowledge graphs to vector representations in low-dimensional spaces to predict and analyze potential relationships. Current works mainly focus on the knowledge representation of the triple structure and relationship path in knowledge graphs without fully utilizing external textual information to semantically supplement knowledge representation. However, the existing knowledge inventory, such as that for smart health and emotion care systems, is relatively meager, and structural knowledge is incomplete; therefore, knowledge graph completion is essential. In this article, we propose a novel joint representation learning model that introduces text description information and extracts reliable feature information from text data by using a convolutional neural network (CNN) model. Furthermore, being based on the attention mechanism, the proposed model distinguishes the characteristic credibility of different relationships, enhances the representation of the entity relationship structure vector in the existing knowledge graph, and obtains rich semantic information. Finally, the 2-D convolution operation is used to process the joint representation vectors of entities and relationships to obtain nonlinear features, and the knowledge graph is completed by completing the calculation of the score function of the joint representation vector of the entity and the relationship. Experiments performing tasks, such as link prediction and triple classification, on the FreeBase (FB15k), WordNet (WN18) and Yet Another Great Ontology (YAGO3-10) data sets reveal that our model performs better than the benchmark model and has some degree of scalability.
computer science, information systems; telecommunications; engineering, electrical & electronic
What problem does this paper attempt to address?
The paper primarily focuses on addressing the problem of knowledge graph completion, especially in cases where the knowledge graph information is incomplete and the structure is sparse. By leveraging external textual information, it aims to enhance the representation of entities and relationships within the knowledge graph, thereby improving the ability to predict missing entities or relationships.

Specifically, the paper proposes a novel joint representation learning model called Text-Enhanced Convolutional Representation Learning (TECRL). The main contributions of this model include:

1. **Introduction of Textual Description Information**: The TECRL model uses Convolutional Neural Networks (CNN) to extract reliable feature information from textual data related to knowledge graph entities. It combines this with an attention mechanism to assign different weights to different texts, thereby obtaining text vector representations that are highly discriminative and semantically accurate.
2. **Enhancement of Entity-Relationship Vectors**: The model also proposes a dynamic parameter generation strategy to integrate structural and textual vector representations. By reshaping the joint vectors through 2D convolution operations, it enhances the semantic representation of entity-relationship structural vectors in the existing knowledge graph.
3. **Non-linear Feature Extraction**: Finally, the model processes the joint representation vectors of entities and relationships through 2D convolution operations to extract non-linear features. It completes the knowledge graph by calculating the score function of the joint representation vectors of entities and relationships.

Experimental results show that the TECRL model outperforms benchmark models in link prediction and triple classification tasks on datasets such as Freebase (FB15k), WordNet (WN18), and Yet Another Great Ontology (YAGO3-10), and it also demonstrates a certain degree of scalability.
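The three steps described above can be illustrated with a minimal sketch. It is not the paper's implementation. The summary gives no formulas, so the sketch assumes dot-product attention over text feature vectors, an element-wise sigmoid gate for fusing structural and textual vectors, and a ConvE-style score that stacks the reshaped head and relation vectors before a 2D convolution. All function names, parameters, and shapes here are hypothetical.

```python
import math
import random


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def softmax(xs):
    m = max(xs)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def attend(text_vecs, relation_vec):
    """Assumed attention: weight each text vector by its dot product with the relation."""
    weights = softmax([dot(t, relation_vec) for t in text_vecs])
    dim = len(relation_vec)
    pooled = [sum(w * t[i] for w, t in zip(weights, text_vecs)) for i in range(dim)]
    return pooled, weights


def fuse(struct_vec, text_vec, gate_logits):
    """Assumed fusion: per-dimension gate g, giving g * structure + (1 - g) * text."""
    gates = [1.0 / (1.0 + math.exp(-z)) for z in gate_logits]
    return [g * s + (1.0 - g) * t for g, s, t in zip(gates, struct_vec, text_vec)]


def reshape(vec, rows, cols):
    assert len(vec) == rows * cols
    return [vec[r * cols:(r + 1) * cols] for r in range(rows)]


def conv2d_relu(image, kernel):
    """Valid 2D cross-correlation with a single kernel, followed by ReLU."""
    kh, kw = len(kernel), len(kernel[0])
    out = []
    for r in range(len(image) - kh + 1):
        row = []
        for c in range(len(image[0]) - kw + 1):
            acc = sum(image[r + i][c + j] * kernel[i][j]
                      for i in range(kh) for j in range(kw))
            row.append(max(0.0, acc))
        out.append(row)
    return out


def score(head, relation, tail, kernel, projection, rows, cols):
    """ConvE-style score: stack head and relation, convolve, project, dot with tail."""
    image = reshape(head, rows, cols) + reshape(relation, rows, cols)
    features = [v for row in conv2d_relu(image, kernel) for v in row]
    hidden = [dot(w_row, features) for w_row in projection]
    return dot(hidden, tail)


if __name__ == "__main__":
    random.seed(0)
    dim, rows, cols = 6, 2, 3
    rand_vec = lambda n: [random.uniform(-1, 1) for _ in range(n)]
    relation, tail = rand_vec(dim), rand_vec(dim)
    text_vec, _ = attend([rand_vec(dim) for _ in range(3)], relation)
    head = fuse(rand_vec(dim), text_vec, rand_vec(dim))
    kernel = [rand_vec(2) for _ in range(2)]          # one 2x2 filter
    projection = [rand_vec(6) for _ in range(dim)]    # 3x2 feature map -> dim
    print(score(head, relation, tail, kernel, projection, rows, cols))
```

In this sketch a higher score would mark a candidate triple as more plausible, so link prediction amounts to ranking candidate tails by `score`. A trained model would learn the embeddings, gate parameters, kernel, and projection rather than sampling them at random.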