Knowledge Graph Extension by Entity Type Recognition

Daqian Shi
2024-05-04
Abstract:Knowledge graphs have emerged as a sophisticated advancement and refinement of semantic networks, and their deployment is one of the critical methodologies in contemporary artificial intelligence. The construction of knowledge graphs is a multifaceted process involving various techniques, where researchers aim to extract the knowledge from existing resources for the construction since building from scratch entails significant labor and time costs. However, due to the pervasive issue of heterogeneity, the description diversity across different knowledge graphs can lead to mismatches between concepts, thereby impacting the efficacy of knowledge extraction. This Ph.D. study focuses on automatic knowledge graph extension, i.e., properly extending the reference knowledge graph by extracting and integrating concepts from one or more candidate knowledge graphs. We propose a novel knowledge graph extension framework based on entity type recognition. The framework aims to achieve high-quality knowledge extraction by aligning the schemas and entities across different knowledge graphs, thereby enhancing the performance of the extension. This paper elucidates three major contributions: (i) we propose an entity type recognition method exploiting machine learning and property-based similarities to enhance knowledge extraction; (ii) we introduce a set of assessment metrics to validate the quality of the extended knowledge graphs; (iii) we develop a platform for knowledge graph acquisition, management, and extension to benefit knowledge engineers practically. Our evaluation comprehensively demonstrated the feasibility and effectiveness of the proposed extension framework and its functionalities through quantitative experiments and case studies.
Artificial Intelligence
What problem does this paper attempt to address?
The paper primarily focuses on addressing key technical challenges in knowledge graph expansion, particularly on how to effectively extract and integrate concepts from different knowledge graphs to expand a reference knowledge graph. Specifically, the paper addresses the following key issues: 1. **Need for a standardized framework**: To achieve automated knowledge graph expansion tasks, a standardized framework is required to organize the work at various stages. 2. **Overcoming description diversity**: Due to the semantic heterogeneity leading to description diversity, it is necessary to precisely align concepts from different knowledge graphs. 3. **Evaluation methods**: To verify the effectiveness of the expanded knowledge graph, appropriate methods are needed for evaluation. 4. **Unified platform**: A unified platform is needed to customize knowledge graph expansion services and ensure a consistent processing workflow. To address the above issues, the paper proposes the following main contributions: - Proposes a knowledge graph expansion framework based on entity type recognition, which includes four stages: data preparation, entity type recognition, knowledge graph expansion, and performance evaluation. - Designs a machine learning-driven entity type recognition method, utilizing various machine learning models and attribute-based similarity measures to improve knowledge extraction performance. - Introduces a set of evaluation metrics to measure the quality of the expanded knowledge graph through classification objectives. - Develops an online platform named LiveSchema, which integrates functions for knowledge acquisition, management, and expansion services, aiming to facilitate practical applications for knowledge engineers. In summary, this doctoral dissertation focuses on constructing an automated knowledge graph expansion framework. By introducing advanced entity type recognition techniques and evaluation methods, as well as developing a practical online platform, it provides an effective solution for the expansion of knowledge graphs.