Tracking Logical Difference in Industrial-Scale Ontologies.
Yizheng Zhao,Ghadah Alghamdi,Renate A. Schmidt,Hao Feng,Giorgos Stoilos,Damir Juric,Mohammad Khodadadi
2019-01-01
Abstract:This paper considers the problem of tracking the changes between different versions of an ontology, which can be done based on the notion of logical difference. The logical difference between two ontologies are the axioms in one that are not entailed by the other, reflecting information gained and information lost between the ontologies [5]. For ontology engineers, tracking the logical difference in industrial ontologies is a critical task for the purpose of maintaining them: to track what has changed in a new version of an ontology, to ensure that the extension is safe in the sense of being a conservative extension [9], and to identify information gained and information lost [4]. This provides a means of discovering issues in the ontologies and supports quality assurance reviews. Being able to compute the logical difference between two ontologies is also important when merging/aligning ontologies and integrating different ontologies [3,12]. Presently available ontology difference and alignment/merging tools compute the structural difference between ontologies [11,1], which is a syntactic notion, or approximate the logical difference between ontologies [5,3,7,12], which is a semantic notion. A particular challenge is the size of ontologies used in realworld applications. Our targets are the Snomed CT and NCIt ontologies. Being the most comprehensive, multi-lingual medical ontology in the world, the core Snomed CT ontology contains over 335K concepts definitions [13]. The NCIt ontology defines terminologies in the biomedical domain and contains more than 60K concept definitions [2]. This paper explores how the logical difference between two ontologies can be computed using a forgettingor uniform interpolation-based approach, as proposed in [8]. Forgetting is an ontology re-engineering technique which preserves the semantics of definitions of concepts and relationships among them [6,10,14]. Current forgetting tools have difficulties in tracking the logical difference in very large ontologies. To address this shortcoming we introduce a new forgetting method designed for the task of tracking the logical difference in large-scale ALContologies. The method is sound and terminating, and can compute forgetting solutions for ontologies as large as Snomed CT and NCIt. Our evaluation shows