Automatic Enrichment of Domain Ontology——A Case Study on ADL Gazetteer

GE Ning,WANG Jun
DOI: https://doi.org/10.3969/j.issn.1002-137x.2007.09.042
2007-01-01
Computer Science
Abstract:The utility of domain ontologies has been increasing in recent years.A critical issue for their wider applica- tions is the automatic enrichment of domain ontology,i.e.to enrich them with new terminologies and relationships to reflect the ever advancing of domain knowledge.Based on the large scale experiment on ADL Gazetteer,one of the big- gest digital gazetteers over the world,a solution for this problem is proposed.ADL Gazetteer is a digitalized worldwide gazetteer developed in the Alexandria Digital Library(ADL)Project,which contains millions of geographic names (place names).The place names are indexed with type terms from the ADL Feature Type Thesaurus(FTT),a hierar- chical category scheme,which is the ontology study case in this research.To discover generic terms from place names and to set up correlation between the extracted generic terms and the corresponding type terms in the FTT,a point- wise-mutual-information motivated method is used to extract frequent words/phrases from the place names,and a hier- archical clustering algorithm is used to identify the generic terms from the extracted and to determine the concepts in the FTT to which the generic terms are correlated.The effectiveness of the experiment reached 82.7%.The proposed approach can be applied upon other similar ontology-indexed corpora,such as dictionaries and catalogs,and served as an assistant tool for end-user search,terminology discovery,ontology enrichment,and interoperation among different ontologies.
What problem does this paper attempt to address?