Learning Concept Hierarchies from Scientific Articles for Ontology Construction

Ting Jiang, Jianjun Sun
DOI: https://doi.org/10.3772/j.issn.1000-0135.2017.10.012
2017-01-01
Abstract: Concept hierarchy learning is an important topic in ontology learning. It has been studied mainly in the biomedical field, and low efficiency is the main problem in research on taxonomy construction. This paper proposes a new framework for extracting taxonomies from domain scientific resources. First, in concept learning, concepts in scientific articles are classified into four categories: methods, tasks, resources, and tools. The terms of each category are then extracted using a combination of cascaded conditional random fields, C-value, and lexico-syntactic patterns. Second, constrained by the categories of the terms, hyponym relationships between concepts are extracted by combining lexicon- and Web-based methods. A graph model is then initialized from the extracted relationships, graph-pruning methods are applied, and finally the taxonomies are generated. The proposed methods are verified experimentally on a corpus of scientific articles. We achieved high precision and recall in concept learning, and then extracted taxonomic relationships and generated taxonomies. The experiments establish the feasibility and effectiveness of the proposed methods.
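The final step (initialize a graph from extracted hyponym relationships, prune it, and read off a taxonomy) can be illustrated with a minimal sketch. The abstract does not say which graph-pruning methods the authors use, so the heuristic below is an assumption chosen for illustration: keep only the highest-weight hypernym for each term and drop any edge that would create a cycle. The function name `prune_to_taxonomy`, the edge weights, and the example terms are all hypothetical.

```python
def prune_to_taxonomy(edges):
    """Prune weighted hyponym edges into a forest.

    edges: iterable of (hyponym, hypernym, weight) tuples.
    Returns a dict mapping each hyponym to its single retained hypernym.
    Assumed heuristic, not the paper's method: strongest edge wins,
    and edges that would close a cycle are discarded.
    """
    parent = {}
    # Visit the strongest evidence first so weaker, conflicting edges lose.
    for child, par, _weight in sorted(edges, key=lambda e: -e[2]):
        if child == par or child in parent:
            continue  # self-loop, or the child already has a stronger parent
        # Walk up from the proposed parent; reaching `child` means a cycle.
        node = par
        while node in parent and node != child:
            node = parent[node]
        if node == child:
            continue
        parent[child] = par
    return parent


# Hypothetical relationships within the "methods" category.
edges = [
    ("conditional random field", "sequence labeling model", 0.9),
    ("sequence labeling model", "method", 0.8),
    ("conditional random field", "method", 0.4),  # redundant, weaker edge
    ("method", "conditional random field", 0.1),  # would close a cycle
]
taxonomy = prune_to_taxonomy(edges)
for hyponym, hypernym in taxonomy.items():
    print(f"{hyponym} -> {hypernym}")
```

Restricting the input edges to terms of the same category, as the abstract describes, would happen before this step; the sketch assumes that filtering has already been done.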