User Generated Content Oriented Chinese Taxonomy Construction

Jinyang Li,Chengyu Wang,Xiaofeng He,Rong Zhang,Ming Gao
DOI: https://doi.org/10.1007/978-3-319-25255-1_51
2015-01-01
Abstract:The taxonomy is one of the basic components in knowledge graphs as it establishes types of classes and semantic relations among the classes. Taxonomies are normally constructed either manually, or by language-dependent rules or patterns for type and relation extraction or inference. Existing work on building taxonomies for knowledge graphs is mostly in English language environment. In this paper, we propose a novel approach for large-scale Chinese taxonomy construction based on user generated content. We take Chinese Wikipedia as the data source, develop methods to extract classes and their relations mined from user tagged categories, and build up the taxonomy using a bottom-up strategy. The algorithms can be easily applied to other Wiki-style data sources. The experiments show that the constructed Chinese taxonomy achieves better results in both quality and quantity.
What problem does this paper attempt to address?