A Hybrid Algorithm of Automatic Domain Concept Taxonomy Construction

Nianjie LUO,Zhao Lü
DOI: https://doi.org/10.3969/j.issn.1000-3428.2014.12.010
2015-01-01
Abstract:Domain concept taxonomy automatic construction plays an important role in artificial intelligence,natural language processing and information retrieval. Existing approaches pay more attention on common knowledge, while there are fewer reports about domain concepts. Two main challenges of domain concept taxonomy automatic construction are identifying relationships between concepts and less efficiency of current algorithms. In this paper,a Hybrid algorithm of Automatic Domain concept Taxonomy construction(HADT) is proposed,which has two main modules:extracting relationships between domain concepts and automatic taxonomy construction. Considering Chinese characteristics,the first module uses syntax tree method and rule-based method together,to get the aim of higher precision and higher recall. The second module uses an improved BRT algorithm to reduce time complexity and to improve taxonomy construction precision. The experiments conducted on three datasets of mobile,financial and computer show the HADT algorithm is effectiveness compared with the BRT algorithm,and the highest precision rate is 89. 3%.
What problem does this paper attempt to address?