Research on Building Methods of Hierarchical Structure in Text Classification

Yun Bo Xiong
DOI: https://doi.org/10.4028/www.scientific.net/aef.6-7.742
2012-01-01
Advanced Engineering Forum
Abstract: Semantic hierarchical relationships always exist among the categories in text classification, so organizing documents according to a hierarchical structure is unavoidable. Starting from the confusion matrix, this paper applies two algorithms, hierarchical clustering and confusion category, to build a hierarchical structure of document categories, and then runs hierarchical classification experiments on the resulting structure. The results show that the confusion category strategy outperforms the hierarchical clustering strategy, and that both recall and precision improve relative to flat classification.
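The abstract does not spell out either algorithm, so the following is only an illustrative sketch of the general idea of deriving category groups from a confusion matrix. It assumes a square matrix of counts where entry [i][j] is the number of documents of category i that a flat classifier assigned to category j. The function name `confusion_groups`, the symmetric confusion rate, the threshold value, and the sample matrix are all assumptions made for illustration and are not taken from the paper.

```python
def confusion_groups(confusion, threshold):
    """Group categories whose mutual confusion rate reaches `threshold`.

    Hypothetical sketch: categories that a flat classifier often mixes up
    are merged under a common parent node of the hierarchy.
    """
    n = len(confusion)
    parent = list(range(n))  # union-find forest, one node per category

    def find(i):
        # Follow parent links to the root, compressing the path on the way.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    row_totals = [sum(row) for row in confusion]
    for i in range(n):
        for j in range(i + 1, n):
            total = row_totals[i] + row_totals[j]
            # Share of documents of i and j that were assigned to the other one.
            rate = (confusion[i][j] + confusion[j][i]) / total if total else 0.0
            if rate >= threshold:
                parent[find(i)] = find(j)

    groups = {}
    for i in range(n):
        groups.setdefault(find(i), []).append(i)
    return sorted(groups.values())


# Made-up confusion matrix for four categories (rows: true, columns: predicted).
confusion = [
    [80, 15, 5, 0],
    [20, 75, 5, 0],
    [2, 3, 85, 10],
    [0, 2, 18, 80],
]
print(confusion_groups(confusion, threshold=0.1))
```

Each returned group would become one internal node of a two-level hierarchy, with a top-level classifier choosing the group and a second classifier separating the categories inside it. Whether the paper's confusion category strategy works this way is not stated in the abstract.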