Hierarchical relation mining of Chinese text based on mixed cosine similarity

Yangyi Dong, Weihua Li, Hui Yu
DOI: https://doi.org/10.3969/j.issn.1001-3695.2017.05.029
2017-01-01
Abstract: Hierarchical relations are among the most important relations between concepts in Chinese text. Determining them correctly is fundamental to automatic domain ontology construction, text data mining, and related tasks. First, the paper enumerates the possible candidate hierarchical relations and constructs a kernel function classifier based on the semantic cosine similarity of part-of-speech semantic sequences and relation words, so that the mining problem is transformed into a classification problem. Then the classifier is trained on manually built templates. Finally, preprocessed Chinese text is fed to the kernel function classifier, which determines which of the candidate hierarchical relations hold. Experiments using Chinese text from the domain of Air Force weapons and equipment as test data show that the method is simple and reliable, with good accuracy and recall.
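To make the idea of a "mixed" cosine similarity concrete, the following is a minimal sketch and not the authors' implementation. The abstract does not give the kernel's form, so the sketch assumes a weighted sum of two cosine similarities (one over part-of-speech tags, one over relation words) and replaces the kernel classifier with a simple nearest-template rule. The names `mixed_similarity`, `classify`, `alpha`, `threshold`, and the sample templates are all hypothetical.

```python
from collections import Counter
from math import sqrt


def cosine(a, b):
    """Cosine similarity between two token lists, treated as count vectors."""
    ca, cb = Counter(a), Counter(b)
    dot = sum(ca[t] * cb[t] for t in ca)
    norm = sqrt(sum(v * v for v in ca.values()) * sum(v * v for v in cb.values()))
    return dot / norm if norm else 0.0


def mixed_similarity(x, y, alpha=0.5):
    """Assumed mixture: alpha weights the POS-sequence term, (1 - alpha) the relation-word term."""
    return alpha * cosine(x["pos"], y["pos"]) + (1 - alpha) * cosine(x["rel"], y["rel"])


def classify(candidate, templates, alpha=0.5, threshold=0.5):
    """Nearest-template rule (a stand-in for the paper's kernel classifier)."""
    best = max(templates, key=lambda t: mixed_similarity(candidate, t, alpha))
    score = mixed_similarity(candidate, best, alpha)
    label = best["label"] if score >= threshold else "none"
    return label, score


# Hypothetical hand-built templates: POS tags plus the relation word between two concepts.
templates = [
    {"pos": ["n", "v", "n"], "rel": ["包括"], "label": "hierarchical"},
    {"pos": ["n", "p", "n", "v"], "rel": ["位于"], "label": "non-hierarchical"},
]

candidate = {"pos": ["n", "v", "n"], "rel": ["包括"]}
label, score = classify(candidate, templates)
print(label, round(score, 3))
```

In this sketch a candidate that matches no template closely enough falls below `threshold` and is labeled `"none"`; how the paper handles such cases is not stated in the abstract.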