Merge Information in HowNet and TongYiCi CiLin

梅立军,周强,臧路,陈祖舜
DOI: https://doi.org/10.3969/j.issn.1003-0077.2005.01.010
2005-01-01
Abstract:In this paper,we study the problem of merging information in HowNet and a Chinese thesaurus — TongYiCi CiLin. In order to integrate both the conception descriptions of words in HowNet and the semantic categories of words in TongYiCi CiLin,we propose several useful merging strategies: Firstly,we establish a DEF description for each SynSet in TongYiCi CiLin,which is similar with the word sense definition in HowNet.Then,we make bidirectional link for the words which have only one sense in both dictionaries.Finally we make bidirectional link for other words with multiple senses by using a classification algorithm based on salient frequency and vector distance of two sense descriptions.Experimental result shows that these merging strategies are effective and the merging accuracy is about 93%.The merged results form a new dictionary,which not only has semantic category of TongYi CiLin,but also has conception description of HowNet.
What problem does this paper attempt to address?