Semantic Simple Vector Distance Classification Based on Ontology

He Yuanjiao,Zhang Guoying
DOI: https://doi.org/10.3969/j.issn.1008-2565.2007.03.004
2007-01-01
Abstract:The feature selection of traditional simple vector distance ignores the semantic difference of vocabulary on different abstract levels.Aimed at this problem,this paper proposed semantic simple vector distance classification based on ontology.It efficiently incorporates linguistic knowledge into text vector space representation with the support of ontology and further discover the deep-seated semantic relations among concepts of feature vector.Then those semantic feature vectors are used as final text feature vectors.At the same time,this approach defines how to calculate the semantic similarity of different abstract levels based on domain ontologies,and then the semantic similarity is used to improve the traditional simple vector distance method.Experiments on corpus CWT20G show that ontology semantic simple vector distance algorithm distinguishs better for synonym,polysemy and hyponymy.The accuracy rate of classification is gradually improved along with more and more in-depth semantic analysis.
What problem does this paper attempt to address?