Similarity Computing Model of High Dimension Data for Symptom Classification of Chinese Traditional Medicine

Peng Jing, Tang Chang-jie, Yang Dong-qing, Zhang Jing, Hu Jian-jun
DOI: https://doi.org/10.1016/j.asoc.2008.04.005
IF: 8.7
2009-01-01
Applied Soft Computing
Abstract: In recent years, researchers have paid increasing attention to data mining in practical applications. To address the problem of symptom classification in traditional Chinese medicine, this paper proposes a novel computing model that uses the similarities among the attributes of high-dimensional data to compute the similarity between any two tuples. The model treats the data attributes as basis vectors in an m-dimensional space and each tuple as the vector sum of its attribute vectors. Based on a priori concept-similarity information among attributes, it introduces a novel distance algorithm to compute the similarity distance between any pair of attribute vectors. In this method, the similarity between two tuples is expressed through formulas over the attribute vectors and their projections onto one another, so the similarity between any pair of tuples can be obtained by evaluating these vectors and formulas. The paper also presents a novel classification algorithm based on the similarity computing model and successfully applies it to symptom classification in traditional Chinese medicine. Extensive experiments demonstrate the efficiency of the algorithm.
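The abstract does not give the paper's formulas, so the following is only an illustrative sketch of the general idea, not the authors' algorithm. It assumes the attribute vectors are unit vectors whose pairwise inner products are given by a prior similarity matrix (here the hypothetical `SIM`), so the inner product of two tuple vectors becomes a sum over attribute pairs weighted by that matrix, and tuple similarity is the resulting cosine. The function names, the matrix values, the class labels, and the nearest-neighbour classifier are all assumptions introduced for illustration.

```python
import math

# Hypothetical prior similarity between three attributes (symmetric, 1.0 on
# the diagonal). Attributes 0 and 1 are assumed to be closely related.
SIM = [
    [1.0, 0.8, 0.0],
    [0.8, 1.0, 0.0],
    [0.0, 0.0, 1.0],
]

def gram_inner(x, y, sim):
    """Inner product of two tuple vectors whose attribute axes are not orthogonal.

    x and y hold one weight per attribute; sim[i][j] stands in for the inner
    product of attribute vectors i and j (an assumption of this sketch).
    """
    return sum(x[i] * sim[i][j] * y[j]
               for i in range(len(x)) for j in range(len(y)))

def tuple_similarity(x, y, sim):
    """Cosine of the angle between two tuple vectors under the prior similarity."""
    denom = math.sqrt(gram_inner(x, x, sim) * gram_inner(y, y, sim))
    if denom == 0:
        return 0.0  # at least one tuple has no active attributes
    return gram_inner(x, y, sim) / denom

def classify(query, labelled, sim):
    """Assign the label of the most similar labelled tuple (1-nearest neighbour)."""
    best_tuple, best_label = max(
        labelled, key=lambda item: tuple_similarity(query, item[0], sim))
    return best_label

# Two tuples that share no attribute still score as similar when their
# attributes are related; a plain cosine would give 0 here.
labelled = [([0, 1, 0], "class A"), ([0, 0, 1], "class B")]
print(tuple_similarity([1, 0, 0], [0, 1, 0], SIM))
print(classify([1, 0, 0], labelled, SIM))
```

Under these assumptions the first call reflects the 0.8 similarity between attributes 0 and 1, which is why the query is assigned to the first labelled tuple's class rather than the second.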