New nearest neighbor affinity similarity function based on separation and compactness between samples

LI Juan,WANG Yuping
DOI: https://doi.org/10.3969/j.issn.1001-2400.2014.03.018
2014-01-01
Abstract:Traditional distance and similarity measurements did not take into account the influence of the individual sample on the whole sample set . To deal with this issue , a new similarity improvement strategy of k-nearest neighbor algorithm ( KNN) is proposed in the paper . First , a new affinity distance function is introduced , which focuses on the separation and compactness between each individual sample and the whole sample set . Second , a new similarity function using this affinity distance function is proposed and taken as the similarity measure function in the KNN . Third , a theoretical analysis of and experiments on eighteen numerical UCI ( University of California Irvine) datasets are made to compare the affinity similarity function proposed in this paper with classical distance or similarity functions through 5-fold partitioning cross-validations . Finally , classification results indicate that the proposed affinity similarity function is not only an effective similarity strategy for classification , but can reduce the classification time for large -scale data sets by combining efficient indexing algorithms .
What problem does this paper attempt to address?