Classification algorithm based on near neighbor interval of dimensional samples

Li Juan, Wang Yuping
DOI: https://doi.org/10.13245/j.hust.2012.12.011
2012-01-01
Abstract: An improved nearest neighbor classification algorithm is proposed to address shortcomings of the traditional KNN (K-nearest neighbor) algorithm. First, a class-dimensional sample storage model is adopted, which breaks up the integrity of each sample and converts the storage model of the training sample set. Second, the class-dimensional neighbor interval is found and the class-dimensional similarity is computed to obtain the class similarity of an unlabeled sample. Finally, the unlabeled sample is assigned the label of the class with the highest class similarity. The algorithm improves classification efficiency, reduces the influence of isolated samples and noise, handles both continuous and categorical attributes effectively, and adapts to various sample distributions, which broadens its range of applications. Experimental results on several UCI (University of California, Irvine) datasets show that the algorithm achieves higher classification accuracy and shorter running time than the traditional KNN algorithm and the neighborhood counting algorithm.
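The three steps in the abstract can be illustrated with a minimal Python sketch. The abstract does not give the paper's formulas, so everything concrete here is an assumption made for illustration: the function names, the choice of the k closest stored values as the "neighbor interval", the similarity 1 / (1 + mean distance), and averaging over dimensions to get the class similarity. Only continuous attributes are handled. This is not the authors' implementation.

```python
from bisect import bisect_left
from collections import defaultdict


def build_class_dim_store(samples, labels):
    """Store training values per class and per dimension, sorted.

    Illustrative reading of the 'class-dimensional storage model':
    whole samples are not kept, only per-class lists of feature values.
    """
    store = defaultdict(lambda: defaultdict(list))
    for x, y in zip(samples, labels):
        for d, v in enumerate(x):
            store[y][d].append(v)
    for dims in store.values():
        for values in dims.values():
            values.sort()
    return store


def nearest_values(sorted_vals, q, k):
    """Return up to k values closest to q from a sorted list.

    Assumption: this window plays the role of the 'neighbor interval'.
    """
    lo = hi = bisect_left(sorted_vals, q)  # window [lo, hi) grows outward
    n = len(sorted_vals)
    while hi - lo < k and (lo > 0 or hi < n):
        take_left = lo > 0 and (
            hi == n or q - sorted_vals[lo - 1] <= sorted_vals[hi] - q
        )
        if take_left:
            lo -= 1
        else:
            hi += 1
    return sorted_vals[lo:hi]


def dim_similarity(sorted_vals, q, k):
    """Assumed similarity: 1 / (1 + mean distance to the neighbor interval)."""
    near = nearest_values(sorted_vals, q, k)
    mean_dist = sum(abs(v - q) for v in near) / len(near)
    return 1.0 / (1.0 + mean_dist)


def classify(store, x, k=3):
    """Return the class whose average per-dimension similarity is highest."""
    scores = {}
    for label, dims in store.items():
        sims = [dim_similarity(dims[d], x[d], k) for d in range(len(x))]
        scores[label] = sum(sims) / len(sims)  # assumed aggregation: mean
    return max(scores, key=scores.get)


if __name__ == "__main__":
    samples = [(1.0, 1.0), (1.2, 0.9), (0.8, 1.1),
               (5.0, 5.0), (5.2, 4.9), (4.8, 5.1)]
    labels = ["a", "a", "a", "b", "b", "b"]
    store = build_class_dim_store(samples, labels)
    print(classify(store, (1.1, 1.0)))
```

Because each per-class, per-dimension list is sorted once, a query needs only a binary search and a small outward scan per list rather than a distance computation against every training sample, which is one plausible source of the efficiency gain the abstract reports.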