Abstract: The k-nearest neighbor (KNN) classification algorithm is widely used in machine learning for its simplicity and efficiency. However, its sensitivity to the value of k, especially with small sample sizes or in the presence of outliers, degrades classification performance. This article introduces a novel KNN-based classifier, LMPHNN (Novel Pseudo Nearest Neighbor Classification Method Using Local Harmonic Mean Distance), which builds on the LMPNN rule and the harmonic mean distance (HMD) to improve classification performance. The classifier first identifies the k nearest neighbors of the query sample in each class and generates distinct local vectors as prototypes. Pseudo nearest neighbors (PNNs) are then constructed from the local mean of each class, determined by comparing the HMD between the sample and the initial group of k neighbors. The query sample is finally classified by the Euclidean distance between it and the PNNs derived from the local means of these classes. Extensive experiments on various real UCI datasets and combined datasets compare LMPHNN with seven KNN-based classifiers, using precision, recall, accuracy, and F1 as evaluation metrics. LMPHNN achieves an average precision of 97%, surpassing the other methods by 14%. Average recall improves by 12% and average accuracy by 5%. LMPHNN also attains a 13% higher average F1 value than the other methods. In summary, LMPHNN outperforms the other classifiers and is less sensitive to small sample sizes.
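The pipeline described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not give the exact LMPHNN decision rule, so the sketch assumes that the local vectors are cumulative means of the k nearest neighbors in each class, and that the class whose local means have the smallest harmonic mean of Euclidean distances to the query wins. The function names (`harmonic_mean_distance`, `local_mean_vectors`, `classify`) and the `eps` guard are hypothetical, introduced only for illustration.

```python
import numpy as np


def harmonic_mean_distance(query, points, eps=1e-12):
    """Harmonic mean of the Euclidean distances from query to each row of points."""
    d = np.linalg.norm(points - query, axis=1)
    # eps avoids division by zero when the query coincides with a point.
    return len(d) / np.sum(1.0 / (d + eps))


def local_mean_vectors(query, class_points, k):
    """Cumulative means of the k nearest neighbors of query within one class.

    Row i is the mean of the (i + 1) nearest neighbors (assumed prototype form).
    """
    d = np.linalg.norm(class_points - query, axis=1)
    nearest = class_points[np.argsort(d)[:k]]
    counts = np.arange(1, len(nearest) + 1)[:, None]
    return np.cumsum(nearest, axis=0) / counts


def classify(query, X, y, k=3):
    """Assign query to the class whose local means are closest in HMD (assumed rule)."""
    scores = {}
    for label in np.unique(y):
        means = local_mean_vectors(query, X[y == label], k)
        scores[label] = harmonic_mean_distance(query, means)
    return min(scores, key=scores.get)


# Toy data: two well-separated classes in the plane.
X = np.array([[0, 0], [0, 1], [1, 0], [5, 5], [5, 6], [6, 5]], dtype=float)
y = np.array([0, 0, 0, 1, 1, 1])
print(classify(np.array([0.2, 0.2]), X, y, k=2))
```

Because the harmonic mean is dominated by the smallest distances, a single distant neighbor (for example an outlier among the k neighbors) has less influence on the class score than it would under an arithmetic mean, which is one plausible reading of why HMD helps with small samples.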

Classification algorithm based on near neighbor interval of dimensional samples

An improved nearest neighbor searching method for classification problems

RW.KNN: a proposed random walk KNN algorithm for multi-label classification

Label Distribution Learning-Enhanced Dual-KNN for Text Classification

K-Ns: A Classifier by the Distance to the Nearest Subspace

On high-dimensional modifications of the nearest neighbor classifier

Nearest neighbor classification using cam weighted distance

Active and Passive Nearest Neighbor Algorithm: A Newly-Developed Supervised Classifier

Improving Nearest Neighbor Classification with Cam Weighted Distance

The Nearest Neighbor Algorithm of Local Probability Centers

Data Classification Based on Density Function Estimated by Radial Basis Function

A Fast Document Classification Algorithm Based on Improved KNN

Nearest Feature Line Classifier Based on Collaborative Representation with Nearest Neighbour Search Algorithm

Local Naive Bayes Nearest Neighbor for Image Classification

A Novel Pseudo Nearest Neighbor Classification Method Using Local Harmonic Mean Distance

An Improved K-Nearest Neighbor Algorithm for Text Categorization

Novel Data Classification Method Based on Radial Basis Function Networks

Under-bagging Nearest Neighbors for Imbalanced Classification

Improved KNN algorithms of spherical regions based on clustering and region division

A pre-averaged pseudo nearest neighbor classifier