Abstract:While the classical KNN (k nearest neighbor) shares its avoidance of the consistent distribution assumption between training and testing samples to achieve fast prediction, it still faces two challenges: (a) its generalization ability heavily depends on an appropriate number k of nearest neighbors; (b) its prediction behavior lacks interpretability. In order to address the two challenges, a novel Bayes-decisive linear KNN with adaptive nearest neighbors (i.e., BLA-KNN) is proposed to obtain the following three merits: (a) a diagonal matrix is introduced to adaptively select the nearest neighbors and simultaneously improve the generalization capability of the proposed BLA-KNN method; (b) the proposed BLA-KNN method owns the group effect, which inherits and extends the group property of the sum of squares for total deviations by reflecting the training sample class-aware information in the group effect regularization term; (c) the prediction behavior of the proposed BLA-KNN method can be interpreted from the Bayes-decision-rule perspective. In order to do so, we first use a diagonal matrix to weigh each training sample so as to obtain the importance of the sample, while constraining the importance weights to ensure that the adaptive k value is carried out efficiently. Second, we introduce a class-aware information regularization term in the objective function to obtain the nearest neighbor group effect of the samples. Finally, we introduce linear expression weights related to the distance measure between the testing and training samples in the regularization term to ensure that the interpretation of Bayes-decision-rule can be performed smoothly. We also optimize the proposed objective function using an alternating optimization strategy. We experimentally demonstrate the effectiveness of the proposed BLA-KNN method by comparing it with 7 comparative methods on 15 benchmark datasets.

Tittle : A novel ensemble method for k-nearest neighbor Author order :

An Optimal Random Projection k Nearest Neighbors Ensemble via Extended Neighborhood Rule for Binary Classification

Under-bagging Nearest Neighbors for Imbalanced Classification

Ensemble Learning with Extremely Randomized k-Nearest Neighbors for Accurate and Efficient Classification

K-Ns: A Classifier by the Distance to the Nearest Subspace.

Distributionally Robust Weighted $k$-Nearest Neighbors

KNN Ensembles for Tweedie Regression: The Power of Multiscale Neighborhoods

Adapt Bagging to Nearest Neighbor Classifiers

Ensembling Local Learners Through Multimodal Perturbation

A Sparse Reconstructive Evidential -Nearest Neighbor Classifier for High-dimensional Data

Revisiting K-Nn for Fine-Tuning Pre-trained Language Models

A Random Projection k Nearest Neighbours Ensemble for Classification via Extended Neighbourhood Rule

A Novel Density Peaks Clustering Algorithm Based on K Nearest Neighbors with Adaptive Merging Strategy

Bayesian Model Selection Methods for Mutual and Symmetric $k$-Nearest Neighbor Classification

Surprisal Driven $k$-NN for Robust and Interpretable Nonparametric Learning

An ensemble-driven k-NN approach to ill-posed classification problems

Classification of Functional Data with k-Nearest-Neighbor Ensembles by Fitting Constrained Multinomial Logit Models

Bayes-Decisive Linear KNN with Adaptive Nearest Neighbors

Efficient Estimation of k for the Nearest Neighbors Class of Methods

A Novel Pseudo Nearest Neighbor Classification Method Using Local Harmonic Mean Distance

Probabilistic SVM classifier ensemble selection based on GMDH-type neural network