Is It Possible to Find the Single Nearest Neighbor of a Query in High Dimensions?

Kai Ming Ting,Takashi Washio,Ye Zhu,Yang Xu,Kaifeng Zhang
DOI: https://doi.org/10.1016/j.artint.2024.104206
IF: 14.4
2024-01-01
Artificial Intelligence
Abstract:We investigate an open question in the study of the curse of dimensionality: Is it possible to find the single nearest neighbor of a query in high dimensions? Using the notion of (in)distinguishability to examine whether the feature map of a kernel is able to distinguish two distinct points in high dimensions, we analyze this ability of a metric-based Lipschitz continuous kernel as well as that of the recently introduced Isolation Kernel. Between the two kernels, we show that only Isolation Kernel has distinguishability and it performs consistently well in four tasks: indexed search for exact nearest neighbor search, anomaly detection using kernel density estimation, t-SNE visualization and SVM classification in both low and high dimensions, compared with distance, Gaussian and three other existing kernels.
What problem does this paper attempt to address?