Abstract:Motivated by applications in computer vision and databases, we introduce and study the Simultaneous Nearest Neighbor Search (SNN) problem. Given a set of data points, the goal of SNN is to design a data structure that, given a collection of queries, finds a collection of close points that are compatible with each other. Formally, we are given k query points Q=q_1,⋯,q_k, and a compatibility graph G with vertices in Q, and the goal is to return data points p_1,⋯,p_k that minimize (i) the weighted sum of the distances from q_i to p_i and (ii) the weighted sum, over all edges (i,j) in the compatibility graph G, of the distances between p_i and p_j. The problem has several applications, where one wants to return a set of consistent answers to multiple related queries. This generalizes well-studied computational problems, including NN, Aggregate NN and the 0-extension problem. In this paper we propose and analyze the following general two-step method for designing efficient data structures for SNN. In the first step, for each query point q_i we find its (approximate) nearest neighbor point p̂_i; this can be done efficiently using existing approximate nearest neighbor structures. In the second step, we solve an off-line optimization problem over sets q_1,⋯,q_k and p̂_1,⋯,p̂_k; this can be done efficiently given that k is much smaller than n. Even though p̂_1,⋯,p̂_k might not constitute the optimal answers to queries q_1,⋯,q_k, we show that, for the unweighted case, the resulting algorithm is O(log k/loglog k)-approximation. Also, we show that the approximation factor can be in fact reduced to a constant for compatibility graphs frequently occurring in practice. Finally, we show that the "empirical approximation factor" provided by the above approach is very close to 1.

K Nearest Neighbor Queries and Knn-Joins in Large Relational Databases (almost) for Free

Constrained All-k-Nearest-Neighbor Search

Nearest group queries.

Efficient index-based KNN join processing for high-dimensional data

Efficient Parallel Processing of High-Dimensional Spatial K NN Queries

Efficient K-Nearest Neighbor Join Algorithms for High Dimensional Sparse Data

Preserving-Ignoring Transformation Based Index for Approximate k Nearest Neighbor Search

An Efficient Method for k Nearest Neighbor Searching in Obstructed Spatial Databases.

Efficient K -NN Searching over Large Uncertain Time Series Database.

Evaluating a Stream of Relational KNN Queries by a Knowledge Base.

Performance Optimization For The K-Nearest Neighbors Kernel On X86 Architectures

Enhancing K-nearest neighbor algorithm: a comprehensive review and performance analysis of modifications

Processing Incomplete K Nearest Neighbor Search

Efficient parallel processing for K-nearest-neighbor search in spatial databases

High-dimensional kNN joins with incremental updates

Relational Algorithms for K-Means Clustering

Solutions for Processing K Nearest Neighbor Joins for Massive Data on MapReduce

Processing Conflict-Aware $k$ Nearest Neighbor Queries in Euclidean Space

Memory-Efficient RkNN Retrieval by Nonlinear k-Distance Approximation

Simultaneous Nearest Neighbor Search.

On efficient mutual nearest neighbor query processing in spatial databases