Abstract:Motivated by applications in computer vision and databases, we introduce and study the Simultaneous Nearest Neighbor Search (SNN) problem. Given a set of data points, the goal of SNN is to design a data structure that, given a collection of queries, finds a collection of close points that are compatible with each other. Formally, we are given k query points Q=q_1,⋯,q_k, and a compatibility graph G with vertices in Q, and the goal is to return data points p_1,⋯,p_k that minimize (i) the weighted sum of the distances from q_i to p_i and (ii) the weighted sum, over all edges (i,j) in the compatibility graph G, of the distances between p_i and p_j. The problem has several applications, where one wants to return a set of consistent answers to multiple related queries. This generalizes well-studied computational problems, including NN, Aggregate NN and the 0-extension problem. In this paper we propose and analyze the following general two-step method for designing efficient data structures for SNN. In the first step, for each query point q_i we find its (approximate) nearest neighbor point p̂_i; this can be done efficiently using existing approximate nearest neighbor structures. In the second step, we solve an off-line optimization problem over sets q_1,⋯,q_k and p̂_1,⋯,p̂_k; this can be done efficiently given that k is much smaller than n. Even though p̂_1,⋯,p̂_k might not constitute the optimal answers to queries q_1,⋯,q_k, we show that, for the unweighted case, the resulting algorithm is O(log k/loglog k)-approximation. Also, we show that the approximation factor can be in fact reduced to a constant for compatibility graphs frequently occurring in practice. Finally, we show that the "empirical approximation factor" provided by the above approach is very close to 1.

Supporting Subseries Nearest Neighbor Search Via Approximation

Dynamic Time Warping under Product Quantization, with Applications to Time-Series Data Similarity Search

Accelerating Exact Nearest Neighbor Search in High Dimensional Euclidean Space Via Block Vectors

Approximate Nearest Neighbor Search on High Dimensional Data — Experiments, Analyses, and Improvement

Constrained All-k-Nearest-Neighbor Search

Variable Step Algorithm for Sub-Trend Sequence Searching

Piecewise Chebyshev Factorization Based Nearest Neighbour Classification for Time Series

Multi-Querying: A Subsequence Matching Approach to Support Multiple Queries.

Towards a faster symbolic aggregate approximation method

Return of the Lernaean Hydra: Experimental Evaluation of Data Series Approximate Similarity Search

Lazylsh: Approximate Nearest Neighbor Search For Multiple Distance Functions With A Single Index

Subspace Collision: An Efficient and Accurate Framework for High-dimensional Approximate Nearest Neighbor Search

Indexable Online Time Series Segmentation with Error Bound Guarantee

Preserving-Ignoring Transformation Based Index for Approximate k Nearest Neighbor Search

Subsequence Similarity Search under Time Shifting

Simultaneous Nearest Neighbor Search.

Indexable PLA for Efficient Similarity Search

A comprehensive survey and experimental comparison of graph-based approximate nearest neighbor search

Interest-Based Queries For Time Series Data

Nearest group queries.

Efficient Algorithm for a Novel Pattern of Time Series