Best-First Based Parallel Nearest Neighbor Queries.

Yunjun Gao, Gencai Chen, Ling Chen, Chun Chen
2005-01-01
Abstract: Given a query point q, a nearest neighbor (NN) query retrieves the data point with the minimum distance to q (e.g., “find the nearest hotel to the airport”). It is one of the most important operations in spatial and spatio-temporal databases. However, most existing NN search methods target only a single disk, which incurs significant query cost (CPU time and I/O overhead) and a large number of node accesses as the volume of data points grows. Motivated by these problems, this paper presents the first Best-First based Parallel NN (BFPNN) algorithm and the Full BFPNN (FBFPNN) algorithm, which process NN retrieval effectively by means of parallelism (i.e., fetching more nodes or data points from multiple disks simultaneously) in a multi-disk setting. The goal is to reduce query cost and alleviate I/O overhead, and thereby to facilitate NN retrieval. Extensive experiments on various real and synthetic datasets verify that the proposed algorithms are correct and that they outperform existing ones (e.g., the FPSS and CRSS algorithms) in most cases in terms of effectiveness and scalability.
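To make the best-first idea concrete, the following is a minimal single-threaded sketch of generic best-first NN search over a hierarchy of bounding rectangles. It is not the paper's BFPNN or FBFPNN algorithm and does not model multiple disks; the `Node` class, the `mindist` helper, and the `best_first_nn` function are hypothetical names introduced only for illustration. The sketch assumes 2-D points and Euclidean distance.

```python
import heapq
import itertools
import math
from dataclasses import dataclass, field
from typing import List, Optional, Tuple

Point = Tuple[float, float]
Rect = Tuple[float, float, float, float]  # (xmin, ymin, xmax, ymax)


@dataclass
class Node:
    """Hypothetical index node: a bounding rectangle over child nodes or points."""
    mbr: Rect
    children: List["Node"] = field(default_factory=list)
    points: List[Point] = field(default_factory=list)


def mindist(q: Point, mbr: Rect) -> float:
    """Smallest possible distance from q to any point inside the rectangle."""
    xmin, ymin, xmax, ymax = mbr
    dx = max(xmin - q[0], 0.0, q[0] - xmax)
    dy = max(ymin - q[1], 0.0, q[1] - ymax)
    return math.hypot(dx, dy)


def best_first_nn(root: Node, q: Point) -> Tuple[Optional[Point], float, int]:
    """Return (nearest point, its distance, number of nodes expanded)."""
    tie = itertools.count()  # tie-breaker so heap entries never compare payloads
    heap = [(mindist(q, root.mbr), next(tie), root)]
    expanded = 0
    while heap:
        dist, _, item = heapq.heappop(heap)
        if not isinstance(item, Node):
            # A point at the head of the queue is no farther than anything
            # still queued, so it is the nearest neighbor.
            return item, dist, expanded
        expanded += 1
        for child in item.children:
            heapq.heappush(heap, (mindist(q, child.mbr), next(tie), child))
        for p in item.points:
            heapq.heappush(heap, (math.dist(q, p), next(tie), p))
    return None, math.inf, expanded


# Usage with two illustrative leaves; the far leaf is never expanded.
leaf_a = Node(mbr=(0.0, 0.0, 2.0, 2.0), points=[(1.0, 1.0), (2.0, 2.0)])
leaf_b = Node(mbr=(8.0, 8.0, 10.0, 10.0), points=[(9.0, 9.0), (8.0, 10.0)])
root = Node(mbr=(0.0, 0.0, 10.0, 10.0), children=[leaf_a, leaf_b])
nearest, distance, expanded = best_first_nn(root, (3.0, 3.0))
print(nearest, distance, expanded)
```

Because entries are popped in order of their lower-bound distance, subtrees that cannot contain a closer point are never read. A parallel variant in the spirit of the abstract would presumably fetch several queued nodes from different disks at once, but how the paper does this is not specified here.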