Efficient parallel processing for K-nearest-neighbor search in spatial databases

Yunjun Gao,Ling Chen,Gencai Chen,Chun Chen
DOI: https://doi.org/10.1007/11751649_5
2006-01-01
Abstract:Even though the problem of k nearest neighbor (kNN) query is well-studied in serial environment, there is little prior work on parallel kNN search processing in parallel one. In this paper, we present the first Best-First based Parallel kNN (BFPkNN) query algorithm in a multi-disk setting, for efficient handling of kNN retrieval with arbitrary values of k by parallelization. The core of our method is to access more entries from multiple disks simultaneously and enable several effective pruning heuristics to discard non-qualifying entries. Extensive experiments with real and synthetic datasets confirm that BFPkNN significantly outperforms its competitors in both efficiency and scalability.
What problem does this paper attempt to address?