High-dimensional kNN joins with incremental updates

Cui Yu,Rui Zhang,Yaochun Huang,Hui Xiong
DOI: https://doi.org/10.1007/s10707-009-0076-5
IF: 2.7729
2009-01-01
GeoInformatica
Abstract:The k Nearest Neighbor (kNN) join operation associates each data object in one data set with its k nearest neighbors from the same or a different data set. The kNN join on high-dimensional data ( high-dimensional kNN join ) is a very expensive operation. Existing high-dimensional kNN join algorithms were designed for static data sets and therefore cannot handle updates efficiently. In this article, we propose a novel kNN join method, named kNNJoin + , which supports efficient incremental computation of kNN join results with updates on high-dimensional data. As a by-product, our method also provides answers for the reverse kNN queries with very little overhead. We have performed an extensive experimental study. The results show the effectiveness of kNNJoin + for processing high-dimensional kNN joins in dynamic workloads.
What problem does this paper attempt to address?