Irregular Partitioning Method Based K-Nearest Neighbor Query Algorithm Using Mapreduce

Qingqing Zhang, Changyun Li, Pinjie He, Xu Li, Haojie Zou
DOI: https://doi.org/10.2991/isci-15.2015.237
2015-01-01
Abstract: With the dramatic increase in available data, data processing must achieve ever higher performance. Most research on the k-Nearest Neighbor (kNN) query algorithm is based on regular partitioning, which easily causes load imbalance and can degrade the overall performance of the kNN query algorithm. In addition, the traditional kNN query algorithm runs on a single process or a single machine, which is not efficient enough when dealing with big data. To address these two issues, this paper presents a kNN algorithm based on an irregular partitioning method and executes it on the distributed parallel computing platform MapReduce. Experiments show that the irregular-partitioning-based kNN algorithm using MapReduce achieves much higher performance and can guarantee efficient queries when dealing with big data.
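The load-imbalance argument in the abstract can be illustrated with a small sketch. The abstract does not describe the paper's actual partitioning scheme, so the code below is only an assumption-laden stand-in: "irregular" partitioning is modelled as equal-count (quantile) cuts along the x axis, "regular" partitioning as equal-width cuts, and the MapReduce phases are simulated in a single Python process rather than run on Hadoop. All function names are hypothetical.

```python
import bisect
import heapq
from collections import defaultdict

def dist2(p, q):
    """Squared Euclidean distance between two points (tuples)."""
    return sum((a - b) ** 2 for a, b in zip(p, q))

def regular_boundaries(points, n_parts):
    """Equal-width cuts along x (stand-in for regular partitioning)."""
    xs = [p[0] for p in points]
    lo, hi = min(xs), max(xs)
    width = (hi - lo) / n_parts
    return [lo + width * i for i in range(1, n_parts)]

def irregular_boundaries(points, n_parts):
    """Equal-count cuts along x (assumed stand-in for irregular partitioning)."""
    xs = sorted(p[0] for p in points)
    return [xs[len(xs) * i // n_parts] for i in range(1, n_parts)]

def map_phase(points, boundaries):
    """Assign each point to a partition keyed by its x coordinate."""
    parts = defaultdict(list)
    for p in points:
        parts[bisect.bisect_right(boundaries, p[0])].append(p)
    return parts

def max_load(points, boundaries):
    """Size of the largest partition, i.e. the slowest worker's input."""
    return max(len(part) for part in map_phase(points, boundaries).values())

def knn(points, query, k, boundaries):
    """Each partition emits its local top-k; the reduce step merges them."""
    candidates = []
    for part in map_phase(points, boundaries).values():
        # Simulated per-partition task: k nearest points within this partition.
        candidates.extend(heapq.nsmallest(k, ((dist2(p, query), p) for p in part)))
    # Simulated reduce: global top-k among all local candidates.
    return [p for _, p in heapq.nsmallest(k, candidates)]
```

On skewed data, equal-width cuts can leave most points in one partition while equal-count cuts keep partition sizes close to uniform. Both variants return the same exact neighbours here, because every partition contributes its local top-k before the merge; only the distribution of work differs.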