High-Dimensional Indexing Method Based on Elliptical-Shaped Clustering

Jiang-Tao CUI, Yong GUO, Shui-Sheng ZHOU
DOI: https://doi.org/10.3969/j.issn.1003-6059.2010.04.007
2010-01-01
Abstract: A high-dimensional linear indexing method is presented that sorts principal components based on elliptical-shaped clustering. The proposed approach reduces the number of data points accessed during k-nearest neighbor search. The dataset is partitioned into elliptical-shaped clusters, and the KL (Karhunen-Loève) transform is performed on each cluster. Approximate vectors are then built in the KL transform domain of each cluster. During k-nearest neighbor search, the partial distortion search algorithm is used to reject approximate vectors that cannot be among the nearest neighbors. The clusters are accessed in increasing order of their lower-bound distance from the query point. Experimental results on large, high-dimensional image databases show that, compared with other well-known vector approximation methods, the proposed approach accesses fewer approximate vectors and achieves a higher search speed.
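The rejection step described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: it handles a single cluster, works on exact KL coefficients and not on quantized approximate vectors, and omits the cluster lower-bound ordering. The function names `kl_transform` and `pds_knn` are hypothetical and chosen only for this illustration. The sketch shows why sorting principal components helps: when the highest-variance components come first, the partial squared distance tends to exceed the current k-th best distance after only a few dimensions, so most candidates are rejected early.

```python
import heapq
import numpy as np

def kl_transform(points):
    """Project points onto their principal axes, sorted by decreasing variance."""
    mean = points.mean(axis=0)
    centered = points - mean
    cov = centered.T @ centered / len(points)
    eigvals, eigvecs = np.linalg.eigh(cov)   # eigenvalues in ascending order
    order = np.argsort(eigvals)[::-1]        # reorder to descending variance
    basis = eigvecs[:, order]                # orthonormal, so distances are preserved
    return mean, basis, centered @ basis

def pds_knn(query, mean, basis, coeffs, k):
    """k-NN with partial distortion search in the KL domain (illustrative only)."""
    q = (query - mean) @ basis
    heap = []  # max-heap emulated with negated squared distances: (-dist, index)
    for i, c in enumerate(coeffs):
        # Current k-th best squared distance, or infinity until k candidates exist.
        bound = -heap[0][0] if len(heap) == k else np.inf
        partial = 0.0
        rejected = False
        for d in range(len(q)):
            diff = q[d] - c[d]
            partial += diff * diff
            if partial >= bound:   # cannot beat the k-th best: stop early
                rejected = True
                break
        if not rejected:
            if len(heap) == k:
                heapq.heapreplace(heap, (-partial, i))
            else:
                heapq.heappush(heap, (-partial, i))
    # Return (squared distance, index) pairs, nearest first.
    return sorted((-neg, i) for neg, i in heap)
```

Because the KL basis is orthonormal, distances computed in the transform domain equal those in the original space, so the early rejection does not change the result of an exact search on these coefficients.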