CD-Tree: A Clustering-Based Dynamic Indexing and Retrieval Approach.

Yuchai Wan, Xiabi Liu, Yi Wu
DOI: https://doi.org/10.3233/ida-150418
IF: 1.7
2017-01-01
Intelligent Data Analysis
Abstract: In the big data era, efficient indexing of steadily growing databases is becoming vitally important for information retrieval. To adapt incrementally to database changes, we propose a novel clustering-based dynamic indexing and retrieval approach. The tree-like indexing structure, termed CD-Tree, updates itself as data are continually inserted, keeping the tree consistent with the latest state of the database. The nodes of the CD-Tree are fitted by Gaussian Mixture Models, on which we base an efficient updating algorithm. We further present a similarity retrieval method that uses the CD-Tree, combining one-way search with a backtracking strategy to achieve good retrieval accuracy and efficiency. We applied the CD-Tree to example-based image retrieval. The experimental results confirm that our approach is effective and promising.
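
The retrieval idea of a one-way (greedy) descent followed by backtracking can be illustrated with a minimal sketch. This is not the authors' algorithm: the abstract gives no details, so each node here is summarized by a single cluster center (a simplification standing in for a fitted Gaussian Mixture Model), and the names `Node`, `search`, and `max_backtracks` are hypothetical.

```python
import math
from dataclasses import dataclass, field


@dataclass
class Node:
    # Assumption: one center per node stands in for a fitted mixture model.
    center: tuple
    children: list = field(default_factory=list)  # internal nodes only
    points: list = field(default_factory=list)    # leaf nodes only


def search(root, query, max_backtracks=1):
    """Nearest-neighbor lookup: greedy descent, then limited backtracking."""
    best_dist, best_point = math.inf, None
    skipped = []      # branches passed over during descents
    node = root
    backtracks = 0
    while True:
        # One-way search: always follow the child whose center is closest.
        while node.children:
            ranked = sorted(node.children,
                            key=lambda c: math.dist(c.center, query))
            skipped.extend(ranked[1:])
            node = ranked[0]
        # Scan the leaf that the descent reached.
        for p in node.points:
            d = math.dist(p, query)
            if d < best_dist:
                best_dist, best_point = d, p
        if backtracks >= max_backtracks or not skipped:
            return best_point, best_dist
        # Backtracking: revisit the most promising skipped branch.
        skipped.sort(key=lambda c: math.dist(c.center, query))
        node = skipped.pop(0)
        backtracks += 1


# Toy tree in which the true nearest neighbor lies just across a cluster boundary.
leaf_a = Node(center=(0.0, 0.0), points=[(-2.0, 0.0), (2.0, 0.0)])
leaf_b = Node(center=(6.0, 0.0), points=[(3.2, 0.0), (8.8, 0.0)])
root = Node(center=(3.0, 0.0), children=[leaf_a, leaf_b])

greedy_hit, _ = search(root, (2.9, 0.0), max_backtracks=0)
refined_hit, _ = search(root, (2.9, 0.0), max_backtracks=1)
```

In this toy tree the purely greedy descent settles on a point in the nearer cluster, while a single backtrack into the neighboring cluster finds a closer point. That illustrates the accuracy and efficiency trade-off the abstract alludes to; the cap on backtracks is an assumption of this sketch.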