Abstract:Contemporary database applications often perform queries in hybrid data spaces (HDS) where vectors can have a mix of continuous valued and non-ordered discrete valued dimensions. To support efficient query processing for an HDS, a robust indexing method is required. Existing indexing techniques to process queries efficiently either apply to continuous data spaces (e.g., the R-tree) or non-ordered discrete data spaces (e.g., the ND-tree). No techniques directly indexing vectors in HDSs have been reported in the literature. In this paper, we propose a new multidimensional indexing technique, called the C-ND tree, to directly index vectors in an HDS. To build such an index, we first introduce some essential geometric concepts (e.g., hybrid bounding rectangle) in HDSs. The C-ND tree structure and the relevant tree building and query processing algorithms based on these geometric concepts in HDSs are then presented. Strategies have been suggested to make the values in continuous dimensions and non-ordered discrete dimensions comparable and controllable. Novel node splitting heuristics which exploit characteristics of both continuous and discrete dimensions are proposed. Performance of the C-ND tree is compared with that of linear scan, R*-tree and ND-tree using range queries on hybrid data. Experimental results demonstrate that the C-ND tree is quite promising in supporting range queries in HDSs.

A High Dimensional Index Based on Relative Distance Hashing Method

A New Linkless Hierarchical Structure Based on Perfect Hashing

Indexing High-Dimensional Data in Dual Distance Spaces

Novel High-Dimensional Indexing Structure Based on Dual-Distance Metric

Enhanced Locality Sensitive Clustering in High Dimensional Space

Indexing high-dimensional data in dual distance spaces: a symmetrical encoding approach

Composite Distance Transformation for Indexing and K -Nearest-neighbor Searching in High-Dimensional Spaces

iDistance: An adaptive B+-tree based indexing method for nearest neighbor search

LuSH: A Generic High-Dimensional Index Framework.

An Adaptive and Dynamic Dimensionality Reduction Method for High-Dimensional Indexing.

Density Sensitive Hashing

An Adaptive And Efficient Dimensionality Reduction Algorithm For High-Dimensional Indexing

Harmonious Hashing

An Encoding-Based Dual Distance Tree High-Dimensional Index

Bi-Level Locality Sensitive Hashing Index Based on Clustering

High Dimensional Hybrid Index Based on Query Sampling

Contorting High Dimensional Data for Efficient Main Memory KNN Processing

Experimental Analysis of Locality Sensitive Hashing Techniques for High-Dimensional Approximate Nearest Neighbor Searches

The C-ND Tree: a Multidimensional Index for Hybrid Continuous and Non-Ordered Discrete Data Spaces

Speed Up Linear Scan in High-Dimensions by Sorting One-Dimensional Projections