LAZY R-tree: the R-tree with Lazy Splitting Algorithm

Yang,Pengwei Bai,Ningling Ge,Zhipeng Gao,Xuesong Qiu
DOI: https://doi.org/10.1177/0165551519828616
2019-01-01
Journal of Information Science
Abstract:The spatial index is a data structure formed according to the position and shape of the spatial object or the relationship between the spatial objects according to certain rules, and the spatial data is managed by an effective spatial data structure. The quality of a spatial index directly affects the performance of spatial queries. The R-tree index structure is a highly efficient spatial index. According to the R-tree query rule, when performing spatial query, most data that is not related to the query condition can be filtered out, and finally, a few leaf nodes can be accessed to query the data satisfying the condition. Its query performance is affected by factors such as non-leaf node overlap and node space utilisation. This article proposes a lazy splitting method to improve the R-tree construction process. The scheme works as follows: (1) When a node overflows, it creates an overflow node for that node and all overflow nodes are saved in a hash table. (2) If the node continues to insert data, the data are added to its overflow node. (3) When an overflow node is saturated, the node and its overflow node are split into two saturated nodes. We use both simulated and actual data to perform experiments. The experimental results show that an R-tree constructed by the lazy algorithm is superior to an R-tree constructed using the original R-tree PM algorithm or the corner-based splitting (CBS) algorithm based on the number of splits created, the node space used and the efficiency of region queries and k-nearest neighbour (kNN) queries.
What problem does this paper attempt to address?