EHL*: Memory-Budgeted Indexing for Ultrafast Optimal Euclidean Pathfinding

Jinchun Du, Bojie Shen, Muhammad Aamir Cheema
2024-08-21
Abstract: The Euclidean Shortest Path Problem (ESPP), which involves finding the shortest path in a Euclidean plane with polygonal obstacles, is a classic problem with numerous real-world applications. The current state-of-the-art solution, Euclidean Hub Labeling (EHL), offers ultra-fast query performance, outperforming existing techniques by 1-2 orders of magnitude in runtime efficiency. However, this performance comes at the cost of significant memory overhead, requiring up to tens of gigabytes of storage on large maps, which can limit its applicability in memory-constrained environments like mobile phones or smaller devices. Additionally, EHL's memory usage can only be determined after index construction, and while it provides a memory-runtime tradeoff, it does not fully optimize memory utilization. In this work, we introduce an improved version of EHL, called EHL*, which overcomes these limitations. A key contribution of EHL* is its ability to create an index that adheres to a specified memory budget while optimizing query runtime performance. Moreover, EHL* can leverage preknown query distributions, a common scenario in many real-world applications, to further enhance runtime efficiency. Our results show that EHL* can reduce memory usage by up to 10-20 times without much impact on query runtime performance compared to EHL, making it a highly effective solution for optimal pathfinding in memory-constrained environments.
Databases, Artificial Intelligence
What problem does this paper attempt to address?
This paper addresses how to optimize query performance for the Euclidean Shortest Path Problem (ESPP) in memory-constrained environments while still guaranteeing that the optimal path is found. The existing Euclidean Hub Labeling (EHL) method answers queries very quickly, but its high memory usage limits its use in memory-constrained environments such as mobile devices. In addition, EHL's memory usage can only be determined after the index is constructed, and EHL cannot fully utilize known query distribution information to further optimize performance.

To solve these problems, the authors propose EHL*, an improved version of EHL that optimizes query runtime within a specified memory budget and can use known query distribution information to further improve efficiency. EHL* introduces a compression phase that merges adjacent grid cells into regions of arbitrary shape, which reduces memory usage without significantly affecting query performance.

### Main contributions

1. **Memory budget control**: EHL* can create an index within a specified memory budget, optimizing query performance while meeting the memory limit.
2. **Utilizing query distribution**: EHL* can use pre-known query distribution information to further improve runtime efficiency.
3. **Compression strategy**: By merging adjacent regions with highly similar labels, EHL* significantly reduces memory usage while maintaining query performance.

### Experimental results

The experimental results show that EHL* can reduce memory usage by up to 10-20 times under different memory budgets with little impact on query runtime, making it an efficient solution for memory-constrained environments.
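The compression strategy described above (merge adjacent regions with similar labels until the index fits a memory budget) can be illustrated with a toy greedy loop. This is a minimal sketch under stated assumptions, not the authors' algorithm: memory is approximated by the total number of stored labels, a merged region is assumed to store the union of its parts' labels, similarity is measured with Jaccard similarity of label sets, and the names `jaccard` and `compress_to_budget` are hypothetical.

```python
def jaccard(a: set, b: set) -> float:
    """Jaccard similarity |a ∩ b| / |a ∪ b| of two label sets."""
    union = a | b
    return len(a & b) / len(union) if union else 1.0


def compress_to_budget(labels, adjacency, budget):
    """Greedily merge adjacent regions until the label count fits the budget.

    labels:    dict mapping region id -> set of hub labels (hypothetical layout)
    adjacency: iterable of (region_a, region_b) pairs that share a border
    budget:    maximum total number of stored labels (a stand-in for bytes)
    """
    labels = {r: set(h) for r, h in labels.items()}  # work on a copy
    edges = {frozenset(e) for e in adjacency}

    while sum(len(h) for h in labels.values()) > budget and edges:
        # Pick the adjacent pair with the most similar label sets.
        # Sorting makes tie-breaking deterministic.
        pairs = sorted(tuple(sorted(e)) for e in edges)
        a, b = max(pairs, key=lambda p: jaccard(labels[p[0]], labels[p[1]]))

        # Assumption: the merged region stores the union of both label sets.
        labels[a] |= labels.pop(b)

        # Redirect b's borders to a and drop the now-internal edge.
        rewired = set()
        for e in edges:
            e2 = frozenset(a if x == b else x for x in e)
            if len(e2) == 2:
                rewired.add(e2)
        edges = rewired

    return labels
```

Under these assumptions, merging two regions saves exactly as many labels as they share, which is why highly similar neighbours are merged first. The loop stops early if no adjacent pairs remain, so a budget that is too small is simply not met.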
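### Formula representation

- **Shortest distance calculation**:

  \[ d(s, t)=\min_{h_{i} \in H(c_{s}) \cap H(c_{t})}\left(vdist_{\text{min}}(s, h_{i})+vdist_{\text{min}}(t, h_{i})\right) \]

  where \(H(c_{s})\) and \(H(c_{t})\) are the hub sets stored for the cells containing \(s\) and \(t\), and \(vdist_{\text{min}}(p, h_{i})\) is the minimum via-distance from point \(p\) to hub node \(h_{i}\), that is, the shortest distance from \(p\) to \(h_{i}\) passing through an intermediate vertex.

- **Jaccard similarity**:

  \[ r = \arg\max_{r' \in R}\left(\frac{|H(r') \cap H(e)|}{|H(r') \cup H(e)|}\right) \]

  where \(r\) is the region in the candidate set \(R\) whose hub set has the highest Jaccard similarity to \(H(e)\), the hub set of the element \(e\) being merged.

The first formula gives the distance computation at query time; the second selects the most similar region to merge with, which is how EHL* reduces memory usage while maintaining efficient query performance.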
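The shortest distance formula above can be read as a small computation over stored labels. The following is a minimal sketch, assuming each label is a tuple `(hub, vertex, dist)` where `vertex` is an intermediate point visible from the query point and `dist` is the precomputed distance from that vertex to the hub. This label layout and the function names are assumptions made for illustration; the sketch also skips visibility checks and the case where \(s\) and \(t\) see each other directly.

```python
import math


def vdist_min(p, hub, via_labels):
    """Minimum distance from point p to `hub` through any stored vertex.

    via_labels: list of (hub, vertex_xy, dist_vertex_to_hub) tuples
                (hypothetical layout; vertices are assumed visible from p).
    """
    return min(
        (math.dist(p, v) + d for h, v, d in via_labels if h == hub),
        default=math.inf,
    )


def query_distance(s, t, labels_s, labels_t):
    """d(s, t): minimise over hubs common to both label lists."""
    common = {h for h, _, _ in labels_s} & {h for h, _, _ in labels_t}
    return min(
        (vdist_min(s, h, labels_s) + vdist_min(t, h, labels_t) for h in common),
        default=math.inf,  # no common hub: no path found by this sketch
    )


# Usage: one shared hub "h1"; "h2" appears only on the s side and is ignored.
s, t = (0.0, 0.0), (6.0, 0.0)
labels_s = [("h1", (3.0, 4.0), 1.0), ("h2", (0.0, 1.0), 10.0)]
labels_t = [("h1", (6.0, 3.0), 2.0)]
print(query_distance(s, t, labels_s, labels_t))  # (5 + 1) + (3 + 2) = 11.0
```

Only hubs present on both sides contribute, so the cost of a query grows with the number of labels stored for the two regions involved.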