Faster Fréchet Distance Approximation through Truncated Smoothing

Thijs van der Horst, Tim Ophelders
2024-01-26
Abstract: The Fréchet distance is a popular distance measure for curves. Computing the Fréchet distance between two polygonal curves of $n$ vertices takes roughly quadratic time, and conditional lower bounds suggest that even approximating to within a factor $3$ cannot be done in strongly-subquadratic time, even in one dimension. The current best approximation algorithms present trade-offs between approximation quality and running time. Recently, van der Horst $\textit{et al.}$ (SODA, 2023) presented an $O((n^2 / \alpha) \log^3 n)$ time $\alpha$-approximate algorithm for curves in arbitrary dimensions, for any $\alpha \in [1, n]$. Our main contribution is an approximation algorithm for curves in one dimension, with a significantly faster running time of $O(n \log^3 n + (n^2 / \alpha^3) \log^2 n \log \log n)$. Additionally, we give an algorithm for curves in arbitrary dimensions that improves upon the state-of-the-art running time by a logarithmic factor, to $O((n^2 / \alpha) \log^2 n)$. Both of our algorithms rely on a linear-time simplification procedure that in one dimension reduces the complexity of the reachable free space to $O(n^2 / \alpha)$ without making sacrifices in the asymptotic approximation factor.
Computational Geometry
What problem does this paper attempt to address?
### Problems the paper attempts to solve

This paper addresses the cost of computing the Fréchet distance. Specifically, it aims to reduce the running time of \(\alpha\)-approximate algorithms, so that a coarser approximation factor \(\alpha\) buys a larger speed-up.

### Background and motivation

The Fréchet distance is a distance measure for comparing curves and is widely used in fields such as trajectory analysis, handwriting recognition, and time-series matching. Computing it between two polygonal curves of \(n\) vertices takes roughly quadratic time, which is costly on large inputs. Conditional lower bounds suggest that even a factor-3 approximation cannot be computed in strongly subquadratic time, so existing approximation algorithms trade approximation quality against running time.

### Main contributions

1. **Fast approximation algorithm for one-dimensional curves**:
   - An \(\alpha\)-approximate algorithm for curves in one dimension with running time \(O(n\log^3 n+\frac{n^2}{\alpha^3}\log^2 n\log\log n)\).
2. **Improved algorithm for curves of arbitrary dimensions**:
   - An \(\alpha\)-approximate algorithm for curves in arbitrary dimensions with running time \(O(\frac{n^2}{\alpha}\log^2 n)\), a logarithmic-factor improvement over the previous \(O(\frac{n^2}{\alpha}\log^3 n)\) bound of van der Horst et al. (SODA, 2023).

### Key techniques

1. **Curve simplification**:
   - A linear-time simplification procedure reduces the complexity of the curves, and with it the complexity of the reachable free space. In the one-dimensional case, the reachable free space of the simplified curves has complexity \(O(\frac{n^2}{\alpha})\), without sacrificing the asymptotic approximation factor.
2. **Efficient traversal of the free space diagram**:
   - Exploiting the orthogonal convexity of the free space, a data structure is built so that traversing each block takes \(O((|P_i| + |Q_j|)\log n)\) time.
3. **Special treatment for one-dimensional curves**:
   - For one-dimensional curves, the running time is reduced further by constructing approximate exit sets. In particular, an efficient approximate decision algorithm is obtained by handling the matching of signature vertices.

### Conclusion

By combining a new curve simplification technique with efficient traversal of the free space diagram, the paper speeds up Fréchet distance approximation, most notably for one-dimensional curves. The results are of theoretical interest and could also benefit applications that process large collections of curves.
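To make the quadratic baseline from the background concrete, here is a minimal Python sketch of the *discrete* Fréchet distance, a simpler relative of the continuous Fréchet distance studied in the paper. It is not the paper's algorithm: it only couples vertices to vertices, so its result is an upper bound on the continuous distance. The function name `discrete_frechet` and the representation of points as coordinate tuples are assumptions made for illustration.

```python
from math import dist  # Euclidean distance between two points (Python 3.8+)


def discrete_frechet(P, Q):
    """Discrete Frechet distance between two vertex sequences.

    P and Q are non-empty lists of equal-dimension coordinate tuples,
    e.g. [(0.0,), (1.0,)] for a curve in one dimension.
    Illustrative sketch only, not the algorithm from the paper.
    """
    n, m = len(P), len(Q)
    if n == 0 or m == 0:
        raise ValueError("both curves need at least one vertex")

    # dp[i][j] = cost of the best monotone coupling of P[:i+1] and Q[:j+1]
    dp = [[0.0] * m for _ in range(n)]
    for i in range(n):
        for j in range(m):
            d = dist(P[i], Q[j])
            if i == 0 and j == 0:
                dp[i][j] = d
            elif i == 0:
                dp[i][j] = max(dp[0][j - 1], d)
            elif j == 0:
                dp[i][j] = max(dp[i - 1][0], d)
            else:
                # Advance on P, on Q, or on both; keep the cheapest option.
                cheapest = min(dp[i - 1][j], dp[i][j - 1], dp[i - 1][j - 1])
                dp[i][j] = max(cheapest, d)
    return dp[n - 1][m - 1]


if __name__ == "__main__":
    P = [(0.0,), (1.0,), (2.0,)]
    Q = [(0.0,), (2.0,)]
    print(discrete_frechet(P, Q))
```

The two nested loops fill an \(n \times m\) table, which is the quadratic cost that the approximation algorithms summarized above aim to beat. In the usage example the middle vertex of `P` has no vertex of `Q` closer than distance 1, so the discrete value is 1 even though the two curves trace the same segment.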