The k-outlier Fréchet distance

Maike Buchin,Lukas Plätz
DOI: https://doi.org/10.48550/arXiv.2202.12824
2022-02-26
Abstract:The Fréchet distance is a popular metric for curves; however, its bottleneck character is a disadvantage in many applications. Here we introduce two variants of the Fréchet distance to cope with this problem and expand the work on shortcut Fréchet distances. We present an efficient algorithm for computing the new distance measure.
Computational Geometry
What problem does this paper attempt to address?
The problem that this paper attempts to solve is how to deal with outliers in data when calculating the Fréchet distance between curves. Specifically, the traditional Fréchet distance is very sensitive to measurement errors in data because it is defined based on the maximum distance of the minimum correspondence between two curves. This means that even a single outlier can determine the Fréchet distance of the entire curve, thus affecting the accuracy of the result. To solve this problem, the paper introduces two new variants of the Fréchet distance - the one - sided and two - sided \(k\)-outlier Fréchet distances. These new distances allow ignoring at most \(k\) outliers on each curve and calculate the optimal Fréchet distance given the number of vertices to be ignored. This not only improves the robustness to outliers but also enables considering outliers on both curves simultaneously, rather than just a single curve. In addition, the paper also proposes an efficient algorithm to calculate this new distance metric, solving the problem of high computational complexity in previous methods. By introducing the concept of "outlier - free space", the algorithm can determine the \(k\)-outlier Fréchet distance in polynomial time, thus providing an effective tool for dealing with errors in real - data. ### Main contributions: 1. **Introduction of \(k\)-outlier Fréchet distance**: Allows ignoring at most \(k\) outliers on each curve, improving the robustness to outliers. 2. **Efficient algorithm**: Proposes an efficient dynamic programming algorithm that can calculate the undirected \(k\)-outlier Fréchet distance in \(O(nmk^3)\) time and the directed \(k\)-outlier Fréchet distance in \(O(nmk^2)\) time. 3. **Theoretical analysis**: Proves the correctness and time complexity of the algorithm and verifies its effectiveness through experiments. Through these improvements, the paper provides a more robust and efficient solution for processing curve data with measurement errors in the real world.