Sample-Based Distance-Approximation for Subsequence-Freeness

Cohen Sidon, Omer
DOI: https://doi.org/10.1007/s00453-024-01233-4
IF: 0.909
2024-05-14
Algorithmica
Abstract:In this work, we study the problem of approximating the distance to subsequence-freeness in the sample-based distribution-free model. For a given subsequence (word) , a sequence (text) is said to contain w if there exist indices such that for every . Otherwise, T is w -free. Ron and Rosin (ACM Trans Comput Theory 14(4):1–31, 2022) showed that the number of samples both necessary and sufficient for one-sided error testing of subsequence-freeness in the sample-based distribution-free model is . Denoting by the distance of T to w -freeness under a distribution , we are interested in obtaining an estimate , such that with probability at least 2/3, for a given error parameter . Our main result is a sample-based distribution-free algorithm whose sample complexity is . We first present an algorithm that works when the underlying distribution p is uniform, and then show how it can be modified to work for any (unknown) distribution p . We also show that a quadratic dependence on is necessary.
computer science, software engineering,mathematics, applied
What problem does this paper attempt to address?