Fast Algorithms for Computing the Statistics of Pattern Matching

Danna Zhang,Kai Jin
DOI: https://doi.org/10.1109/access.2021.3105607
IF: 3.9
2021-01-01
IEEE Access
Abstract:Pattern matching is a fundamental problem in theoretical computer science. The algorithms for pattern matching and the study on the statistics of pattern matching have found enormous applications in practical fields. In this paper, we revisit the Markov embedding approach for studying pattern matching in repeated experiments. For any pattern of length $m$ over alphabet $\Sigma $ , we show that the mean and variance of the waiting time of the pattern in iid experiments can be computed in $O(m)$ time based on Markov embedding technique, improving over the $O(|\Sigma | \cdot m)$ and $O(m^{2})$ naïve bounds. Our method extends to computing the $k$ -th moment of the waiting time, and it extends to computing other related statistics about pattern matching in repeated experiments, and it also extends to the case of Markov dependent experiments.
computer science, information systems,telecommunications,engineering, electrical & electronic
What problem does this paper attempt to address?