Augmented Thresholds for MONI

César Martínez-Guardiola,Nathaniel K. Brown,Fernando Silva-Coira,Dominik Köppl,Travis Gagie,Susana Ladra
DOI: https://doi.org/10.48550/arXiv.2211.07794
2022-11-14
Data Structures and Algorithms
Abstract:MONI (Rossi et al., 2022) can store a pangenomic dataset T in small space and later, given a pattern P, quickly find the maximal exact matches (MEMs) of P with respect to T. In this paper we consider its one-pass version (Boucher et al., 2021), whose query times are dominated in our experiments by longest common extension (LCE) queries. We show how a small modification lets us avoid most of these queries and thus significantly speeds up MONI in practice while only slightly increasing its size.
What problem does this paper attempt to address?