Fast, parallel, and cache-friendly suffix array construction

Jamshed Khan,Tobias Rubel,Erin Molloy,Laxman Dhulipala,Rob Patro
DOI: https://doi.org/10.1186/s13015-024-00263-5
2024-04-30
Algorithms for Molecular Biology
Abstract:String indexes such as the suffix array ( sa ) and the closely related longest common prefix ( lcp ) array are fundamental objects in bioinformatics and have a wide variety of applications. Despite their importance in practice, few scalable parallel algorithms for constructing these are known, and the existing algorithms can be highly non-trivial to implement and parallelize.
biochemical research methods,biotechnology & applied microbiology,mathematical & computational biology
What problem does this paper attempt to address?