Algorithm design and performance evaluation of sparse induced suffix sorting

Wenbo Wu,Ge Nong
DOI: https://doi.org/10.1016/j.ipm.2024.103777
IF: 7.466
2024-05-23
Information Processing & Management
Abstract:Sorting any m target suffixes of an input string X of n characters from a constant alphabet is a key task for building the sparse suffix array SSA(X) for index construction. A number of probabilistic and deterministic algorithms have been proposed for sorting sparse suffixes with varying time and space complexities, but only some experimental results are available for performance evaluation of these algorithms. We design a divide-and-conquer algorithm called sSAIS for computing SSA(X) in O(nlogmlog(n/m)) time and O(m) workspace by using the induced sorting principle, and conduct an experimental performance study on real and artificial datasets. This work reveals that to design an efficient deterministic algorithm for sorting sparse suffixes is a tough challenge and the density of target suffixes might be considered as a critical design parameter.
computer science, information systems,information science & library science
What problem does this paper attempt to address?