Sparse suffix tree construction in optimal time and space
From MaRDI portal
Publication:4575764
Abstract: Suffix tree (and the closely related suffix array) are fundamental structures capturing all substrings of a given text essentially by storing all its suffixes in the lexicographical order. In some applications, we work with a subset of interesting suffixes, which are stored in the so-called sparse suffix tree. Because the size of this structure is , it is natural to seek a construction algorithm using only words of space assuming read-only random access to the text. We design a linear-time Monte Carlo algorithm for this problem, hence resolving an open question explicitly stated by Bille et al. [TALG 2016]. The best previously known algorithm by I et al. [STACS 2014] works in time. Our solution proceeds in rounds; in the -th round, we consider all suffixes starting at positions congruent to modulo . By maintaining rolling hashes, we lexicographically sort all interesting suffixes starting at such positions, and then we merge them with the already considered suffixes. For efficient merging, we also need to answer LCE queries in small space. By plugging in the structure of Bille et al. [CPM 2015] we obtain time complexity. We improve this structure, which implies a linear-time sparse suffix tree construction algorithm. We complement our Monte Carlo algorithm with a deterministic verification procedure. The verification takes time, which improves upon the bound of obtained by I et al. [STACS 2014]. This is obtained by first observing that the pruning done inside the previous solution has a rather clean description using the notion of graph spanners with small multiplicative stretch. Then, we are able to decrease the verification time by applying difference covers twice. Combined with the Monte Carlo algorithm, this gives us an -time and -space Las Vegas algorithm.
Recommendations
Cited in
(14)- Space-efficient representation of truncated suffix trees, with applications to Markov order estimation
- Strictly in-place algorithms for permuting and inverting permutations
- Sparse suffix trees
- Practical Performance of Space Efficient Data Structures for Longest Common Extensions.
- scientific article; zbMATH DE number 2102777 (Why is no real title available?)
- Tight lower bounds for the longest common extension problem
- Sparse text indexing in small space
- Deterministic Sparse Suffix Sorting on Rewritable Texts
- From Ukkonen to McCreight and Weiner: A unifying view of linear-time suffix tree construction
- Faster sparse suffix sorting
- Sparse suffix tree construction in small space
- Universal compressed text indexing
- Space-efficient conversions from SLPs
- Sparse suffix and LCP array: simple, direct, small, and fast
This page was built for publication: Sparse suffix tree construction in optimal time and space
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q4575764)