Improved space-time tradeoffs for approximate full-text indexing with one edit error

From MaRDI portal
Publication:494804

DOI10.1007/S00453-014-9873-9zbMATH Open1328.68321arXiv1103.2167OpenAlexW1808478258MaRDI QIDQ494804FDOQ494804


Authors: Djamal Belazzougui Edit this on Wikidata


Publication date: 2 September 2015

Published in: Algorithmica (Search for Journal in Brave)

Abstract: In this paper we are interested in indexing texts for substring matching queries with one edit error. That is, given a text T of n characters over an alphabet of size sigma, we are asked to build a data structure that answers the following query: find all the occ substrings of the text that are at edit distance at most 1 from a given string q of length m. In this paper we show two new results for this problem. The first result, suitable for an unbounded alphabet, uses O(nlogepsilonn) (where epsilon is any constant such that 0<epsilon<1) words of space and answers to queries in time O(m+occ). This improves simultaneously in space and time over the result of Cole et al. The second result, suitable only for a constant alphabet, relies on compressed text indices and comes in two variants: the first variant uses O(nlogepsilonn) bits of space and answers to queries in time O(m+occ), while the second variant uses O(nloglogn) bits of space and answers to queries in time O((m+occ)loglogn). This second result improves on the previously best results for constant alphabets achieved in Lam et al. (Algorithmica 2008) and Chan et al. (Algorithmica 2010).


Full work available at URL: https://arxiv.org/abs/1103.2167




Recommendations




Cites Work


Cited In (5)





This page was built for publication: Improved space-time tradeoffs for approximate full-text indexing with one edit error

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q494804)