Fingerprints in compressed strings

From MaRDI portal
Publication:2396828

DOI10.1007/978-3-642-40104-6_13zbMATH Open1370.68060DBLPjournals/jcss/BilleGCSVV17arXiv1305.2777OpenAlexW2577924862WikidataQ60554414 ScholiaQ60554414MaRDI QIDQ2396828FDOQ2396828

Patrick Hagge Cording, Inge Li Gørtz, Philip Bille, Hjalte Wedel Vildhøj, Søren Vind, Benjamin Sach

Publication date: 26 May 2017

Published in: Journal of Computer and System Sciences, Lecture Notes in Computer Science (Search for Journal in Brave)

Abstract: The Karp-Rabin fingerprint of a string is a type of hash value that due to its strong properties has been used in many string algorithms. In this paper we show how to construct a data structure for a string S of size N compressed by a context-free grammar of size n that answers fingerprint queries. That is, given indices i and j, the answer to a query is the fingerprint of the substring S[i,j]. We present the first O(n) space data structures that answer fingerprint queries without decompressing any characters. For Straight Line Programs (SLP) we get O(logN) query time, and for Linear SLPs (an SLP derivative that captures LZ78 compression and its variations) we get O(loglogN) query time. Hence, our data structures has the same time and space complexity as for random access in SLPs. We utilize the fingerprint data structures to solve the longest common extension problem in query time O(logNloglce) and O(loglcelogloglce+loglogN) for SLPs and Linear SLPs, respectively. Here, lce denotes the length of the LCE.


Full work available at URL: https://arxiv.org/abs/1305.2777




Recommendations




Cites Work


Cited In (8)





This page was built for publication: Fingerprints in compressed strings

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q2396828)