Faster longest common extension queries in strings over general alphabets

From MaRDI portal
Publication:5369537

DOI10.4230/LIPICS.CPM.2016.5zbMATH Open1380.68474arXiv1602.00447OpenAlexW2963452431MaRDI QIDQ5369537FDOQ5369537


Authors: Paweł Gawrychowski, Tomasz Kociumaka, Wojciech Rytter, Tomasz Waleń Edit this on Wikidata


Publication date: 17 October 2017

Abstract: Longest common extension queries (often called longest common prefix queries) constitute a fundamental building block in multiple string algorithms, for example computing runs and approximate pattern matching. We show that a sequence of q LCE queries for a string of size n over a general ordered alphabet can be realized in O(qloglogn+nlogn) time making only O(q+n) symbol comparisons. Consequently, all runs in a string over a general ordered alphabet can be computed in O(nloglogn) time making O(n) symbol comparisons. Our results improve upon a solution by Kosolobov (Information Processing Letters, 2016), who gave an algorithm with O(nlog2/3n) running time and conjectured that O(n) time is possible. We make a significant progress towards resolving this conjecture. Our techniques extend to the case of general unordered alphabets, when the time increases to O(qlogn+nlogn). The main tools are difference covers and the disjoint-sets data structure.


Full work available at URL: https://arxiv.org/abs/1602.00447




Recommendations





Cited In (15)





This page was built for publication: Faster longest common extension queries in strings over general alphabets

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q5369537)