Space-efficient string indexing for wildcard pattern matching

From MaRDI portal
Publication:2965512

DOI10.4230/LIPICS.STACS.2014.506zbMATH Open1359.68339arXiv1401.0625OpenAlexW2964110985MaRDI QIDQ2965512FDOQ2965512


Authors: Yakov Nekrich, Moshe Lewenstein, Jeffrey Scott Vitter Edit this on Wikidata


Publication date: 3 March 2017

Abstract: In this paper we describe compressed indexes that support pattern matching queries for strings with wildcards. For a constant size alphabet our data structure uses O(nlogvarepsilonn) bits for any varepsilon>0 and reports all mathrmocc occurrences of a wildcard string in O(m+sigmagcdotmu(n)+mathrmocc) time, where mu(n)=o(logloglogn), sigma is the alphabet size, m is the number of alphabet symbols and g is the number of wildcard symbols in the query string. We also present an O(n)-bit index with O((m+sigmag+mathrmocc)logvarepsilonn) query time and an O(n(loglogn)2)-bit index with O((m+sigmag+mathrmocc)loglogn) query time. These are the first non-trivial data structures for this problem that need o(nlogn) bits of space.


Full work available at URL: https://arxiv.org/abs/1401.0625




Recommendations





Cited In (13)





This page was built for publication: Space-efficient string indexing for wildcard pattern matching

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q2965512)