Faster approximate string matching for short patterns

From MaRDI portal
Publication:692899

DOI10.1007/S00224-011-9322-YzbMATH Open1280.68304arXiv0811.3490OpenAlexW1821184416MaRDI QIDQ692899FDOQ692899

Philip Bille

Publication date: 6 December 2012

Published in: Theory of Computing Systems (Search for Journal in Brave)

Abstract: We study the classical approximate string matching problem, that is, given strings P and Q and an error threshold k, find all ending positions of substrings of Q whose edit distance to P is at most k. Let P and Q have lengths m and n, respectively. On a standard unit-cost word RAM with word size wgeqlogn we present an algorithm using time O(nk cdot min(frac{log^2 m}{log n},frac{log^2 mlog w}{w}) + n) When P is short, namely, m=2o(sqrtlogn) or m=2o(sqrtw/logw) this improves the previously best known time bounds for the problem. The result is achieved using a novel implementation of the Landau-Vishkin algorithm based on tabulation and word-level parallelism.


Full work available at URL: https://arxiv.org/abs/0811.3490




Recommendations




Cites Work


Cited In (4)





This page was built for publication: Faster approximate string matching for short patterns

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q692899)