Faster approximate string matching for short patterns

From MaRDI portal




Abstract: We study the classical approximate string matching problem, that is, given strings P and Q and an error threshold k, find all ending positions of substrings of Q whose edit distance to P is at most k. Let P and Q have lengths m and n, respectively. On a standard unit-cost word RAM with word size wgeqlogn we present an algorithm using time O(nk cdot min(frac{log^2 m}{log n},frac{log^2 mlog w}{w}) + n) When P is short, namely, m=2o(sqrtlogn) or m=2o(sqrtw/logw) this improves the previously best known time bounds for the problem. The result is achieved using a novel implementation of the Landau-Vishkin algorithm based on tabulation and word-level parallelism.









This page was built for publication: Faster approximate string matching for short patterns

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q692899)