On string matching with mismatches (Q1736652)
From MaRDI portal
| This is the item page for this Wikibase entity, intended for internal use and editing purposes. Please use this page instead for the normal view: On string matching with mismatches |
scientific article
| Language | Label | Description | Also known as |
|---|---|---|---|
| default for all languages | No label defined |
||
| English | On string matching with mismatches |
scientific article |
Statements
On string matching with mismatches (English)
0 references
26 March 2019
0 references
Summary: In this paper, we consider several variants of the pattern matching with mismatches problem. In particular, given a text \(T=t_1 t_2\cdots t_n\) and a pattern \(P=p_1 p_2\cdots p_m\), we investigate the following problems: (1) pattern matching with mismatches: for every \(i\), \(1 \leq i \leq n-m+1\) output, the distance between \(P\) and \(t_i t_{i+1}\cdots t_{i+m-1}\); and (2) pattern matching with \(k\) mismatches: output those positions \(i\) where the distance between \(P\) and \(t_i t_{i+1}\cdots t_{i+m-1}\) is less than a given threshold \(k\). The distance metric used is the Hamming distance. We present some novel algorithms and techniques for solving these problems. We offer deterministic, randomized and approximation algorithms. We consider variants of these problems where there could be wild cards in either the text or the pattern or both. We also present an experimental evaluation of these algorithms. The source code is available at \url{http://www.engr.uconn.edu/~man09004/kmis.zip}.
0 references
pattern matching with mismatches
0 references
k-mismatches problem
0 references
approximate counting of mismatches
0 references
0 references
0.8958490490913391
0 references
0.8946316838264465
0 references
0.8797699213027954
0 references
0.8776193261146545
0 references