Approximate string matching with compressed indexes (Q1662494): Difference between revisions

From MaRDI portal
Added link to MaRDI item.
ReferenceBot (talk | contribs)
Changed an Item
 
(5 intermediate revisions by 4 users not shown)
Property / author
 
Property / author: Pedro Morales-Almazan / rank
Normal rank
 
Property / author
 
Property / author: Pedro Morales-Almazan / rank
 
Normal rank
Property / MaRDI profile type
 
Property / MaRDI profile type: MaRDI publication profile / rank
 
Normal rank
Property / full work available at URL
 
Property / full work available at URL: https://doi.org/10.3390/a2031105 / rank
 
Normal rank
Property / OpenAlex ID
 
Property / OpenAlex ID: W2121806125 / rank
 
Normal rank
Property / Wikidata QID
 
Property / Wikidata QID: Q58883988 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Average-optimal single and multiple approximate string matching / rank
 
Normal rank
Property / cites work
 
Property / cites work: Dictionary matching and indexing with errors and don't cares / rank
 
Normal rank
Property / cites work
 
Property / cites work: Combinatorial Pattern Matching / rank
 
Normal rank
Property / cites work
 
Property / cites work: A Linear Size Index for Approximate Pattern Matching / rank
 
Normal rank
Property / cites work
 
Property / cites work: Suffix Arrays: A New Method for On-Line String Searches / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4661875 / rank
 
Normal rank
Property / cites work
 
Property / cites work: A sublinear algorithm for approximate keyword searching / rank
 
Normal rank
Property / cites work
 
Property / cites work: Indexing text with approximate \(q\)-grams / rank
 
Normal rank
Property / cites work
 
Property / cites work: Compressed representations of sequences and full-text indexes / rank
 
Normal rank
Property / cites work
 
Property / cites work: An analysis of the Burrows—Wheeler transform / rank
 
Normal rank
Property / cites work
 
Property / cites work: Compression of individual sequences via variable-rate coding / rank
 
Normal rank
Property / cites work
 
Property / cites work: Indexing compressed text / rank
 
Normal rank
Property / cites work
 
Property / cites work: Indexing text using the Ziv--Lempel trie / rank
 
Normal rank
Property / cites work
 
Property / cites work: Reducing the Space Requirement of LZ-Index / rank
 
Normal rank
Property / cites work
 
Property / cites work: New text indexing functionalities of the compressed suffix arrays / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4471381 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Compressed Suffix Arrays and Suffix Trees with Applications to Text Indexing and String Matching / rank
 
Normal rank
Property / cites work
 
Property / cites work: Compressed suffix trees with full functionality / rank
 
Normal rank
Property / cites work
 
Property / cites work: An(other) Entropy-Bounded Compressed Suffix Tree / rank
 
Normal rank
Property / cites work
 
Property / cites work: Fully-Compressed Suffix Trees / rank
 
Normal rank
Property / cites work
 
Property / cites work: Compressed text indexes / rank
 
Normal rank
Property / cites work
 
Property / cites work: Combinatorial Pattern Matching / rank
 
Normal rank
Property / cites work
 
Property / cites work: Algorithms and Computation / rank
 
Normal rank
Property / cites work
 
Property / cites work: Improving an algorithm for approximate pattern matching / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3690245 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Algorithms on Strings, Trees and Sequences / rank
 
Normal rank
Property / cites work
 
Property / cites work: On the Complexity of Finite Sequences / rank
 
Normal rank
Property / cites work
 
Property / cites work: A universal algorithm for sequential data compression / rank
 
Normal rank
Property / cites work
 
Property / cites work: Finding approximate patterns in strings / rank
 
Normal rank
Property / cites work
 
Property / cites work: Incremental String Comparison / rank
 
Normal rank
Property / cites work
 
Property / cites work: Linear bidirectional on-line construction of affix trees / rank
 
Normal rank
Property / cites work
 
Property / cites work: Implementing the LZ-index / rank
 
Normal rank
Property / cites work
 
Property / cites work: Dynamic Rank-Select Structures with Applications to Run-Length Encoded Texts / rank
 
Normal rank
Property / cites work
 
Property / cites work: A fast bit-vector algorithm for approximate string matching based on dynamic programming / rank
 
Normal rank
Property / cites work
 
Property / cites work: Very fast and simple approximate string matching / rank
 
Normal rank
Property / cites work
 
Property / cites work: Succinct Indexable Dictionaries with Applications to Encoding $k$-ary Trees, Prefix Sums and Multisets / rank
 
Normal rank
Property / cites work
 
Property / cites work: Dynamic entropy-compressed sequences and full-text indexes / rank
 
Normal rank

Latest revision as of 10:20, 16 July 2024

scientific article
Language Label Description Also known as
English
Approximate string matching with compressed indexes
scientific article

    Statements

    Approximate string matching with compressed indexes (English)
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    20 August 2018
    0 references
    Summary: A compressed full-text self-index for a text \(T\) is a data structure requiring reduced space and able to search for patterns \(P\) in \(T\). It can also reproduce any substring of \(T\), thus actually replacing \(T\). Despite the recent explosion of interest on compressed indexes, there has not been much progress on functionalities beyond the basic exact search. In this paper we focus on indexed approximate string matching (ASM), which is of great interest, say, in bioinformatics. We study ASM algorithms for Lempel-Ziv compressed indexes and for compressed suffix trees/arrays. Most compressed self-indexes belong to one of these classes. We start by adapting the classical method of partitioning into exact search to self-indexes, and optimize it over a representative of either class of self-index. Then, we show that a Lempel-Ziv index can be seen as an extension of the classical \(q\)-samples index. We give new insights on this type of index, which can be of independent interest, and then apply them to a Lempel-Ziv index. Finally, we improve hierarchical verification, a successful technique for sequential searching, so as to extend the matches of pattern pieces to the left or right. Most compressed suffix trees/arrays support the required bidirectionality, thus enabling the implementation of the improved technique. In turn, the improved verification largely reduces the accesses to the text, which are expensive in self-indexes. We show experimentally that our algorithms are competitive and provide useful space-time tradeoffs compared to classical indexes.
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    compressed index
    0 references
    approximate string matching
    0 references
    Lempel-Ziv index
    0 references
    compressed suffix tree
    0 references
    compressed suffix array
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references