stringdist (Q45722): Difference between revisions
From MaRDI portal
Removed claim: depends on software (P342): Item:Q13519 |
Changed an Item |
||
Property / depends on software | |||
Property / depends on software: R / rank | |||
Normal rank | |||
Property / depends on software: R / qualifier | |||
software version identifier: ≥ 2.15.3 |
Revision as of 12:53, 4 March 2024
Approximate String Matching, Fuzzy Text Search, and String Distance Functions
Language | Label | Description | Also known as |
---|---|---|---|
English | stringdist |
Approximate String Matching, Fuzzy Text Search, and String Distance Functions |
Statements
28 November 2023
0 references
Implements an approximate string matching version of R's native 'match' function. Also offers fuzzy text search based on various string distance measures. Can calculate various string distances based on edits (Damerau-Levenshtein, Hamming, Levenshtein, optimal sting alignment), qgrams (q- gram, cosine, jaccard distance) or heuristic metrics (Jaro, Jaro-Winkler). An implementation of soundex is provided as well. Distances can be computed between character vectors while taking proper care of encoding or between integer vectors representing generic sequences. This package is built for speed and runs in parallel by using 'openMP'. An API for C or C++ is exposed as well. Reference: MPJ van der Loo (2014) <doi:10.32614/RJ-2014-011>.
0 references