refinr (Q110667)

From MaRDI portal





Cluster and Merge Similar Values Within a Character Vector
Language Label Description Also known as
default for all languages
No label defined
    English
    refinr
    Cluster and Merge Similar Values Within a Character Vector

      Statements

      0 references
      0.3.2
      24 April 2022
      0 references
      0.2.0
      5 January 2018
      0 references
      0.3.0
      5 May 2018
      0 references
      0.3.1
      17 June 2018
      0 references
      0.3.3
      12 November 2023
      0 references
      0 references
      0 references
      12 November 2023
      0 references
      These functions take a character vector as input, identify and cluster similar values, and then merge clusters together so their values become identical. The functions are an implementation of the key collision and ngram fingerprint algorithms from the open source tool Open Refine <https://openrefine.org/>. More info on key collision and ngram fingerprint can be found here <https://openrefine.org/docs/technical-reference/clustering-in-depth>.
      0 references
      0 references
      0 references
      0 references
      0 references

      Identifiers

      0 references