refinr (Q110667): Difference between revisions
From MaRDI portal
Removed claim: imports (P585): stringdist (Q45722) |
Added link to MaRDI item. |
||||||||||||||
(5 intermediate revisions by 2 users not shown) | |||||||||||||||
Property / last update | |||||||||||||||
| |||||||||||||||
Property / last update: 24 April 2022 / rank | |||||||||||||||
Property / imports | |||||||||||||||
Property / imports: stringi / rank | |||||||||||||||
Property / software version identifier | |||||||||||||||
0.2.0 | |||||||||||||||
Property / software version identifier: 0.2.0 / rank | |||||||||||||||
Normal rank | |||||||||||||||
Property / software version identifier: 0.2.0 / qualifier | |||||||||||||||
publication date: 5 January 2018
| |||||||||||||||
Property / software version identifier | |||||||||||||||
0.3.0 | |||||||||||||||
Property / software version identifier: 0.3.0 / rank | |||||||||||||||
Normal rank | |||||||||||||||
Property / software version identifier: 0.3.0 / qualifier | |||||||||||||||
publication date: 5 May 2018
| |||||||||||||||
Property / software version identifier | |||||||||||||||
0.3.1 | |||||||||||||||
Property / software version identifier: 0.3.1 / rank | |||||||||||||||
Normal rank | |||||||||||||||
Property / software version identifier: 0.3.1 / qualifier | |||||||||||||||
publication date: 17 June 2018
| |||||||||||||||
Property / software version identifier | |||||||||||||||
0.3.3 | |||||||||||||||
Property / software version identifier: 0.3.3 / rank | |||||||||||||||
Normal rank | |||||||||||||||
Property / software version identifier: 0.3.3 / qualifier | |||||||||||||||
publication date: 12 November 2023
| |||||||||||||||
Property / last update | |||||||||||||||
12 November 2023
| |||||||||||||||
Property / last update: 12 November 2023 / rank | |||||||||||||||
Normal rank | |||||||||||||||
Property / description | |||||||||||||||
These functions take a character vector as input, identify and cluster similar values, and then merge clusters together so their values become identical. The functions are an implementation of the key collision and ngram fingerprint algorithms from the open source tool Open Refine <https://openrefine.org/>. More info on key collision and ngram fingerprint can be found here <https://openrefine.org/docs/technical-reference/clustering-in-depth>. | |||||||||||||||
Property / description: These functions take a character vector as input, identify and cluster similar values, and then merge clusters together so their values become identical. The functions are an implementation of the key collision and ngram fingerprint algorithms from the open source tool Open Refine <https://openrefine.org/>. More info on key collision and ngram fingerprint can be found here <https://openrefine.org/docs/technical-reference/clustering-in-depth>. / rank | |||||||||||||||
Normal rank | |||||||||||||||
Property / author | |||||||||||||||
Property / author: Chris Muir / rank | |||||||||||||||
Normal rank | |||||||||||||||
Property / copyright license | |||||||||||||||
Property / copyright license: GNU General Public License, version 3.0 / rank | |||||||||||||||
Normal rank | |||||||||||||||
Property / imports | |||||||||||||||
Property / imports: Rcpp / rank | |||||||||||||||
Normal rank | |||||||||||||||
Property / imports | |||||||||||||||
Property / imports: stringdist / rank | |||||||||||||||
Normal rank | |||||||||||||||
Property / imports: stringdist / qualifier | |||||||||||||||
software version identifier: ≥ 0.9.5.1 | |||||||||||||||
Property / imports | |||||||||||||||
Property / imports: stringi / rank | |||||||||||||||
Normal rank | |||||||||||||||
Property / depends on software | |||||||||||||||
Property / depends on software: R / rank | |||||||||||||||
Normal rank | |||||||||||||||
Property / depends on software: R / qualifier | |||||||||||||||
software version identifier: ≥ 3.0.2 | |||||||||||||||
Property / MaRDI profile type | |||||||||||||||
Property / MaRDI profile type: MaRDI software profile / rank | |||||||||||||||
Normal rank | |||||||||||||||
links / mardi / name | links / mardi / name | ||||||||||||||
Latest revision as of 19:56, 12 March 2024
Cluster and Merge Similar Values Within a Character Vector
Language | Label | Description | Also known as |
---|---|---|---|
English | refinr |
Cluster and Merge Similar Values Within a Character Vector |
Statements
12 November 2023
0 references
These functions take a character vector as input, identify and cluster similar values, and then merge clusters together so their values become identical. The functions are an implementation of the key collision and ngram fingerprint algorithms from the open source tool Open Refine <https://openrefine.org/>. More info on key collision and ngram fingerprint can be found here <https://openrefine.org/docs/technical-reference/clustering-in-depth>.
0 references