genieclust (Q55119): Difference between revisions
From MaRDI portal
Property / last update: 17 January 2023
Property / software version identifier
Software version identifier | Publication date | Rank |
---|---|---|
0.9.3 | 30 July 2020 | Normal rank |
0.9.4 | 1 August 2020 | Normal rank |
0.9.7 | 7 January 2021 | Normal rank |
0.9.8 | 8 January 2021 | Normal rank |
1.0.0 | 22 April 2021 | Normal rank |
1.0.1 | 8 August 2022 | Normal rank |
1.1.0 | 5 September 2022 | Normal rank |
1.1.5-2 | 18 October 2023 | Normal rank |
Property / last update: 18 October 2023 / rank: Normal rank
Property / description: A retake on the Genie algorithm (Gagolewski, 2021 <doi:10.1016/j.softx.2021.100722>), a robust hierarchical clustering method (Gagolewski, Bartoszuk, Cena, 2016 <doi:10.1016/j.ins.2016.05.003>). Now faster and more memory efficient; determining the whole hierarchy for datasets of 10M points in low-dimensional Euclidean spaces or 100K points in high-dimensional ones takes only 1-2 minutes. Allows clustering with respect to mutual reachability distances so that it can act as a noise point detector or a robustified version of 'HDBSCAN*' (one that is able to detect a predefined number of clusters and hence does not depend on the somewhat fragile 'eps' parameter). The package also features an implementation of inequality indices (the Gini and Bonferroni indices), external cluster validity measures (e.g., the normalised clustering accuracy and partition similarity scores such as the adjusted Rand, Fowlkes-Mallows, adjusted mutual information, and the pair sets index), and internal cluster validity indices (e.g., the Calinski-Harabasz, Davies-Bouldin, Ball-Hall, Silhouette, and generalised Dunn indices). See also the 'Python' version of 'genieclust' available on 'PyPI', which supports sparse data, more metrics, and even larger datasets. / rank: Normal rank
Property / author: Marek Gagolewski / rank: Normal rank
Property / copyright license: GNU Affero General Public License, version 3.0 / rank: Normal rank
Property / imports: Rcpp / rank: Normal rank / qualifier: software version identifier: ≥ 1.0.4
Property / imports: stats / rank: Normal rank
Property / imports: utils / rank: Normal rank
Property / cites work: genieclust: Fast and robust hierarchical clustering / rank: Normal rank
Property / cites work: Genie: A new, fast, and outlier-resistant hierarchical clustering algorithm / rank: Normal rank
Property / MaRDI profile type: MaRDI software profile / rank: Normal rank
Property / source code repository: https://github.com/gagolews/genieclust / rank: Normal rank
Property / Software Heritage ID: swh:1:snp:bd79c9bd1fd0bcf4416e7a05262df8220ce6b7d7 / rank: Normal rank / qualifier: point in time: 17 March 2024
Latest revision as of 05:04, 22 March 2024
Fast and Robust Hierarchical Clustering with Noise Points Detection
Language | Label | Description | Also known as |
---|---|---|---|
English | genieclust | Fast and Robust Hierarchical Clustering with Noise Points Detection | |
Statements
last update: 18 October 2023
0 references
description: A retake on the Genie algorithm (Gagolewski, 2021 <doi:10.1016/j.softx.2021.100722>), a robust hierarchical clustering method (Gagolewski, Bartoszuk, Cena, 2016 <doi:10.1016/j.ins.2016.05.003>). Now faster and more memory efficient; determining the whole hierarchy for datasets of 10M points in low-dimensional Euclidean spaces or 100K points in high-dimensional ones takes only 1-2 minutes. Allows clustering with respect to mutual reachability distances so that it can act as a noise point detector or a robustified version of 'HDBSCAN*' (one that is able to detect a predefined number of clusters and hence does not depend on the somewhat fragile 'eps' parameter). The package also features an implementation of inequality indices (the Gini and Bonferroni indices), external cluster validity measures (e.g., the normalised clustering accuracy and partition similarity scores such as the adjusted Rand, Fowlkes-Mallows, adjusted mutual information, and the pair sets index), and internal cluster validity indices (e.g., the Calinski-Harabasz, Davies-Bouldin, Ball-Hall, Silhouette, and generalised Dunn indices). See also the 'Python' version of 'genieclust' available on 'PyPI', which supports sparse data, more metrics, and even larger datasets.
0 references
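Illustrative note: the description mentions inequality indices such as the Gini index and clustering with respect to mutual reachability distances. The plain-Python sketch below shows one common way to compute each. It is not the package's code. The function names `gini_index` and `mutual_reachability`, the parameter `k`, the normalisation by n - 1, and the convention that the core distance is the distance to the k-th nearest neighbour other than the point itself are all assumptions made for this example; implementations differ on these details.

```python
import math


def gini_index(values):
    """Normalised Gini index of non-negative values (assumed convention).

    Equals the sum of |x_i - x_j| over all pairs i < j divided by
    (n - 1) * sum(x): 0 when all values are equal, 1 when a single
    value holds the whole total.
    """
    xs = sorted(values)  # ascending order
    n = len(xs)
    total = sum(xs)
    if n < 2 or total <= 0 or xs[0] < 0:
        raise ValueError("need at least two non-negative values with a positive sum")
    # For ascending x_(1..n), the pairwise sum equals sum((2i - n - 1) * x_(i)).
    weighted = sum((2 * i - n - 1) * x for i, x in enumerate(xs, start=1))
    return weighted / ((n - 1) * total)


def mutual_reachability(points, k):
    """Mutual reachability distance matrix for a small list of points.

    d_k(i, j) = max(d(i, j), core(i), core(j)), where core(i) is the
    Euclidean distance from point i to its k-th nearest neighbour,
    not counting the point itself (assumed convention).
    """
    n = len(points)
    if not 1 <= k < n:
        raise ValueError("k must satisfy 1 <= k < number of points")
    dist = [[math.dist(p, q) for q in points] for p in points]
    # After sorting a row, index 0 is the zero self-distance,
    # so index k is the k-th nearest other point.
    core = [sorted(row)[k] for row in dist]
    return [
        [0.0 if i == j else max(dist[i][j], core[i], core[j]) for j in range(n)]
        for i in range(n)
    ]
```

For instance, `gini_index([1, 2, 3, 4])` evaluates to 1/3 under this normalisation. In `mutual_reachability([(0, 0), (1, 0), (2, 0), (10, 0)], k=1)`, the isolated point (10, 0) has core distance 8, so its distance to every other point is at least 8. This inflation of distances around sparse points is what lets the measure help flag noise points.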