parallelpam (Q82394)

From MaRDI portal

Parallel Partitioning-Around-Medoids (PAM) for Big Sets of Data
Language | Label | Description | Also known as
default for all languages | No label defined | |
English | parallelpam | Parallel Partitioning-Around-Medoids (PAM) for Big Sets of Data |

      Statements

      1.0.1 (8 January 2023)
      1.0 (24 November 2022)
      1.3 (23 June 2023)
      1.4 (9 October 2023)

      9 October 2023

      Application of the Partitioning-Around-Medoids (PAM) clustering algorithm described in Schubert, E. and Rousseeuw, P.J.: "Fast and eager k-medoids clustering: O(k) runtime improvement of the PAM, CLARA, and CLARANS algorithms." Information Systems, vol. 101, p. 101804, (2021). <doi:10.1016/j.is.2021.101804>. It uses a binary format for storing and retrieving matrices that was developed for the 'jmatrix' package, but the functionality of 'jmatrix' is included here, so you do not need to install it. Also, it is used by package 'scellpam', so if you have installed 'scellpam', you do not need to install this package. PAM can be applied to sets of data whose dissimilarity matrix can be very big; it has been tested with up to 100,000 points. It does this with the help of the code developed for another package, 'jmatrix', which allows the matrix not to be loaded into 'R' memory (which would force it to be of double type) but to be read from disk, which allows using float (or even smaller data types). Moreover, the dissimilarity matrix is calculated in parallel if the computer has several cores, so it can open many threads. The initial part of the PAM algorithm can be done with the BUILD or LAB algorithms; the BUILD algorithm has been implemented in parallel. The optimization phase implements the FastPAM1 algorithm, also in parallel. Finally, calculation of the silhouette is available and is also implemented in parallel.
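
The two phases named in the description (a BUILD initialization followed by a swap-based optimization) can be illustrated with a minimal sketch. This is not the package's code or its API: it is a plain, single-threaded Python version of classic PAM with a naive swap search, without the FastPAM1 speed-up and without the disk-backed matrix. All function names (`total_cost`, `build`, `swap`, `assign`) and the toy data are hypothetical, chosen only for illustration.

```python
# Illustrative sketch only: plain-Python PAM (BUILD + naive SWAP).
# Not the parallel FastPAM1 implementation of the package; all names are hypothetical.


def total_cost(d, medoids):
    """Sum over all points of the dissimilarity to the nearest medoid."""
    return sum(min(d[i][m] for m in medoids) for i in range(len(d)))


def build(d, k):
    """BUILD phase: greedily add the point that lowers the total cost most."""
    n = len(d)
    # First medoid: the point with the smallest summed dissimilarity to all points.
    medoids = [min(range(n), key=lambda c: sum(d[i][c] for i in range(n)))]
    while len(medoids) < k:
        candidates = [c for c in range(n) if c not in medoids]
        medoids.append(min(candidates, key=lambda c: total_cost(d, medoids + [c])))
    return medoids


def swap(d, medoids):
    """SWAP phase: apply the best medoid/non-medoid exchange until none improves."""
    n = len(d)
    medoids = list(medoids)
    best = total_cost(d, medoids)
    while True:
        best_move = None
        for pos in range(len(medoids)):
            for c in range(n):
                if c in medoids:
                    continue
                trial = medoids[:pos] + [c] + medoids[pos + 1:]
                cost = total_cost(d, trial)
                if cost < best:  # strict improvement guarantees termination
                    best, best_move = cost, trial
        if best_move is None:
            return medoids, best
        medoids = best_move


def assign(d, medoids):
    """Label every point with the index of its nearest medoid."""
    return [min(medoids, key=lambda m: d[i][m]) for i in range(len(d))]


# Toy data: six points on a line, forming two obvious groups.
points = [0.0, 1.0, 2.0, 10.0, 11.0, 12.0]
d = [[abs(a - b) for b in points] for a in points]  # full dissimilarity matrix

initial = build(d, 2)
medoids, cost = swap(d, initial)
labels = assign(d, medoids)
print(sorted(medoids), cost, labels)
```

On this toy input the swap phase moves one medoid to the centre of its group, so the final medoids are the middle points of the two groups. The sketch holds the whole matrix in memory as Python floats, which is exactly what the package avoids for large inputs by reading a compact on-disk matrix.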