Efficient algorithms for agglomerative hierarchical clustering methods (Q1057599)

From MaRDI portal
Revision as of 10:38, 30 July 2024 by Openalex240730090724 (talk | contribs) (Set OpenAlex properties.)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)





scientific article
Language Label Description Also known as
English
Efficient algorithms for agglomerative hierarchical clustering methods
scientific article

    Statements

    Efficient algorithms for agglomerative hierarchical clustering methods (English)
    0 references
    0 references
    0 references
    1984
    0 references
    Whenever n objects are characterized by a matrix of pairwise dissimilarities, they may be clustered by any of a number of sequential, agglomerative, hierarchical, nonoverlapping clustering methods. These SAHN clustering methods are defined by a paradigmatic algorithm that usually requires \(O(n^ 3)\) time, in the worst case, to cluster the objects. We describe a SAHN clustering algorithm that requires \(O(n^ 2 \log n)\) time in the worst case. When SAHN clustering methods exhibit reasonable space distortion properties, further improvements are possible. We adapt a SAHN clustering algorithm, based on the efficient construction of nearest neighbor chains, to obtain a reasonably general SAHN clustering algorithm that requires in the worst case \(O(n^ 2)\) time and space. Whenever n objects are characterized by k-tuples of real numbers, they may be clustered by any of a family of a centroid SAHN clustering methods. These methods are based on a geometric model in which clusters are represented by points in k-dimensional real space and points being agglomerated are replaced by a single (centroid) point. For this model, we have solved a class of special packing problems involving point- symmetric convex objects and have exploited it to design an efficient centroid clustering algorithm.
    0 references
    algorithm design
    0 references
    sequential, agglomerative, hierarchical, nonoverlapping clustering methods
    0 references
    SAHN clustering methods
    0 references
    nearest neighbor chains
    0 references
    centroid SAHN clustering methods
    0 references
    geometric model
    0 references
    packing problems
    0 references

    Identifiers

    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references