Probably certifiably correct \(k\)-means clustering (Q1675259)

From MaRDI portal
scientific article
Language Label Description Also known as
English
Probably certifiably correct \(k\)-means clustering
scientific article

    Statements

    Probably certifiably correct \(k\)-means clustering (English)
    0 references
    0 references
    0 references
    0 references
    0 references
    27 October 2017
    0 references
    Clustering is a central problem in unsupervised machine learning. The aim of \(k\)-means clustering is the partitioning of a given finite set \(\{ x_i \}_{i=1}^N\) into \(k\) subsets. A definition of a probably certificably correct algorithm is given. First, it is proven that Peng and Wei's semidefinite relaxation of \(k\)-means [\textit{J. Peng} and \textit{Y. Wei}, SIAM J. Optim. 18, No. 1, 186--205 (2007; Zbl 1146.90046)] is tight. It is shown how to test the optimality of a proposed \(k\)-means solution using this dual certificate in quasilinear time. It typically plants clusters under the stochastic ball method. A version of spectral clusteing is designed to solve \(k\)-means in the case of two clusters.
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    tightness
    0 references
    \(k\)-means clustering
    0 references
    probably certifiably correct algorithm
    0 references
    Peng-Wei semidefinitaion relaxation
    0 references
    machine learning
    0 references
    algorithm
    0 references
    stochastic ball method
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references