Probably certifiably correct \(k\)-means clustering (Q1675259)
From MaRDI portal
scientific article
Language | Label | Description | Also known as |
---|---|---|---|
English | Probably certifiably correct \(k\)-means clustering |
scientific article |
Statements
Probably certifiably correct \(k\)-means clustering (English)
0 references
27 October 2017
0 references
Clustering is a central problem in unsupervised machine learning. The aim of \(k\)-means clustering is the partitioning of a given finite set \(\{ x_i \}_{i=1}^N\) into \(k\) subsets. A definition of a probably certificably correct algorithm is given. First, it is proven that Peng and Wei's semidefinite relaxation of \(k\)-means [\textit{J. Peng} and \textit{Y. Wei}, SIAM J. Optim. 18, No. 1, 186--205 (2007; Zbl 1146.90046)] is tight. It is shown how to test the optimality of a proposed \(k\)-means solution using this dual certificate in quasilinear time. It typically plants clusters under the stochastic ball method. A version of spectral clusteing is designed to solve \(k\)-means in the case of two clusters.
0 references
tightness
0 references
\(k\)-means clustering
0 references
probably certifiably correct algorithm
0 references
Peng-Wei semidefinitaion relaxation
0 references
machine learning
0 references
algorithm
0 references
stochastic ball method
0 references