Clustering stability: an overview
From MaRDI portal
Publication:3569376
DOI10.1561/2200000008zbMATH Open1191.68615DBLPjournals/ftml/Luxburg09arXiv1007.1075OpenAlexW3105761396WikidataQ57408120 ScholiaQ57408120MaRDI QIDQ3569376FDOQ3569376
Authors: Ulrike Von Luxburg
Publication date: 18 June 2010
Published in: Foundations and Trends® in Machine Learning (Search for Journal in Brave)
Abstract: A popular method for selecting the number of clusters is based on stability arguments: one chooses the number of clusters such that the corresponding clustering results are "most stable". In recent years, a series of papers has analyzed the behavior of this method from a theoretical point of view. However, the results are very technical and difficult to interpret for non-experts. In this paper we give a high-level overview about the existing literature on clustering stability. In addition to presenting the results in a slightly informal but accessible way, we relate them to each other and discuss their different implications.
Full work available at URL: https://arxiv.org/abs/1007.1075
Recommendations
Learning and adaptive systems in artificial intelligence (68T05) Pattern recognition, speech recognition (68T10)
Cited In (40)
- Estimating the number of clusters via a corrected clustering instability
- An automatic and stable clustering algorithm
- Title not available (Why is that?)
- Clustering stability-based evolutionary K-means
- Detecting Lagrangian coherent structures from sparse and noisy trajectory data
- On the stability of hierarchical classification: qualitative approaches
- Title not available (Why is that?)
- On the discrepancy between Kleinberg's clustering axioms and \(k\)-means clustering algorithm behavior
- Measuring the stability of spectral clustering
- Stability-Based Validation of Clustering Solutions
- Richness fallacy
- A Sober Look at Clustering Stability
- Selection of the number of clusters via the bootstrap method
- Stability estimation for unsupervised clustering: a review
- Precision medicine
- Detecting communities in attributed networks through bi-direction penalized clustering and its application
- Data stability in clustering: a closer look
- Optimality-based clustering: an inverse optimization approach
- Riding down the Bay: space-time clustering of ecological trends
- Likelihood Inference for Large Scale Stochastic Blockmodels With Covariates Based on a Divide-and-Conquer Parallelizable Algorithm With Communication
- Stability of k-Means Clustering
- Estimation of the global mode of a density: minimaxity, adaptation, and computational complexity
- Banks' business models in the euro area: a cluster analysis in high dimensions
- Multicuts and perturb \& MAP for probabilistic graph clustering
- On the Estimation of the Number of Communities for Sparse Networks
- Explaining mixture models through semantic pattern mining and banded matrix visualization
- Optimal transport, mean partition, and uncertainty assessment in cluster analysis
- Visualizing non-metric similarities in multiple maps
- A statistical model of cluster stability
- A cautionary note on using internal cross validation to select the number of clusters
- Divisive clustering of high dimensional data streams
- Clustering ensemble based on sample's stability
- Good clusterings have large volume
- Modal clustering asymptotics with applications to bandwidth selection
- Adjusting the Adjusted Rand Index. A multinomial story
- A family of distances for preference-approvals
- A statistical view of clustering performance through the theory of \(U\)-processes
- Stability and model selection in \(k\)-means clustering
- Bootstrapping estimates of stability for clusters, observations and model selection
- Probabilistic correlation clustering and image partitioning using perturbed multicuts
This page was built for publication: Clustering stability: an overview
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q3569376)