Abstract: We construct a cross-entropy clustering (CEC) theory which finds the optimal number of clusters by automatically removing groups which carry no information. Moreover, our theory gives a simple and efficient criterion to verify cluster validity. Although CEC can be built on an arbitrary family of densities, in the most important case of Gaussian CEC: (i) the division into clusters is affine invariant; (ii) the clustering tends to divide the data into ellipsoid-type shapes; (iii) the approach is computationally efficient, as we can apply the Hartigan approach. We also study, with particular attention, clustering based on spherical Gaussian densities and on Gaussian densities with covariance $s \mathrm{I}$. In the latter case we show that, as $s$ converges to zero, we obtain classical k-means clustering.
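As a rough illustration of the spherical Gaussian case mentioned in the abstract, the sketch below evaluates a CEC-style cost of the form sum_i p_i * (-ln p_i + H_i). Here p_i is the fraction of points in cluster i and H_i is the cross-entropy of that cluster with respect to its best-fitting spherical Gaussian. This is a minimal sketch under stated assumptions, not the authors' implementation: the function name `spherical_cec_cost`, the toy data, and the closed form used for H_i (obtained by minimizing the Gaussian cross-entropy over a single variance parameter) are not taken from this record.

```python
import math
import random


def spherical_cec_cost(points, labels):
    """Hypothetical helper: sum_i p_i * (-ln p_i + H_i) for spherical Gaussians.

    Assumes every cluster contains at least two distinct points, so that the
    trace of its covariance is strictly positive.
    """
    n_total = len(points)
    dim = len(points[0])

    # Group the points by cluster label.
    clusters = {}
    for x, lab in zip(points, labels):
        clusters.setdefault(lab, []).append(x)

    cost = 0.0
    for members in clusters.values():
        p = len(members) / n_total
        mean = [sum(x[d] for x in members) / len(members) for d in range(dim)]
        # Trace of the maximum-likelihood covariance, i.e. the mean squared
        # distance of the cluster members to their centroid.
        trace = sum(
            sum((x[d] - mean[d]) ** 2 for d in range(dim)) for x in members
        ) / len(members)
        # Cross-entropy w.r.t. the best spherical Gaussian N(mean, (trace/dim) I).
        h = 0.5 * dim * math.log(2 * math.pi * math.e * trace / dim)
        cost += p * (-math.log(p) + h)
    return cost


# Toy data (an assumption for illustration): two well-separated 2-D blobs.
rng = random.Random(0)
blob_a = [(rng.gauss(0, 1), rng.gauss(0, 1)) for _ in range(200)]
blob_b = [(rng.gauss(20, 1), rng.gauss(0, 1)) for _ in range(200)]
points = blob_a + blob_b

one_cluster = spherical_cec_cost(points, [0] * 400)
two_clusters = spherical_cec_cost(points, [0] * 200 + [1] * 200)
# An unnecessary extra split of the first blob into two arbitrary halves.
split_in_blob = spherical_cec_cost(points, [0] * 100 + [1] * 100 + [2] * 200)
```

On this toy data the two-cluster labelling should have a lower cost than both the single cluster and the labelling with a superfluous third group. The -ln p_i term penalizes each additional cluster, which is one way to read the abstract's remark about removing groups that carry no information.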
Recommendations
- Application of the cross-entropy method to clustering and vector quantization
- scientific article; zbMATH DE number 1098881
- Entropic approach to multiscale clustering analysis
- A clustering model with Rényi entropy regularization
- scientific article; zbMATH DE number 1975248
- K-MEANS CLUSTERING USING ENTROPY MINIMIZATION
Cites work
- scientific article; zbMATH DE number 6381735
- scientific article; zbMATH DE number 1810276
- scientific article; zbMATH DE number 2131221
- scientific article; zbMATH DE number 41467
- scientific article; zbMATH DE number 107482
- scientific article; zbMATH DE number 1059776
- scientific article; zbMATH DE number 2061729
- scientific article; zbMATH DE number 5586166
- scientific article; zbMATH DE number 3241743
- scientific article; zbMATH DE number 3340881
- A deterministic annealing approach to clustering
- Applied multivariate statistical analysis
- Bayesian k-Means as a “Maximization-Expectation” Algorithm
- Clustering Methods: A History of k-Means Algorithms
- Competitive EM algorithm for finite mixture models
- Estimating the number of clusters in a data set via the gap statistic
- Least squares quantization in PCM
- Model-Based Gaussian and Non-Gaussian Clustering
- NP-hardness of Euclidean sum-of-squares clustering
- On clustering validation techniques
- Printer graphics for clustering
Cited in (14)
- Logistic regression with weight grouping priors
- Ellipticity and circularity measuring via Kullback-Leibler divergence
- Neighborhood density information in clustering
- scientific article; zbMATH DE number 1975248
- Semi-supervised cross-entropy clustering with information bottleneck constraint
- Clustering of seasonal events: A simulation study using circular methods
- Constrained clustering with a complex cluster structure
- R2DS: a novel hierarchical framework for driver fatigue detection in mountain freeway
- CEC
- Extreme entropy machines: robust information theoretic classification
- Efficient mixture model for clustering of sparse high dimensional binary data
- K-MEANS CLUSTERING USING ENTROPY MINIMIZATION
- Application of the cross-entropy method to clustering and vector quantization
- Lossy compression approach to subspace clustering
This page was built for publication: Cross-entropy clustering