Choosing the number of clusters in a finite mixture model using an exact integrated completed likelihood criterion
From MaRDI portal
(Redirected from Publication:497086)
Abstract: The integrated completed likelihood (ICL) criterion has proven to be a very popular approach in model-based clustering through automatically choosing the number of clusters in a mixture model. This approach effectively maximises the complete data likelihood, thereby including the allocation of observations to clusters in the model selection criterion. However for practical implementation one needs to introduce an approximation in order to estimate the ICL. Our contribution here is to illustrate that through the use of conjugate priors one can derive an exact expression for ICL and so avoiding any approximation. Moreover, we illustrate how one can find both the number of clusters and the best allocation of observations in one algorithmic framework. The performance of our algorithm is presented on several simulated and real examples.
Recommendations
- Estimation and model selection for model-based clustering with the conditional classification likelihood
- Order selection in finite mixture models: complete or observed likelihood information criteria?
- Variable selection for model-based clustering using the integrated complete-data likelihood
- An entropy criterion for assessing the number of clusters in a mixture model
- How Many Clusters? Which Clustering Method? Answers Via Model-Based Cluster Analysis
Cites work
- scientific article; zbMATH DE number 3986503 (Why is no real title available?)
- scientific article; zbMATH DE number 1085980 (Why is no real title available?)
- An invariant form for the prior probability in estimation problems
- Block clustering with collapsed latent block models
- Density Estimation With Confidence Sets Exemplified by Superclusters and Voids in the Galaxies
- Exact and Monte Carlo calculations of integrated likelihoods for the latent class model
- Finite mixture models and model-based clustering
- Improved Bayesian inference for the stochastic block model with application to large networks
- Markov chain Monte Carlo methods and the label switching problem in Bayesian mixture modeling
- Mixtures
- Model selection and clustering in stochastic block models based on the exact integrated complete data likelihood
- Model-Based Clustering, Discriminant Analysis, and Density Estimation
- Reversible jump Markov chain Monte Carlo computation and Bayesian model determination
Cited in
(8)- Estimation and model selection for model-based clustering with the conditional classification likelihood
- Hierarchical clustering with discrete latent variable models and the integrated classification likelihood
- The sensitivity of the number of clusters in a Gaussian mixture model to prior distributions
- Comparing classical criteria for selecting intra-class correlated features in Multimix
- Variable selection for model-based clustering using the integrated complete-data likelihood
- Finding the Number of Normal Groups in Model-Based Clustering via Constrained Likelihoods
- Latent variable models for the analysis of socio-economic data
- Optimal Bayesian estimators for latent variable cluster models
This page was built for publication: Choosing the number of clusters in a finite mixture model using an exact integrated completed likelihood criterion
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q497086)