Enhancing the selection of a model-based clustering with external categorical variables
Abstract: In cluster analysis, it can be useful to interpret the partition built from the data in the light of external categorical variables that were not directly involved in clustering the data. An approach is proposed in the model-based clustering context to select a model and a number of clusters that both fit the data well and take advantage of the potential illustrative ability of the external variables. This approach makes use of the integrated joint likelihood of the data and the partitions at hand, namely the model-based partition and the partitions associated with the external variables. Notably, each mixture model is fitted to the data by maximum likelihood, excluding the external variables, which are used only to select a relevant mixture model. Numerical experiments illustrate the promising behaviour of the derived criterion.
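The selection idea in the abstract can be illustrated with a minimal sketch. It is not the paper's criterion: it assumes each candidate mixture model has already been fitted to the data alone and summarised by a hypothetical `data_score` (for example a BIC- or ICL-type value on the log-likelihood scale, where larger is better) together with its cluster labels. The sketch then adds the maximised log-likelihood of each external categorical variable given those labels. The function names, the additive combination, and the absence of a separate penalty on the external term are all assumptions made for illustration.

```python
from collections import Counter
from math import log


def external_loglik(labels, external):
    """Maximised log-likelihood of one external categorical variable
    given the cluster labels: sum over (k, l) of n_kl * log(n_kl / n_k)."""
    if len(labels) != len(external):
        raise ValueError("labels and external must have the same length")
    joint = Counter(zip(labels, external))  # n_kl: counts per (cluster, category)
    per_cluster = Counter(labels)           # n_k: counts per cluster
    return sum(n * log(n / per_cluster[k]) for (k, _), n in joint.items())


def select_model(candidates, externals):
    """Pick the candidate with the largest combined score.

    candidates: {name: (data_score, labels)}, where data_score is a
                hypothetical criterion value computed from the data only.
    externals:  list of external categorical variables (one sequence each).
    Returns (best_name, scores).
    """
    scores = {
        name: data_score + sum(external_loglik(labels, u) for u in externals)
        for name, (data_score, labels) in candidates.items()
    }
    return max(scores, key=scores.get), scores


# Toy inputs, invented purely to exercise the functions.
toy_candidates = {
    "K=1": (-10.0, [0, 0, 0, 0]),
    "K=2": (-11.0, [0, 0, 1, 1]),
}
toy_external = [["a", "a", "b", "b"]]
best, all_scores = select_model(toy_candidates, toy_external)
```

In this toy case the two-cluster partition matches the external variable exactly, so its external term is zero and it is preferred despite the slightly lower data-only score. The external variables enter only at the selection step, never in the fit, which mirrors the separation described in the abstract.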
Recommendations
- Variable Selection for Clustering with Gaussian Mixture Models
- Variable selection in model-based clustering: a general variable role modeling
- Modelling the role of variables in model-based cluster analysis
- Variable Selection for Model-Based Clustering
- Variable selection for model-based clustering using the integrated complete-data likelihood
Cites work
- scientific article; zbMATH DE number 3567782
- scientific article; zbMATH DE number 1348600
- Consistent estimation of the order of mixture models.
- Estimating the dimension of a model
- Finite mixture models
- Model-based cluster and discriminant analysis with the MIXMOD software
- Practical Bayesian Density Estimation Using Mixtures of Normals
- The EM Algorithm and Extensions, 2E
Cited in (7)
- A model selection criterion for model-based clustering of annotated gene expression data
- Exploring dependence between categorical variables: benefits and limitations of using variable selection within Bayesian clustering in relation to log-linear modelling with interaction terms
- Improved model-based clustering performance using Bayesian initialization averaging
- Stability approach to selecting the number of principal components
- Distance Metrics and Clustering Methods for Mixed‐type Data
- Constrained clustering with a complex cluster structure
- Beyond the number of classes: separating substantive from non-substantive dependence in latent class analysis
MaRDI item Q2418394