Model-based clustering based on sparse finite Gaussian mixtures
Abstract: In the framework of Bayesian model-based clustering based on a finite mixture of Gaussian distributions, we present a joint approach to estimate the number of mixture components and identify cluster-relevant variables simultaneously as well as to obtain an identified model. Our approach consists in specifying sparse hierarchical priors on the mixture weights and component means. In a deliberately overfitting mixture model the sparse prior on the weights empties superfluous components during MCMC. A straightforward estimator for the true number of components is given by the most frequent number of non-empty components visited during MCMC sampling. Specifying a shrinkage prior, namely the normal gamma prior, on the component means leads to improved parameter estimates as well as identification of cluster-relevant variables. After estimating the mixture model using MCMC methods based on data augmentation and Gibbs sampling, an identified model is obtained by relabeling the MCMC output in the point process representation of the draws. This is performed using \(K\)-centroids cluster analysis based on the Mahalanobis distance. We evaluate our proposed strategy in a simulation setup with artificial data and by applying it to benchmark data sets.
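The estimator for the number of components described in the abstract (the most frequent number of non-empty components across MCMC draws) can be illustrated with a minimal sketch. This is not the authors' implementation: the function names `count_nonempty` and `estimate_num_components`, the array layout (one row of integer component labels per MCMC draw), and the toy allocations are all assumptions made for illustration, and ties are broken in favour of the smaller count purely by convention.

```python
import numpy as np


def count_nonempty(allocations, k_max):
    """Return the number of non-empty components in each MCMC draw.

    allocations: array-like of shape (n_draws, n_obs) holding integer
                 component labels in 0..k_max-1 (hypothetical layout).
    k_max:       number of components in the overfitting mixture.
    """
    allocations = np.asarray(allocations)
    counts = np.empty(allocations.shape[0], dtype=int)
    for m, draw in enumerate(allocations):
        # Component sizes for this draw; empty components have size 0.
        sizes = np.bincount(draw, minlength=k_max)
        counts[m] = np.count_nonzero(sizes)
    return counts


def estimate_num_components(allocations, k_max):
    """Most frequent number of non-empty components over all draws."""
    nonempty = count_nonempty(allocations, k_max)
    freq = np.bincount(nonempty, minlength=k_max + 1)
    # np.argmax returns the first maximum, so ties go to the smaller count.
    return int(np.argmax(freq))


# Toy allocations (made up, not real MCMC output): 4 draws, 5 observations.
draws = [
    [0, 0, 1, 1, 2],
    [0, 0, 1, 1, 1],
    [0, 3, 3, 0, 0],
    [1, 1, 1, 1, 1],
]
k_hat = estimate_num_components(draws, k_max=4)
```

In this toy input the draws contain 3, 2, 2 and 1 non-empty components, so the sketch returns 2. In practice the allocations would come from the Gibbs sampler after burn-in, which this sketch does not attempt to reproduce.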
Recommendations
- From here to infinity: sparse finite versus Dirichlet process mixtures in model-based clustering
- Model-based clustering with sparse covariance matrices
- Hierarchical Bayesian nonparametric mixture models for clustering with variable relevance determination
- Variable Selection for Clustering with Gaussian Mixture Models
- A Bayesian sparse finite mixture model for clustering data from a heterogeneous population
Cites work
- scientific article; zbMATH DE number 597901
- scientific article; zbMATH DE number 1085980
- scientific article; zbMATH DE number 1489827
- A toolbox for \(K\)-centroids cluster analysis
- Asymptotic behaviour of the posterior distribution in overfitted mixture models
- Bayes variable selection in semiparametric linear models
- Bayesian Model Selection in Finite Mixtures by Marginal Density Decompositions
- Bayesian Variable Selection in Clustering High-Dimensional Data
- Bayesian inference for finite mixtures of univariate and multivariate skew-normal and skew-\(t\) distributions
- Bayesian mixture labeling by highest posterior density
- Bayesian profile regression with an application to the National Survey of Children's Health
- Bayesian wavelet-based curve classification via discriminant analysis with Markov random tree priors
- Computational and Inferential Difficulties with Mixture Posterior Distributions
- Dealing With Label Switching in Mixture Models
- Dealing with label switching in mixture models under genuine multimodality
- Detecting Features in Spatial Point Processes with Clutter via Model-Based Clustering
- Deviance information criteria for missing data models
- Estimating marginal likelihoods for mixture and Markov switching models using bridge sampling techniques
- Finding Groups in Data
- Finite mixture and Markov switching models
- Finite mixture models
- Finite mixtures of multivariate skew \(t\)-distributions: some recent and new results
- Hierarchical Bayesian nonparametric mixture models for clustering with variable relevance determination
- Inference with normal-gamma prior distributions in regression problems
- Interpretation and inference in mixture models: simple MCMC works
- Latent class analysis variable selection
- Markov chain Monte Carlo Estimation of Classical and Dynamic Switching and Mixture Models
- Markov chain Monte Carlo methods and the label switching problem in Bayesian mixture modeling
- Methods for merging Gaussian mixture components
- Model-Based Gaussian and Non-Gaussian Clustering
- Model-based clustering of longitudinal data
- Model-based clustering of non-Gaussian panel data based on skew-\(t\) distributions
- Nonparametric Bayes conditional distribution modeling with variable selection
- On the posterior distribution of the number of components in a finite mixture
- Panel data analysis: a survey on model-based clustering of time series
- Penalized model-based clustering with application to variable selection
- Sparse Bayesian hierarchical modeling of high-dimensional clustering problems
- Statistical Analysis of Financial Data in S-Plus
- The Bayesian Lasso
- Variable Selection for Clustering with Gaussian Mixture Models
- Variable Selection for Model-Based Clustering
- Variable Selection for Model-Based High-Dimensional Clustering and Its Application to Microarray Data
- Variable Selection in Penalized Model‐Based Clustering Via Regularization on Grouped Parameters
- Variable selection in clustering via Dirichlet process mixture models
Cited in (59)
- Hierarchical Bayesian nonparametric mixture models for clustering with variable relevance determination
- How many data clusters are in the galaxy data set? Bayesian cluster analysis in action
- A two-stage Bayesian semiparametric model for novelty detection with robust prior information
- Variable selection in finite mixture of regression models with an unknown number of components
- Practical Initialization of Recursive Mixture-Based Clustering for Non-negative Data
- A Bayesian sparse finite mixture model for clustering data from a heterogeneous population
- Generalized mixtures of finite mixtures and telescoping sampling
- Stochastic model specification in Markov switching vector error correction models
- Bayesian mixture model of extended redundancy analysis
- Variational inference and sparsity in high-dimensional deep Gaussian mixture models
- Semi-supervised nonparametric Bayesian modelling of spatial proteomics
- Model-based clustering with sparse covariance matrices
- Bayesian sparse convex clustering via global-local shrinkage priors
- A novel heuristic algorithm to solve penalized regression-based clustering model
- Algorithms for Model-Based Gaussian Hierarchical Clustering
- On the identifiability of Bayesian factor analytic models
- Variable selection methods for model-based clustering
- Keeping the balance -- bridge sampling for marginal likelihood estimation in finite mixture, mixture of experts and Markov mixture models
- Using conditional independence for parsimonious model-based Gaussian clustering
- Determinantal point process mixtures via spectral density approach
- Effect fusion using model-based clustering
- A Bayesian panel vector autoregression to analyze the impact of climate shocks on high-income economies
- Modelling the role of variables in model-based cluster analysis
- Estimation and Selection for High-Order Markov Chains with Bayesian Mixture Transition Distribution Models
- Clustering multivariate data using factor analytic Bayesian mixtures with an unknown number of components
- Robust clustering with subpopulation-specific deviations
- A semiparametric Bayesian joint model for multiple mixed-type outcomes: an application to acute myocardial infarction
- A flexible predictive density combination for large financial data sets in regular and crisis periods
- Finite mixture models and model-based clustering
- Multivariate bounded asymmetric Gaussian mixture model
- Component elimination strategies to fit mixtures of multiple scale distributions
- Bayesian variable selection in clustering high-dimensional data via a mixture of finite mixtures
- Variable diagnostics in model-based clustering through variation partition
- Identifying connected components in Gaussian finite mixture models for clustering
- BayesMultiMode
- Dynamic Dirichlet process mixture model for identifying voting coalitions in the United Nations General Assembly human rights roll call votes
- A survey on model-based co-clustering: high dimension and estimation challenges
- A horseshoe mixture model for Bayesian screening with an application to light sheet fluorescence microscopy in brain imaging
- From here to infinity: sparse finite versus Dirichlet process mixtures in model-based clustering
- Polynomial whitening for high-dimensional data
- Is infinity that far? A Bayesian nonparametric perspective of finite mixture models
- On a loss-based prior for the number of components in mixture models
- Bayesian inference for continuous-time hidden Markov models with an unknown number of states
- Bayesian shrinkage in mixture-of-experts models: identifying robust determinants of class membership
- Bayesian curve fitting and clustering with Dirichlet process mixture models for microarray data
- Bayesian inference and prediction of a multiple-change-point panel model with nonparametric priors
- A Bayesian mixture model for clustering circular data
- Dynamic model-based clustering for spatio-temporal data
- Finite mixtures of ERGMs for modeling ensembles of networks
- Semiparametric finite mixture of regression models with Bayesian P-splines
- Variational Bayes estimation of hierarchical Dirichlet-multinomial mixtures for text clustering
- An enriched mixture model for functional clustering
- Population size estimation by repeated identifications of units. A Bayesian semi-parametric mixture model approach
- Assessing aquatic toxicity assessment via a clustered variance model
- Bayesian mixture models (in)consistency for the number of clusters
- Fast and Flexible Bayesian Inference in Time-varying Parameter Regression Models
- Finite-dimensional Discrete Random Structures and Bayesian Clustering
- Bayesian mode inference for discrete distributions in economics and finance
- Clusterwise multivariate regression of mixed-type panel data
This page was built for publication: Model-based clustering based on sparse finite Gaussian mixtures (MaRDI item Q66958)