Clustering multivariate data using factor analytic Bayesian mixtures with an unknown number of components
From MaRDI portal
(Redirected from Publication:140405)
Abstract: Recent work on overfitting Bayesian mixtures of distributions offers a powerful framework for clustering multivariate data using a latent Gaussian model which resembles the factor analysis model. The flexibility provided by overfitting mixture models yields a simple and efficient way in order to estimate the unknown number of clusters and model parameters by Markov chain Monte Carlo (MCMC) sampling. The present study extends this approach by considering a set of eight parameterizations, giving rise to parsimonious representations of the covariance matrix per cluster. A Gibbs sampler combined with a prior parallel tempering scheme is implemented in order to approximately sample from the posterior distribution of the overfitting mixture. The parameterization and number of factors is selected according to the Bayesian Information Criterion. Identifiability issues related to label switching are dealt by post-processing the simulated output with the Equivalence Classes Representatives algorithm. The contributed method and software are demonstrated and compared to similar models estimated using the Expectation-Maximization algorithm on simulated and real datasets. The software is available online at https://CRAN.R-project.org/package=fabMix.
Recommendations
- Bayesian variable selection in clustering high-dimensional data via a mixture of finite mixtures
- Flexible clustering via extended mixtures of common \(t\)-factor analyzers
- Dimensionally reduced model-based clustering through mixtures of factor mixture analyzers
- scientific article; zbMATH DE number 5059940
- Hierarchical Bayesian nonparametric mixture models for clustering with variable relevance determination
- Robust clustering via mixtures of t factor analyzers with incomplete data
- Model-based clustering via mixtures of unrestricted skew normal factor analyzers with complete and incomplete data
- Bayesian analysis of mixtures of factor analyzers
- Mixtures of general location model with factor analyzer covariance structure for clustering mixed type data
- A Bayesian sparse finite mixture model for clustering data from a heterogeneous population
Cites work
- scientific article; zbMATH DE number 3860199 (Why is no real title available?)
- scientific article; zbMATH DE number 3567782 (Why is no real title available?)
- scientific article; zbMATH DE number 1086057 (Why is no real title available?)
- A Bayesian analysis of some nonparametric problems
- Annealing Markov Chain Monte Carlo with Applications to Ancestral Inference
- Asymptotic behaviour of the posterior distribution in overfitted mixture models
- Bayesian analysis of mixture models with an unknown number of components\,--\,an alternative to reversible jump methods.
- Bayesian exploratory factor analysis
- Computational and Inferential Difficulties with Mixture Posterior Distributions
- Estimating the dimension of a model
- Finite mixture models
- From here to infinity: sparse finite versus Dirichlet process mixtures in model-based clustering
- Handling the label switching problem in latent class models via the ECR algorithm
- Infinite mixtures of infinite factor analysers
- Latent variable models and factor analysis. A unified approach
- Mixture Densities, Maximum Likelihood and the EM Algorithm
- Mixtures of factor analysers. Bayesian estimation and inference by stochastic simulation
- Model-Based Clustering, Discriminant Analysis, and Density Estimation
- Model-based clustering based on sparse finite Gaussian mixtures
- On the convergence rate of random permutation sampler and ECR algorithm in missing data models
- Overfitting Bayesian mixtures of factor analyzers with an unknown number of components
- RcppArmadillo: accelerating R with high-performance C++ linear algebra
- Reversible jump MCMC in mixtures of normal distributions with the same component means
- Reversible jump Markov chain Monte Carlo computation and Bayesian model determination
- Sampling-Based Approaches to Calculating Marginal Densities
- Serial and parallel implementations of model-based clustering via parsimonious Gaussian mixture models
- Sparse Bayesian infinite factor models
- Stochastic Relaxation, Gibbs Distributions, and the Bayesian Restoration of Images
- Stochastic search item selection for factor analytic models
- Updating the Inverse of a Matrix
Cited in
(6)- A data-driven reversible jump for estimating a finite mixture of regression models
- On the identifiability of Bayesian factor analytic models
- Overfitting Bayesian mixtures of factor analyzers with an unknown number of components
- fabMix
- Clustering and classification via cluster-weighted factor analyzers
- Adaptive estimation in structured factor models with applications to overlapping clustering
Describes a project that uses
Uses Software
This page was built for publication: Clustering multivariate data using factor analytic Bayesian mixtures with an unknown number of components
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q140405)