Model based clustering of high-dimensional binary data
Publication:1663311
Abstract: We propose a mixture of latent trait models with common slope parameters (MCLT) for model-based clustering of high-dimensional binary data, a data type for which few established methods exist. Recent work on clustering of binary data, based on a \(d\)-dimensional Gaussian latent variable, is extended by incorporating common factor analyzers. Accordingly, our approach facilitates a low-dimensional visual representation of the clusters. We extend the model further by the incorporation of random block effects. The dependencies in each block are taken into account through block-specific parameters that are considered to be random variables. A variational approximation to the likelihood is exploited to derive a fast algorithm for determining the model parameters. Our approach is demonstrated on real and simulated data.
Recommendations
- Model-based clustering of high-dimensional data: a review
- Efficient mixture model for clustering of sparse high dimensional binary data
- Model-based clustering
- Model-based clustering
- Clustering for binary data and mixture models—choice of the model
- The remarkable simplicity of very high dimensional data: application of model-based clustering
- On model-based clustering, classification, and discriminant analysis
- Model-based clustering of multivariate ordinal data relying on a stochastic binary search algorithm
Cites work
- scientific article; zbMATH DE number 45532
- scientific article; zbMATH DE number 2188755
- A factor mixture analysis model for multivariate binary data
- Analytic calculations for the EM algorithm for multivariate skew-\(t\) mixture models
- Dimension reduction for model-based clustering via mixtures of shifted asymmetric Laplace distributions
- Estimating common principal components in high dimensions
- Estimating the dimension of a model
- Finite mixture models
- Latent class and finite mixture models for multilevel data sets
- Mixture model clustering using the MULTIMIX program
- Mixture modelling for cluster analysis
- Mixture of latent trait analyzers for model-based clustering of categorical data
- Mixtures of skew-\(t\) factor analyzers
- Model-Based Clustering, Discriminant Analysis, and Density Estimation
- Model-Based Gaussian and Non-Gaussian Clustering
- Model-based clustering, classification, and discriminant analysis of data with mixed type
- Multivariate and mixture distribution Rasch models. Extensions and applications
- On mixtures of skew normal and skew \(t\)-distributions
- On the Bumpy Road to the Dominant Mode
- Orthogonal Stiefel manifold optimization for eigen-decomposed covariance parameter estimation in mixture models
- Parsimonious skew mixture models for model-based clustering and classification
- The distribution of the likelihood ratio for mixtures of densities from the one-parameter exponential family
- Variational Bayes approximations for clustering via mixtures of normal inverse Gaussian distributions
Cited in (33)
- Multi-way blockmodels for analyzing coordinated high-dimensional responses
- Noise-free latent block model for high dimensional data
- On bandwidth selection using minimal spanning tree for kernel density estimation
- Logistic Normal Multinomial Factor Analyzers for Clustering Microbiome Data
- Ensemble clustering for step data via binning
- Diagonal latent block model for binary data
- Model-based clustering
- Investigation of the structure of binary variables
- Piecewise regression mixture for simultaneous functional data clustering and optimal segmentation
- Model-based multidimensional clustering of categorical data
- scientific article; zbMATH DE number 912697
- Latent simplex position model: high dimensional multi-view clustering with uncertainty quantification
- Iterative factor clustering of binary data
- A general framework for association analysis of heterogeneous data
- A finite mixture approach to joint clustering of individuals and multivariate discrete outcomes
- Statistical analysis of very high-dimensional data sets of hierarchically structured binary variables with missing data: An application to marine corps readiness evaluations
- Model-based clustering of high-dimensional data: a review
- A family of parsimonious mixtures of multivariate Poisson-lognormal distributions for clustering multivariate count data
- A model-based approach to simultaneous clustering and dimensional reduction of ordinal data
- Block clustering with collapsed latent block models
- Using latent variables in model based clustering: an e-government application
- Model based clustering of large data sets: tracing the development of spelling ability
- Block Bernoulli Parsimonious Clustering Models
- Mixture of latent trait analyzers for model-based clustering of categorical data
- Efficient mixture model for clustering of sparse high dimensional binary data
- Conditionally conjugate mean-field variational Bayes for logistic models
- Clustering with hidden Markov model on variable blocks
- A Bayesian approach to model-based clustering for binary panel probit models
- Clustering of multivariate binary data with dimension reduction via \(L_{1}\)-regularized likelihood maximization
- A family of block-wise one-factor distributions for modeling high-dimensional binary data
- A latent variables approach for clustering mixed binary and continuous variables within a Gaussian mixture model
- On Generalized Latent Factor Modeling and Inference for High-Dimensional Binomial Data
- A Bayesian approach to restricted latent class models for scientifically structured clustering of multivariate binary outcomes