Model based clustering of high-dimensional binary data

From MaRDI portal
Publication:1663311

DOI10.1016/J.CSDA.2014.12.009zbMATH Open1468.62191arXiv1404.3174OpenAlexW1964414401MaRDI QIDQ1663311FDOQ1663311


Authors: Yang Tang, Ryan P. Browne, Paul D. McNicholas Edit this on Wikidata


Publication date: 21 August 2018

Published in: Computational Statistics and Data Analysis (Search for Journal in Brave)

Abstract: We propose a mixture of latent trait models with common slope parameters (MCLT) for model-based clustering of high-dimensional binary data, a data type for which few established methods exist. Recent work on clustering of binary data, based on a d-dimensional Gaussian latent variable, is extended by incorporating common factor analyzers. Accordingly, our approach facilitates a low-dimensional visual representation of the clusters. We extend the model further by the incorporation of random block effects. The dependencies in each block are taken into account through block-specific parameters that are considered to be random variables. A variational approximation to the likelihood is exploited to derive a fast algorithm for determining the model parameters. Our approach is demonstrated on real and simulated data.


Full work available at URL: https://arxiv.org/abs/1404.3174




Recommendations




Cites Work


Cited In (18)

Uses Software





This page was built for publication: Model based clustering of high-dimensional binary data

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q1663311)