Logistic Normal Multinomial Factor Analyzers for Clustering Microbiome Data
From MaRDI portal
Abstract: The human microbiome plays an important role in human health and disease status. Next generating sequencing technologies allow for quantifying the composition of the human microbiome. Clustering these microbiome data can provide valuable information by identifying underlying patterns across samples. Recently, Fang and Subedi (2020) proposed a logistic normal multinomial mixture model (LNM-MM) for clustering microbiome data. As microbiome data tends to be high dimensional, here, we develop a family of logistic normal multinomial factor analyzers (LNM-FA) by incorporating a factor analyzer structure in the LNM-MM. This family of models is more suitable for high-dimensional data as the number of parameters in LNM-FA can be greatly reduced by assuming that the number of latent factors is small. Parameter estimation is done using a computationally efficient variant of the alternating expectation conditional maximization algorithm that utilizes variational Gaussian approximations. The proposed method is illustrated using simulated and real datasets.
Cites work
- scientific article; zbMATH DE number 3942813 (Why is no real title available?)
- scientific article; zbMATH DE number 3772748 (Why is no real title available?)
- scientific article; zbMATH DE number 3567782 (Why is no real title available?)
- scientific article; zbMATH DE number 1059776 (Why is no real title available?)
- scientific article; zbMATH DE number 1086057 (Why is no real title available?)
- A Dirichlet-Tree Multinomial Regression Model for Associating Dietary Nutrients with Gut Microorganisms
- A correlated topic model of science
- A logistic normal multinomial regression model for microbiome compositional data analysis
- Cluster-weighted t-factor analyzers for robust model-based clustering and dimension reduction
- Clustering and classification via cluster-weighted factor analyzers
- Consistent estimation of the order of mixture models.
- Dirichlet-tree multinomial mixtures for clustering microbiome compositions
- Estimating the dimension of a model
- Finite mixture models
- Gaussian Kullback-Leibler approximate inference
- Graphical models, exponential families, and variational inference
- How Many Clusters? Which Clustering Method? Answers Via Model-Based Cluster Analysis
- Mixture of latent trait analyzers for model-based clustering of categorical data
- Model based clustering of high-dimensional binary data
- Modelling high-dimensional data by mixtures of factor analyzers
- New perspectives on water chemistry and compositional data analysis
- On Information and Sufficiency
- Simultaneous model-based clustering and visualization in the Fisher discriminative subspace
- The distribution of the likelihood ratio for mixtures of densities from the one-parameter exponential family
- Variable selection for sparse Dirichlet-multinomial regression with an application to microbiome data analysis
- Variational Gaussian approximation for Poisson data
This page was built for publication: Logistic Normal Multinomial Factor Analyzers for Clustering Microbiome Data
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q101518)