Regression analysis for microbiome compositional data
From MaRDI portal
Abstract: One important problem in microbiome analysis is to identify the bacterial taxa that are associated with a response, where the microbiome data are summarized as the composition of the bacterial taxa at different taxonomic levels. This paper considers regression analysis with such compositional data as covariates. In order to satisfy the subcompositional coherence of the results, linear models with a set of linear constraints on the regression coefficients are introduced. Such models allow regression analysis for subcompositions and include the log-contrast model for compositional covariates as a special case. A penalized estimation procedure for estimating the regression coefficients and for selecting variables under the linear constraints is developed. A method is also proposed to obtain de-biased estimates of the regression coefficients that are asymptotically unbiased and have a joint asymptotic multivariate normal distribution. This provides valid confidence intervals of the regression coefficients and can be used to obtain the p-values. Simulation results show the validity of the confidence intervals and smaller variances of the de-biased estimates when the linear constraints are imposed. The proposed methods are applied to a gut microbiome data set and identify four bacterial genera that are associated with the body mass index after adjusting for the total fat and caloric intakes.
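The role of the linear constraint described in the abstract can be illustrated with a minimal sketch. It assumes the standard log-contrast form, in which the response is regressed on log-transformed compositions and the coefficients are constrained to sum to zero. This is not the paper's penalized or de-biased estimator: it is an unpenalized, low-dimensional constrained least-squares fit on simulated data, and the function name `constrained_least_squares`, the data, and all parameter values are hypothetical.

```python
import numpy as np


def constrained_least_squares(Z, y, C):
    """Minimize ||y - Z b||^2 subject to C.T @ b = 0 via the KKT linear system."""
    p, r = C.shape
    # Stationarity: Z'Z b + C lam = Z'y ; feasibility: C'b = 0.
    kkt = np.block([[Z.T @ Z, C], [C.T, np.zeros((r, r))]])
    rhs = np.concatenate([Z.T @ y, np.zeros(r)])
    solution = np.linalg.solve(kkt, rhs)
    return solution[:p]  # drop the Lagrange multipliers


rng = np.random.default_rng(0)
n, p = 200, 6

# Hypothetical data: positive abundances closed to compositions (rows sum to 1).
abundance = rng.lognormal(size=(n, p))
X = abundance / abundance.sum(axis=1, keepdims=True)

# Assumed true coefficients; they sum to zero, as the log-contrast model requires.
beta_true = np.array([1.0, -1.0, 0.5, -0.5, 0.0, 0.0])
y = np.log(X) @ beta_true + 0.1 * rng.standard_normal(n)

# Zero-sum constraint: a single constraint vector of ones.
C = np.ones((p, 1))
beta_hat = constrained_least_squares(np.log(X), y, C)

# Rescale every sample by an arbitrary positive factor (e.g. sequencing depth).
scale = rng.uniform(0.5, 20.0, size=(n, 1))
beta_rescaled = constrained_least_squares(np.log(X * scale), y, C)

print("estimate:          ", np.round(beta_hat, 3))
print("sum of estimate:   ", beta_hat.sum())
print("max change after rescaling:", np.abs(beta_hat - beta_rescaled).max())
```

Because the coefficients sum to zero, adding a per-sample constant to every log-covariate leaves the fitted values unchanged, so the estimate does not depend on how each sample was normalized. Passing a matrix `C` with several columns would impose several linear constraints at once, which is the kind of generalization the abstract refers to for subcompositions.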
Recommendations
- Generalized linear models with linear constraints for microbiome compositional data
- Variable selection in regression with compositional covariates
- A logistic normal multinomial regression model for microbiome compositional data analysis
- Compositional knockoff filter for high‐dimensional regression analysis of microbiome data
- Bayesian graphical compositional regression for microbiome data
Cites work
- scientific article; zbMATH DE number 3914081
- scientific article; zbMATH DE number 3772748
- scientific article; zbMATH DE number 1734442
- Confidence Intervals and Hypothesis Testing for High-Dimensional Regression
- Estimation and accuracy after model selection
- Exact post-selection inference, with application to the Lasso
- On asymptotically optimal confidence regions and tests for high-dimensional models
- Regression analysis for microbiome compositional data
- Scaled sparse linear regression
- Statistical significance in high-dimensional linear models
- Variable selection in regression with compositional covariates
Cited in (46)
- Compositional knockoff filter for high‐dimensional regression analysis of microbiome data
- Multivariate log-contrast regression with sub-compositional predictors: testing the association between preterm infants' gut microbiome and neurobehavioral outcomes
- Robust logistic zero-sum regression for microbiome compositional data
- Penalized and constrained LAD estimation in fixed and high dimension
- Robust regression with compositional covariates
- FDR control for linear log-contrast models with high-dimensional compositional covariates
- Principal component analysis for zero-inflated compositional data
- Bayesian graphical compositional regression for microbiome data
- Three approaches to supervised learning for compositional data with pairwise logratios
- Globally Adaptive Longitudinal Quantile Regression With High Dimensional Compositional Covariates
- An adaptive independence test for microbiome community data
- Identification of microbial features in multivariate regression under false discovery rate control
- Kernel-penalized regression for analysis of microbiome data
- Utilizing stability criteria in choosing feature selection methods yields reproducible results in microbiome data
- Log-Contrast Regression with Functional Compositional Predictors: Linking Preterm Infant's Gut Microbiome Trajectories to Neurobehavioral Outcome
- Variable selection in regression with compositional covariates
- A new parameterization for elliptically symmetric angular Gaussian distributions of arbitrary dimension
- A review of compositional data analysis and recent advances
- A dual semismooth Newton based augmented Lagrangian method for large-scale linearly constrained sparse group square-root Lasso problems
- It's All Relative: Regression Analysis with Compositional Predictors
- Structured subcomposition selection in regression and its application to microbiome data analysis
- A decomposition method for Lasso problems with zero-sum constraint
- Bayesian compositional regression with structured priors for microbiome feature selection
- A flexible Bayesian tool for CoDa mixed models: logistic-normal distribution with Dirichlet covariance
- High-dimensional count and compositional data analysis in microbiome studies
- Statistical analysis of microbiome data with R. Yinglin Xia, Jun Sun, Ding‐Gen Chen. (2018). Singapore: Springer. 505 pages, ISBN: 978‐981‐13‐1533‐6
- Generalized linear models with linear constraints for microbiome compositional data
- Compositional data: the sample space and its structure
- A robust knockoff filter for sparse regression analysis of microbiome compositional data
- Assessing mediating effects of high‐dimensional microbiome measurements in dietary intervention studies
- Rare feature selection in high dimensions
- Regression analysis for microbiome compositional data
- A logistic normal multinomial regression model for microbiome compositional data analysis
- A Bayesian model of microbiome data for simultaneous identification of covariate associations and prediction of phenotypic outcomes
- Statistical analysis of microbiome data with R
- The Integrated Nested Laplace Approximation for Fitting Dirichlet Regression Models
- Some aspects of non-standard multivariate analysis
- Discerning the linear convergence of ADMM for structured convex optimization through the lens of variational analysis
- Statistical models and computational algorithms for discovering relationships in microbiome data
- A Bayesian framework for identifying consistent patterns of microbial abundance between body sites
- Factor Augmented Inverse Regression and its Application to Microbiome Data Analysis
- Compositional mediation analysis for microbiome studies
- A folded model for compositional data analysis
- Flexible non-parametric regression models for compositional data
- Algorithms for Fitting the Constrained Lasso
- Robust Signal Recovery for High-Dimensional Linear Log-Contrast Models with Compositional Covariates
MaRDI item Q312961