Heterogeneity adjustment with applications to graphical model inference
From MaRDI portal
Abstract: Heterogeneity is an unwanted variation when analyzing aggregated datasets from multiple sources. Though different methods have been proposed for heterogeneity adjustment, no systematic theory exists to justify these methods. In this work, we propose a generic framework named ALPHA (short for Adaptive Low-rank Principal Heterogeneity Adjustment) to model, estimate, and adjust heterogeneity from the original data. Once the heterogeneity is adjusted, we are able to remove the biases of batch effects and to enhance the inferential power by aggregating the homogeneous residuals from multiple sources. Under a pervasive assumption that the latent heterogeneity factors simultaneously affect a large fraction of observed variables, we provide a rigorous theory to justify the proposed framework. Our framework also allows the incorporation of informative covariates and appeals to the "Bless of Dimensionality". As an illustrative application of this generic framework, we consider the problem of estimating a high-dimensional precision matrix for graphical model inference based on multiple datasets. We also provide thorough numerical studies on both synthetic datasets and a brain imaging dataset to demonstrate the efficacy of the developed theory and methods.
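The general idea in the abstract (estimate a low-rank heterogeneity term per source, subtract it, then pool the residuals) can be illustrated with a minimal sketch. This is not the authors' ALPHA procedure: it assumes plain PCA on each batch with a known number of factors `k`, uses simulated data from a hypothetical generator `simulate_batch`, and inverts the pooled sample covariance directly instead of using a sparse high-dimensional precision estimator. All function names and parameter values are illustrative assumptions.

```python
import numpy as np

def remove_top_factors(X, k):
    """Subtract the rank-k principal-component fit from a column-centered batch."""
    Xc = X - X.mean(axis=0)
    # Thin SVD: the leading k terms approximate the batch-specific low-rank part.
    U, S, Vt = np.linalg.svd(Xc, full_matrices=False)
    low_rank = (U[:, :k] * S[:k]) @ Vt[:k]
    return Xc - low_rank

def simulate_batch(rng, n, p, k, strength):
    """Hypothetical generator: batch-specific factors plus homogeneous noise."""
    loadings = strength * rng.standard_normal((p, k))  # pervasive loadings
    factors = rng.standard_normal((n, k))
    noise = rng.standard_normal((n, p))                # shared identity covariance
    return factors @ loadings.T + noise

rng = np.random.default_rng(0)
p, k = 20, 2  # assumed dimensions; k is treated as known here
batches = [simulate_batch(rng, n=300, p=p, k=k, strength=3.0) for _ in range(3)]

# Adjust each batch separately, then aggregate the residuals across batches.
residuals = np.vstack([remove_top_factors(X, k) for X in batches])
raw = np.vstack([X - X.mean(axis=0) for X in batches])

cov_adjusted = residuals.T @ residuals / residuals.shape[0]
cov_raw = raw.T @ raw / raw.shape[0]

# Plain inversion stands in for a sparse precision-matrix estimator.
precision_adjusted = np.linalg.inv(cov_adjusted)

top_raw = np.linalg.eigvalsh(cov_raw)[-1]
top_adjusted = np.linalg.eigvalsh(cov_adjusted)[-1]
print(f"largest eigenvalue before adjustment: {top_raw:.2f}")
print(f"largest eigenvalue after adjustment:  {top_adjusted:.2f}")
```

In this toy setting the leading eigenvalue of the pooled covariance drops sharply once the per-batch factors are removed, which is the effect the adjustment step is meant to achieve. In a genuinely high-dimensional setting (more variables than samples) the direct inverse would be replaced by a regularized estimator.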
Cites work
- A Linear Mixed-Effects Model With Heterogeneity in the Random-Effects Population
- A constrained \(\ell _{1}\) minimization approach to sparse precision matrix estimation
- A tail inequality for quadratic forms of subgaussian random vectors
- Adjusting batch effects in microarray expression data using empirical Bayes methods
- Asymptotics of empirical eigenstructure for high dimensional spiked covariance
- Asymptotics of sample eigenstructure for a large dimensional spiked covariance model
- Asymptotics of the principal components estimator of large factor models with weakly influential factors
- Covariate-adjusted precision matrix estimation with an application in genetical genomics
- Determining the Number of Factors in Approximate Factor Models
- Efficient semiparametric estimation of the Fama-French model and extensions
- Eigenvalue ratio test for the number of factors
- Estimating heterogeneous graphical models for discrete data with an application to roll call voting
- Estimation of (near) low-rank matrices with noise and high-dimensional scaling
- Estimation of functionals of sparse covariance matrices
- Factor modeling for high-dimensional time series: inference for the number of factors
- Forecasting Using Principal Components From a Large Number of Predictors
- Fused multiple graphical lasso
- Hanson-Wright inequality and sub-Gaussian concentration
- High dimensional inverse covariance matrix estimation via linear programming
- High-dimensional covariance estimation by minimizing \(\ell _{1}\)-penalized log-determinant divergence
- High-dimensional graphs and variable selection with the Lasso
- Inferential Theory for Factor Models of Large Dimensions
- Joint estimation of multiple graphical models
- Large covariance estimation by thresholding principal orthogonal complements. With discussion and authors' reply
- Likelihood-based selection and sharp parameter estimation
- Model selection and estimation in the Gaussian graphical model
- On consistency and sparsity for principal components analysis in high dimensions
- Principal components estimation and identification of static factors
- Projected principal component analysis in factor models
- Sparse PCA: optimal rates and adaptive estimation
- Sparse inverse covariance estimation with the graphical lasso
- Sparsistency and rates of convergence in large covariance matrix estimation
- Structure estimation for discrete graphical models: generalized covariance matrices and their inverses
- The Joint Graphical Lasso for Inverse Covariance Estimation Across Multiple Classes
- The nonparanormal: semiparametric estimation of high dimensional undirected graphs
Cited in (4)
- High-Dimensional Factor Regression for Heterogeneous Subpopulations
- Bayesian Edge Regression in Undirected Graphical Models to Characterize Interpatient Heterogeneity in Cancer
- scientific article; zbMATH DE number 6253970
- Heterogeneous large datasets integration using Bayesian factor regression
MaRDI item: Q1711558