High-dimensional inference for cluster-based graphical models
From MaRDI portal
Publication:4969100
Abstract: Motivated by modern applications in which one constructs graphical models based on a very large number of features, this paper introduces a new class of cluster-based graphical models, in which variable clustering is applied as an initial step for reducing the dimension of the feature space. We employ model assisted clustering, in which the clusters contain features that are similar to the same unobserved latent variable. Two different cluster-based Gaussian graphical models are considered: the latent variable graph, corresponding to the graphical model associated with the unobserved latent variables, and the cluster-average graph, corresponding to the vector of features averaged over clusters. Our study reveals that likelihood based inference for the latent graph, not analyzed previously, is analytically intractable. Our main contribution is the development and analysis of alternative estimation and inference strategies, for the precision matrix of an unobservable latent vector . We replace the likelihood of the data by an appropriate class of empirical risk functions, that can be specialized to the latent graphical model and to the simpler, but under-analyzed, cluster-average graphical model. The estimators thus derived can be used for inference on the graph structure, for instance on edge strength or pattern recovery. Inference is based on the asymptotic limits of the entry-wise estimates of the precision matrices associated with the conditional independence graphs under consideration. While taking the uncertainty induced by the clustering step into account, we establish Berry-Esseen central limit theorems for the proposed estimators. It is noteworthy that, although the clusters are estimated adaptively from the data, the central limit theorems regarding the entries of the estimated graphs are proved under the same conditions one would use if the clusters were known....
Recommendations
- The cluster graphical Lasso for improved estimation of Gaussian graphical models
- scientific article; zbMATH DE number 7255095
- Simultaneous Clustering and Estimation of Heterogeneous Graphical Models
- Inferring sparse Gaussian graphical models with latent structure
- Graphical model selection with latent variables
Cites work
- scientific article; zbMATH DE number 6388313 (Why is no real title available?)
- scientific article; zbMATH DE number 490141 (Why is no real title available?)
- scientific article; zbMATH DE number 720689 (Why is no real title available?)
- A constrained \(\ell _{1}\) minimization approach to sparse precision matrix estimation
- A general theory of hypothesis tests and confidence regions for sparse high dimensional models
- A likelihood ratio framework for high-dimensional semiparametric regression
- A significance test for the lasso
- A unified theory of confidence regions and testing for high-dimensional estimating equations
- Asymptotic Statistics
- Asymptotic normality and optimalities in estimation of large Gaussian graphical models
- Conditional-mean least-squares fitting of Gaussian Markov random fields to Gaussian fields
- Confidence Intervals and Hypothesis Testing for High-Dimensional Regression
- Confidence intervals for high-dimensional inverse covariance estimation
- Confidence intervals for high-dimensional linear regression: minimax rates and adaptivity
- Confidence intervals for low dimensional parameters in high dimensional linear models
- Consistent covariate selection and post model selection inference in semiparametric regression.
- Efficient, certifiably optimal clustering with applications to latent variable graphical models
- Estimating sparse precision matrix: optimal rates of convergence and adaptive estimation
- Exact post-selection inference, with application to the Lasso
- Gaussian graphical model estimation with false discovery rate control
- High dimensional inverse covariance matrix estimation via linear programming
- High dimensional semiparametric latent graphical model for mixed data
- High-dimensional covariance estimation by minimizing \(\ell _{1}\)-penalized log-determinant divergence
- High-dimensional graphs and variable selection with the Lasso
- High-dimensional semiparametric Gaussian copula graphical models
- High-dimensional semiparametric bigraphical models
- Honest confidence regions and optimality in high-dimensional precision matrix estimation
- Model assisted variable clustering: minimax-optimal recovery and algorithms
- Model selection and estimation in the Gaussian graphical model
- On semiparametric exponential family graphical models
- One-Step Huber Estimates in the Linear Model
- Partial correlation estimation by joint sparse regression models
- ROCKET: robust confidence intervals via Kendall's tau for transelliptical graphical models
- Regularized rank-based estimation of high-dimensional nonparanormal graphical models
- Replicates in high dimensions, with applications to latent variable graphical models
- Sparse inverse covariance estimation with the graphical lasso
- Sparse matrix inversion with scaled Lasso
- Sparse permutation invariant covariance estimation
- Sparsistency and rates of convergence in large covariance matrix estimation
- Testing and Confidence Intervals for High Dimensional Proportional Hazards Models
- The control of the false discovery rate in multiple testing under dependency.
Cited in
(8)- Diffuse Interface Models on Graphs for Classification of High Dimensional Data
- Combinatorial inference for graphical models
- The cluster graphical Lasso for improved estimation of Gaussian graphical models
- StarTrek: combinatorial variable selection with false discovery rate control
- The huge Package for High-dimensional Undirected Graph Estimation in R
- Simultaneous Clustering and Estimation of Heterogeneous Graphical Models
- Identifying graph clusters using variational inference and links to covariance parametrization
- Finding Non-Overlapping Clusters for Generalized Inference Over Graphical Models
This page was built for publication: High-dimensional inference for cluster-based graphical models
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q4969100)