Transposable regularized covariance models with an application to missing data imputation
From MaRDI portal
(Redirected from Publication:993250)
Abstract: Missing data estimation is an important challenge with high-dimensional data arranged in the form of a matrix. Typically this data matrix is transposable, meaning that either the rows, columns or both can be treated as features. To model transposable data, we present a modification of the matrix-variate normal, the mean-restricted matrix-variate normal, in which the rows and columns each have a separate mean vector and covariance matrix. By placing additive penalties on the inverse covariance matrices of the rows and columns, these so-called transposable regularized covariance models allow for maximum likelihood estimation of the mean and nonsingular covariance matrices. Using these models, we formulate EM-type algorithms for missing data imputation in both the multivariate and transposable frameworks. We present theoretical results exploiting the structure of our transposable models that allow these models and imputation methods to be applied to high-dimensional data. Simulations and results on microarray data and the Netflix data show that these imputation techniques often outperform existing methods and offer a greater degree of flexibility.
Recommendations
- Missing values: sparse inverse covariance estimation and an extension to sparse regression
- An Imputation–Regularized Optimization Algorithm for High Dimensional Missing Data Problems and Beyond
- High-dimensional covariance matrix estimation with missing observations
- Pattern alternating maximization algorithm for missing data in high-dimensional problems
- Minimax rate-optimal estimation of high-dimensional covariance matrices with incomplete data
Cites work
- scientific article; zbMATH DE number 4159863 (Why is no real title available?)
- scientific article; zbMATH DE number 1834445 (Why is no real title available?)
- scientific article; zbMATH DE number 1391247 (Why is no real title available?)
- Are a set of microarrays independent of each other?
- Choosing starting values for the EM algorithm for getting the highest likelihood in multivariate Gaussian mixture models
- Component selection and smoothing in multivariate nonparametric regression
- Correlated \(z\)-values and the accuracy of large-scale statistical estimates
- Covariance-regularized regression and classification for high dimensional problems
- Exact matrix completion via convex optimization
- Maximum likelihood estimation via the ECM algorithm: A general framework
- Multiple Imputation After 18+ Years
- Sparse inverse covariance estimation with the graphical lasso
- Sparse permutation invariant covariance estimation
- Stochastic versions of the em algorithm: an experimental study in the mixture case
- The mle algorithm for the matrix normal distribution
- Transposable regularized covariance models with an application to missing data imputation
- Variable Selection via Nonconcave Penalized Likelihood and its Oracle Properties
Cited in
(41)- PLS for Big Data: a unified parallel algorithm for regularised group PLS
- Existence and uniqueness of the maximum likelihood estimator for models with a Kronecker product covariance structure
- Transposable regularized covariance models with an application to missing data imputation
- Hypothesis Testing of Matrix Graph Model with Application to Brain Connectivity Analysis
- Scalable Bayesian matrix normal graphical models for brain functional networks
- The mixed Lipschitz space and its dual for tree metrics
- A generalized least-square matrix decomposition
- Consistency of large dimensional sample covariance matrix under weak dependence
- Pattern alternating maximization algorithm for missing data in high-dimensional problems
- Gaussian and robust Kronecker product covariance estimation: existence and uniqueness
- Covariance estimation via sparse Kronecker structures
- Gemini: graph estimation with matrix variate normal instances
- Model selection and estimation in the matrix normal graphical model
- Co-clustering of spatially resolved transcriptomic data
- Kronecker-structured covariance models for multiway data
- Separable factor analysis with applications to mortality data
- Estimating high-dimensional covariance and precision matrices under general missing dependence
- Concentration of measure bounds for matrix-variate data with missing values
- Inferring Phenotypic Trait Evolution on Large Trees With Many Incomplete Measurements
- Imputation and low-rank estimation with missing not at random data
- scientific article; zbMATH DE number 7626713 (Why is no real title available?)
- An expectation-maximization algorithm for the matrix normal distribution with an application in remote sensing
- Mixed Hölder matrix discovery via wavelet shrinkage and Calderón-Zygmund decompositions
- Many-sample tests for the equality and the proportionality hypotheses between large covariance matrices
- Maximum likelihood estimation for matrix normal models via quiver representations
- Testing high-dimensional mean vector with applications. A normal reference approach
- Unifying and generalizing methods for removing unwanted variation based on negative controls
- Testing the mean matrix in high-dimensional transposable data
- A constrained matrix-variate Gaussian process for transposable data
- Missing values: sparse inverse covariance estimation and an extension to sparse regression
- Matrix Completion, Counterfactuals, and Factor Analysis of Missing Data
- Sparse Matrix Graphical Models
- Mixture of multivariate Gaussian processes for classification of irregularly sampled satellite image time-series
- Existence and uniqueness of the Kronecker covariance MLE
- Recovering networks from distance data
- Graphical model selection and estimation for high dimensional tensor data
- Autoregressive identification of Kronecker graphical models
- A Penalized Likelihood Method for Classification With Matrix-Valued Predictors
- Hypothesis testing for the covariance matrix in high-dimensional transposable data with Kronecker product dependence structure
- Detecting column dependence when rows are correlated and estimating the strength of the row correlation
- Sampling, denoising and compression of matrices by coherent matrix organization
This page was built for publication: Transposable regularized covariance models with an application to missing data imputation
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q993250)