An Imputation–Regularized Optimization Algorithm for High Dimensional Missing Data Problems and Beyond
From MaRDI portal
Publication:4628019
Abstract: Missing data are frequently encountered in high-dimensional problems, but they are usually difficult to deal with using standard algorithms, such as the expectation-maximization (EM) algorithm and its variants. To tackle this difficulty, some problem-specific algorithms have been developed in the literature, but there still lacks a general algorithm. This work is to fill the gap: we propose a general algorithm for high-dimensional missing data problems. The proposed algorithm works by iterating between an imputation step and a consistency step. At the imputation step, the missing data are imputed conditional on the observed data and the current estimate of parameters; and at the consistency step, a consistent estimate is found for the minimizer of a Kullback-Leibler divergence defined on the pseudo-complete data. For high dimensional problems, the consistent estimate can be found under sparsity constraints. The consistency of the averaged estimate for the true parameter can be established under quite general conditions. The proposed algorithm is illustrated using high-dimensional Gaussian graphical models, high-dimensional variable selection, and a random coefficient model.
Recommendations
- From predictive methods to missing data imputation: an optimization approach
- Semi-parametric optimization for missing data imputation
- Pattern alternating maximization algorithm for missing data in high-dimensional problems
- Imputation and low-rank estimation with missing not at random data
- Chunk-wise regularised PCA-based imputation of missing data
- On algorithmic and modeling approaches to imputation in large data sets
- A new algorithm to impute the missing values in the multivariate case
- Optimal imputation of the missing data using multi auxiliary information
- scientific article; zbMATH DE number 1945810
Cited in
(11)- Two directional Laplacian pyramids with application to data imputation
- Bayesian regularization of Gaussian graphical models with measurement error
- Transposable regularized covariance models with an application to missing data imputation
- Time-varying dynamic Bayesian network learning for an fMRI study of emotion processing
- Missing values: sparse inverse covariance estimation and an extension to sparse regression
- Variable selection for high-dimensional incomplete data using horseshoe estimation with data augmentation
- Estimating high-dimensional covariance and precision matrices under general missing dependence
- Online missing value imputation for high-dimensional mixed-type data via generalized factor models
- Bayesian estimation of sparse precision matrices in the presence of Gaussian measurement error
- Pattern alternating maximization algorithm for missing data in high-dimensional problems
- Regularized TLS-EM for estimating missing data
This page was built for publication: An Imputation–Regularized Optimization Algorithm for High Dimensional Missing Data Problems and Beyond
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q4628019)