CoCoLasso for high-dimensional error-in-variables regression
From MaRDI portal
Publication:682285
DOI10.1214/16-AOS1527zbMATH Open1486.62210arXiv1510.07123MaRDI QIDQ682285FDOQ682285
Authors: Abhirup Datta, Hui Zou
Publication date: 14 February 2018
Published in: The Annals of Statistics (Search for Journal in Brave)
Abstract: Much theoretical and applied work has been devoted to high-dimensional regression with clean data. However, we often face corrupted data in many applications where missing data and measurement errors cannot be ignored. Loh and Wainwright (2012) proposed a non-convex modification of the Lasso for doing high-dimensional regression with noisy and missing data. It is generally agreed that the virtues of convexity contribute fundamentally the success and popularity of the Lasso. In light of this, we propose a new method named CoCoLasso that is convex and can handle a general class of corrupted datasets including the cases of additive measurement error and random missing data. We establish the estimation error bounds of CoCoLasso and its asymptotic sign-consistent selection property. We further elucidate how the standard cross validation techniques can be misleading in presence of measurement error and develop a novel corrected cross-validation technique by using the basic idea in CoCoLasso. The corrected cross-validation has its own importance. We demonstrate the superior performance of our method over the non-convex approach by simulation studies.
Full work available at URL: https://arxiv.org/abs/1510.07123
Recommendations
- Calibrated zero-norm regularized LS estimator for high-dimensional error-in-variables regression
- Balanced estimation for high-dimensional measurement error models
- Sparse estimation in high-dimensional linear errors-in-variables regression via a covariate relaxation method
- Scalable interpretable learning for multi-response error-in-variables regression
- Linear and Conic Programming Estimators in High Dimensional Errors-in-variables Models
Asymptotic properties of parametric estimators (62F12) Ridge regression; shrinkage estimators (Lasso) (62J07)
Cited In (35)
- Learning partial differential equations for biological transport models from noisy spatio-temporal data
- On parameter estimation for high dimensional errors-in-variables models
- Inference in high dimensional linear measurement error models
- Subgroup analysis method for accelerated failure time model
- Variable selection for high‐dimensional generalized linear model with block‐missing data
- Multi-Task Learning with High-Dimensional Noisy Images
- Screening Methods for Linear Errors-in-Variables Models in High Dimensions
- Model selection in high-dimensional noisy data: a simulation study
- Covariance-regularized regression and classification for high dimensional problems
- Balanced estimation for high-dimensional measurement error models
- Rate optimal estimation and confidence intervals for high-dimensional regression with missing covariates
- Detection of block-exchangeable structure in large-scale correlation matrices
- On Robustness of Principal Component Regression
- An Explicit Mean-Covariance Parameterization for Multivariate Response Linear Regression
- On high-dimensional Poisson models with measurement error: hypothesis testing for nonlinear nonconvex optimization
- A unified precision matrix estimation framework via sparse column-wise inverse operator under weak sparsity
- Title not available (Why is that?)
- Calibrated zero-norm regularized LS estimator for high-dimensional error-in-variables regression
- Estimating high-dimensional covariance and precision matrices under general missing dependence
- Low-rank matrix estimation via nonconvex optimization methods in multi-response errors-in-variables regression
- MEBoost: variable selection in the presence of measurement error
- STRATOS guidance document on measurement error and misclassification of variables in observational epidemiology. II: More complex methods of adjustment and advanced topics
- L 0 -regularization for high-dimensional regression with corrupted data
- Error bound of critical points and KL property of exponent 1/2 for squared F-norm regularized factorization
- Sparse estimation in high-dimensional linear errors-in-variables regression via a covariate relaxation method
- A Note on Cross-Validation for Lasso Under Measurement Errors
- Inference for high dimensional linear models with error-in-variables
- Double bias correction for high-dimensional sparse additive hazards regression with covariate measurement errors
- Identification of survival relevant genes with measurement error in gene expression incorporated
- Optimal sparse linear prediction for block-missing multi-modality data without imputation
- Poisson Regression With Error Corrupted High Dimensional Features
- Scalable interpretable learning for multi-response error-in-variables regression
- Logistic regression error-in-covariate models for longitudinal high-dimensional covariates
- High-dimensional regression with potential prior information on variable importance
- Adaptive Bayesian SLOPE: Model Selection With Incomplete Data
This page was built for publication: CoCoLasso for high-dimensional error-in-variables regression
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q682285)