CoCoLasso for high-dimensional error-in-variables regression

From MaRDI portal
Publication:682285

DOI10.1214/16-AOS1527zbMATH Open1486.62210arXiv1510.07123MaRDI QIDQ682285FDOQ682285


Authors: Abhirup Datta, Hui Zou Edit this on Wikidata


Publication date: 14 February 2018

Published in: The Annals of Statistics (Search for Journal in Brave)

Abstract: Much theoretical and applied work has been devoted to high-dimensional regression with clean data. However, we often face corrupted data in many applications where missing data and measurement errors cannot be ignored. Loh and Wainwright (2012) proposed a non-convex modification of the Lasso for doing high-dimensional regression with noisy and missing data. It is generally agreed that the virtues of convexity contribute fundamentally the success and popularity of the Lasso. In light of this, we propose a new method named CoCoLasso that is convex and can handle a general class of corrupted datasets including the cases of additive measurement error and random missing data. We establish the estimation error bounds of CoCoLasso and its asymptotic sign-consistent selection property. We further elucidate how the standard cross validation techniques can be misleading in presence of measurement error and develop a novel corrected cross-validation technique by using the basic idea in CoCoLasso. The corrected cross-validation has its own importance. We demonstrate the superior performance of our method over the non-convex approach by simulation studies.


Full work available at URL: https://arxiv.org/abs/1510.07123




Recommendations





Cited In (35)





This page was built for publication: CoCoLasso for high-dimensional error-in-variables regression

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q682285)