Learning from MOM's principles: Le Cam's approach
Publication: 2010482

Identifiers:
- DOI: 10.1016/J.SPA.2018.11.024
- zbMATH Open: 1435.62175
- arXiv: 1701.01961
- OpenAlex: W2738598838
- Wikidata: Q128702491 (Scholia: Q128702491)
- MaRDI QID: Q2010482
- FDO: Q2010482
Publication date: 27 November 2019
Published in: Stochastic Processes and their Applications
Abstract: We obtain estimation error rates for estimators obtained by aggregation of regularized median-of-means tests, following a construction of Le Cam. The results hold with exponentially large probability, as in the Gaussian framework with independent noise, under only weak moment assumptions on the data and without assuming independence between noise and design. Any norm may be used for regularization; when it has some sparsity-inducing power, we recover sparse rates of convergence. The procedure is robust, since a large part of the data may be corrupted: these outliers have nothing to do with the oracle we want to reconstruct. Our general risk bound is of order
\begin{equation*}
\max\left(\text{minimax rate in the i.i.d. setup},\ \frac{\text{number of outliers}}{\text{number of observations}}\right)\enspace.
\end{equation*}
In particular, the number of outliers may be as large as (number of data) \(\times\) (minimax rate) without affecting this rate. The other data do not have to be identically distributed but should only have equivalent \(L^1\) and \(L^2\) moments. For example, the minimax rate \(s\log(ed/s)/N\) of recovery of an \(s\)-sparse vector in \(\mathbb{R}^d\) is achieved with exponentially large probability by a median-of-means version of the LASSO when the noise has \(q_0\) moments for some \(q_0 > 2\), the entries of the design matrix have \(C\log(d)\) moments, and the dataset can be corrupted by up to \(C\,s\log(ed/s)\) outliers.
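The estimators described in the abstract aggregate regularized median-of-means (MOM) tests; their building block is the basic MOM estimator of a mean. As a point of reference only, here is a minimal sketch of that basic estimator (partition into blocks, blockwise means, median of the block means). The function name, the block count K, and the demo data are illustrative assumptions, not the paper's aggregated test procedure.

```python
import numpy as np

def median_of_means(x, K):
    """Basic median-of-means estimate of the mean of a 1-d sample.

    Splits the N observations at random into K blocks of equal size,
    averages within each block, and returns the median of the block
    means. Larger K buys robustness to roughly K/2 corrupted points
    at the price of a larger deviation constant.
    """
    x = np.asarray(x, dtype=float)
    N = len(x)
    B = N // K                        # block size; drop the remainder
    rng = np.random.default_rng(0)    # random partition of the data
    perm = rng.permutation(N)[: B * K]
    block_means = x[perm].reshape(K, B).mean(axis=1)
    return np.median(block_means)

# Heavy-tailed sample with a few gross outliers: the empirical mean
# is dragged away, while the MOM estimate stays near the true mean 0.
rng = np.random.default_rng(1)
sample = rng.standard_t(df=3, size=1000)
sample[:10] = 1e6                     # 10 adversarial outliers
print(np.mean(sample), median_of_means(sample, K=30))
```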
Full work available at URL: https://arxiv.org/abs/1701.01961
Classification (MSC):
- Nonparametric regression and quantile regression (62G08)
- Nonparametric robustness (62G35)
- Gaussian processes (60G15)
Cites Work
- Estimator selection in the Gaussian setting
- Weak convergence and empirical processes. With applications to statistics
- Statistics for high-dimensional data. Methods, theory and applications.
- Robust Estimation of a Location Parameter
- Title not available
- Title not available
- Robust Statistics
- Asymptotic methods in statistical decision theory
- Challenging the empirical mean and empirical variance: a deviation study
- Convergence of estimates under dimensionality restrictions
- Title not available
- Title not available
- Robust linear least squares regression
- The space complexity of approximating the frequency moments
- Model selection via testing: an alternative to (penalized) maximum likelihood estimators.
- Title not available
- A robust, adaptive M-estimator for pointwise estimation in heteroscedastic regression
- Title not available
- Estimation of High Dimensional Mean Regression in the Absence of Symmetry and Light Tail Assumptions
- Estimator selection with respect to Hellinger-type risks
- SLOPE-adaptive variable selection via convex optimization
- Estimating the intensity of a random measure by histogram type estimators
- Random generation of combinatorial structures from a uniform distribution
- Sparse recovery under weak moment assumptions
- Learning without concentration
- Bounding the Smallest Singular Value of a Random Matrix Without Concentration
- Estimation of the transition density of a Markov chain
- A new method for estimation and model selection: \(\rho\)-estimation
- A Remark on the Diameter of Random Sections of Convex Bodies
- Robust tests for model selection
- SLOPE is adaptive to unknown sparsity and asymptotically minimax
- Small Ball Probabilities for Linear Images of High-Dimensional Distributions
- Rho-estimators for shape restricted density estimation
- Stabilité et instabilité du risque minimax pour des variables indépendantes équidistribuées
- Sub-Gaussian mean estimators
- Title not available
- Empirical risk minimization for heavy-tailed losses
- On aggregation for heavy-tailed classes
- Risk minimization by median-of-means tournaments
Cited In (19)
- Efficient learning with robust gradient descent
- Robust subgaussian estimation with VC-dimension
- All-in-one robust estimator of the Gaussian mean
- Confidence regions and minimax rates in outlier-robust estimation on the probability simplex
- Robust machine learning by median-of-means: theory and practice
- A MOM-based ensemble method for robustness, subsampling and hyperparameter tuning
- Iteratively reweighted \(\ell_1\)-penalized robust regression
- Optimal robust mean and location estimation via convex programs with respect to any pseudo-norms
- Robust statistical learning with Lipschitz and convex loss functions
- Mean estimation and regression under heavy-tailed distributions: A survey
- Regularization, sparse recovery, and median-of-means tournaments
- Core-elements for large-scale least squares estimation
- Robust classification via MOM minimization
- Learning under \((1 + \epsilon)\)-moment conditions
- Multidimensional linear functional estimation in sparse Gaussian models and robust estimation of the mean
- K-bMOM: A robust Lloyd-type clustering algorithm based on bootstrap median-of-means
- Topics in robust statistical learning
- Title not available
- Robust sub-Gaussian estimation of a mean vector in nearly linear time