Estimating minimum effect with outlier selection

From MaRDI portal
Publication:2656596

DOI10.1214/20-AOS1956zbMATH Open1461.62052arXiv1809.08330OpenAlexW2892349150MaRDI QIDQ2656596FDOQ2656596


Authors: Alexandra Carpentier, Étienne Roquain, Nicolas Verzelen, Sylvain Delattre Edit this on Wikidata


Publication date: 11 March 2021

Published in: The Annals of Statistics (Search for Journal in Brave)

Abstract: We introduce one-sided versions of Huber's contamination model, in which corrupted samples tend to take larger values than uncorrupted ones. Two intertwined problems are addressed: estimation of the mean of uncorrupted samples (minimum effect) and selection of corrupted samples (outliers). Regarding the minimum effect estimation, we derive the minimax risks and introduce adaptive estimators to the unknown number of contaminations. Interestingly, the optimal convergence rate highly differs from that in classical Huber's contamination model. Also, our analysis uncovers the effect of particular structural assumptions on the distribution of the contaminated samples. As for the problem of selecting the outliers, we formulate the problem in a multiple testing framework for which the location/scaling of the null hypotheses are unknown. We rigorously prove how estimating the null hypothesis is possible while maintaining a theoretical guarantee on the amount of the falsely selected outliers, both through false discovery rate (FDR) or post hoc bounds. As a by-product, we address a long-standing open issue on FDR control under equi-correlation, which reinforces the interest of removing dependency when making multiple testing.


Full work available at URL: https://arxiv.org/abs/1809.08330




Recommendations




Cites Work


Cited In (6)

Uses Software





This page was built for publication: Estimating minimum effect with outlier selection

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q2656596)