Adaptive novelty detection with false discovery rate guarantee

From MaRDI portal
Publication:6151968

DOI10.1214/23-AOS2338arXiv2208.06685MaRDI QIDQ6151968FDOQ6151968

Ariane Marandon, Lihua Lei, Étienne Roquain, David Mary

Publication date: 11 March 2024

Published in: The Annals of Statistics (Search for Journal in Brave)

Abstract: Classical false discovery rate (FDR) controlling procedures offer strong and interpretable guarantees but often lack flexibility to work with complex data. By contrast, machine learning-based classification algorithms have superior performances on modern datasets but typically fall short of error-controlling guarantees. In this paper, we make these two meet by introducing a new adaptive novelty detection procedure with FDR control, called AdaDetect. It extends the scope of recent works of multiple testing literature to the high dimensional setting, notably the one in Yang et al. (2021). We prove that AdaDetect comes with finite sample guarantees: it controls the FDR strongly and approximates the oracle in terms of the power, with explicit remainder terms that are small under mild conditions. In practice, AdaDetect can be used in combination with any machine learning-based classifier, which allows the user to choose the most relevant classification approach. We illustrate this with classical real-world datasets, for which random forest and neural network classifiers are particularly efficient. The versatility of our method is also shown with an astrophysical application.


Full work available at URL: https://arxiv.org/abs/2208.06685







Cites Work






This page was built for publication: Adaptive novelty detection with false discovery rate guarantee

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q6151968)