Adaptive novelty detection with false discovery rate guarantee

DOI10.1214/23-AOS2338MaRDI QIDQ6151968zbMATH OpenFDO

Authors Ariane Marandon, Lihua Lei, David Mary, Étienne Roquain

Publication date 11 March 2024

Published in The Annals of Statistics (Search for Journal in Brave)

Full work available at URL https://arxiv.org/abs/2208.06685, https://projecteuclid.org/journals/annals-of-statistics/volume-52/issue-1/Adaptive-novelty-detection-with-false-discovery-rate-guarantee/10.1214/23-AOS2338.full

zbMATH Keywords

classification false discovery rate machine learning neural network novelty detection knockoff adaptive multiple testing conformal \(p\)-values

Mathematics Subject Classification ID

Nonparametric hypothesis testing (62G10) Classification and discrimination; cluster analysis (statistical aspects) (62H30) Paired and multiple comparisons; multiple testing (62J15)

Abstract: Classical false discovery rate (FDR) controlling procedures offer strong and interpretable guarantees but often lack flexibility to work with complex data. By contrast, machine learning-based classification algorithms have superior performances on modern datasets but typically fall short of error-controlling guarantees. In this paper, we make these two meet by introducing a new adaptive novelty detection procedure with FDR control, called AdaDetect. It extends the scope of recent works of multiple testing literature to the high dimensional setting, notably the one in Yang et al. (2021). We prove that AdaDetect comes with finite sample guarantees: it controls the FDR strongly and approximates the oracle in terms of the power, with explicit remainder terms that are small under mild conditions. In practice, AdaDetect can be used in combination with any machine learning-based classifier, which allows the user to choose the most relevant classification approach. We illustrate this with classical real-world datasets, for which random forest and neural network classifiers are particularly efficient. The versatility of our method is also shown with an astrophysical application.

Recommendations

Cites work

Cited in

(5)

This page was built for publication: Adaptive novelty detection with false discovery rate guarantee

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q6151968)