Adaptive novelty detection with false discovery rate guarantee
From MaRDI portal
Publication:6151968
Abstract: Classical false discovery rate (FDR) controlling procedures offer strong and interpretable guarantees but often lack flexibility to work with complex data. By contrast, machine learning-based classification algorithms have superior performances on modern datasets but typically fall short of error-controlling guarantees. In this paper, we make these two meet by introducing a new adaptive novelty detection procedure with FDR control, called AdaDetect. It extends the scope of recent works of multiple testing literature to the high dimensional setting, notably the one in Yang et al. (2021). We prove that AdaDetect comes with finite sample guarantees: it controls the FDR strongly and approximates the oracle in terms of the power, with explicit remainder terms that are small under mild conditions. In practice, AdaDetect can be used in combination with any machine learning-based classifier, which allows the user to choose the most relevant classification approach. We illustrate this with classical real-world datasets, for which random forest and neural network classifiers are particularly efficient. The versatility of our method is also shown with an astrophysical application.
Recommendations
Cites work
- scientific article; zbMATH DE number 1332320 (Why is no real title available?)
- scientific article; zbMATH DE number 720689 (Why is no real title available?)
- scientific article; zbMATH DE number 2168212 (Why is no real title available?)
- scientific article; zbMATH DE number 6306019 (Why is no real title available?)
- scientific article; zbMATH DE number 7064043 (Why is no real title available?)
- A Neyman–Pearson Approach to Statistical Learning
- A sequential algorithm for false discovery rate control on directed acyclic graphs
- A unified treatment of multiple testing with prior knowledge using the p-filter
- AdaPT: an interactive procedure for multiple testing with side information
- Adaptive false discovery rate control under independence and dependence
- Adaptive linear step-up procedures that control the false discovery rate
- Conditional calibration for false discovery rate control under dependence
- Controlling the false discovery rate via knockoffs
- Controlling the number of false discoveries: application to high-dimensional genomic data
- Convergence rates of deep ReLU networks for multiclass classification
- Covariate-assisted ranking and screening for large-scale two-sample inference
- Cross-conformal predictors
- Deep learning
- Density ratio estimation in machine learning. Foreword by Thomas G. Dietterich
- Discovering the False Discovery Rate
- Doing thousands of hypothesis tests at the same time
- Empirical Bayes Analysis of a Microarray Experiment
- Empirical Bayes estimates for large-scale prediction problems
- Estimating the support of a high-dimensional distribution
- Exact and Approximate Stepdown Methods for Multiple Hypothesis Testing
- Exact calculations for false discovery proportion with application to least favorable configura\-tions
- False discovery rate control via debiased Lasso
- Fast learning rates for plug-in classifiers
- Global and Simultaneous Hypothesis Testing for High-Dimensional Logistic Regression Models
- Introduction to High-Dimensional Statistics
- Large-Scale Simultaneous Hypothesis Testing
- Large-scale multiple testing under dependence
- Learning from positive and unlabeled data: a survey
- Microarrays, empirical Bayes and the two-groups model
- Multiple testing for exploratory research
- On methods controlling the false discovery rate
- On the Benjamini-Hochberg method
- Oracle and Adaptive Compound Decision Rules for False Discovery Rate Control
- Predictive inference with the jackknife+
- Robust inference with knockoffs
- SLOPE-adaptive variable selection via convex optimization
- Semi-supervised multiple testing
- Semi-supervised novelty detection
- Simultaneous Testing of Grouped Hypotheses: Finding Needles in Multiple Haystacks
- Smoothed nested testing on directed acyclic graphs
- Strong Control, Conservative Point Estimation and Simultaneous Conservative Consistency of False Discovery Rates: A Unified Approach
- Testing for outliers with conformal p-values
- The \(p\)-filter: multilayer false discovery rate control for grouped hypotheses
- The control of the false discovery rate in multiple testing under dependency.
This page was built for publication: Adaptive novelty detection with false discovery rate guarantee
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q6151968)