A knockoff filter for high-dimensional selective inference
From MaRDI portal
(Redirected from Publication:2328050)
Abstract: This paper develops a framework for testing for associations in a possibly high-dimensional linear model where the number of features/variables may far exceed the number of observational units. In this framework, the observations are split into two groups, where the first group is used to screen for a set of potentially relevant variables, whereas the second is used for inference over this reduced set of variables; we also develop strategies for leveraging information from the first part of the data at the inference step for greater power. In our work, the inferential step is carried out by applying the recently introduced knockoff filter, which creates a knockoff copy-a fake variable serving as a control-for each screened variable. We prove that this procedure controls the directional false discovery rate (FDR) in the reduced model controlling for all screened variables; this says that our high-dimensional knockoff procedure 'discovers' important variables as well as the directions (signs) of their effects, in such a way that the expected proportion of wrongly chosen signs is below the user-specified level (thereby controlling a notion of Type S error averaged over the selected set). This result is non-asymptotic, and holds for any distribution of the original features and any values of the unknown regression coefficients, so that inference is not calibrated under hypothesized values of the effect sizes. We demonstrate the performance of our general and flexible approach through numerical studies, showing more power than existing alternatives. Finally, we apply our method to a genome-wide association study to find locations on the genome that are possibly associated with a continuous phenotype.
Recommendations
Cites work
- scientific article; zbMATH DE number 5957408 (Why is no real title available?)
- scientific article; zbMATH DE number 720689 (Why is no real title available?)
- scientific article; zbMATH DE number 1906319 (Why is no real title available?)
- scientific article; zbMATH DE number 845714 (Why is no real title available?)
- A knockoff filter for high-dimensional selective inference
- A significance test for the lasso
- Asymptotics of selective inference
- Can one estimate the conditional distribution of post-model-selection estimators?
- Confidence Intervals and Hypothesis Testing for High-Dimensional Regression
- Confidence intervals for low dimensional parameters in high dimensional linear models
- Controlling Variable Selection by the Addition of Pseudovariables
- Controlling the false discovery rate via knockoffs
- EigenPrism: inference for high dimensional signal-to-noise ratios
- Exact post-selection inference, with application to the Lasso
- False Discovery Rate–Adjusted Multiple Confidence Intervals for Selected Parameters
- False discoveries occur early on the Lasso path
- Familywise error rate control via knockoffs
- Graph estimation with joint additive models
- High-dimensional variable selection
- Inference on treatment effects after selection among high-dimensional controls
- John W. Tukey's contributions to multiple comparisons
- Panning for Gold: ‘Model-X’ Knockoffs for High Dimensional Controlled Variable Selection
- Selective inference with unknown variance via the square-root Lasso
- Sequential selection procedures and false discovery rate control
- Square-root lasso: pivotal recovery of sparse signals via conic programming
- Sure independence screening for ultrahigh dimensional feature space. With discussion and authors' reply
- The sparsity and bias of the LASSO selection in high-dimensional linear regression
- Type S error for classical and Bayesian single and multiple comparison procedures
- Valid post-selection inference
Cited in
(61)- Derandomizing Knockoffs
- Nonparametric augmented probability weighting with sparsity
- False Discovery Rate Control via Data Splitting
- Nonparametric false discovery rate control for identifying simultaneous signals
- A power analysis for Model-X knockoffs with \(\ell_p\)-regularized statistics
- The revisited knockoffs method for variable selection in L1-penalized regressions
- Revisiting feature selection for linear models with FDR and power guarantees
- Sufficient variable screening with high-dimensional controls
- Large-Scale Two-Sample Comparison of Support Sets
- Sequential knockoffs for continuous and categorical predictors: with application to a large psoriatic arthritis clinical trial pool
- FANOK: knockoffs in linear time
- A generalized knockoff procedure for FDR control in structural change detection
- IPAD: stable interpretable forecasting with knockoffs inference
- Knockoff procedure for false discovery rate control in high-dimensional data streams
- A knockoff filter for high-dimensional selective inference
- Projection-based Inference for High-dimensional Linear Models
- Structure learning of exponential family graphical model with false discovery rate control
- Null-free false discovery rate control using decoy permutations
- FDR control and power analysis for high-dimensional logistic regression via Stabkoff
- Robust inference with knockoffs
- Multilayer knockoff filter: controlled variable selection at multiple resolutions
- scientific article; zbMATH DE number 6453379 (Why is no real title available?)
- Reproducible learning for accelerated failure time models via deep knockoffs
- False discovery rate-controlled multiple testing for union null hypotheses: a knockoff-based approach
- Online rules for control of false discovery rate and false discovery exceedance
- Empirical Bayes cumulative \(\ell\)-value multiple testing procedure for sparse sequences
- Feature screening and FDR control with knockoff features for ultrahigh-dimensional right-censored data
- A Critical Review of LASSO and Its Derivatives for Variable Selection Under Dependence Among Covariates
- Model-Free Conditional Feature Screening with FDR Control
- GGM Knockoff Filter: False Discovery Rate Control for Gaussian Graphical Models
- RANK: Large-Scale Inference With Graphical Nonlinear Knockoffs
- Overview of research advance for knockoff methods
- Differential network knockoff filter with application to brain connectivity analysis
- A powerful procedure that controls the false discovery rate with directional information
- Testing Mediation Effects Using Logic of Boolean Matrices
- Learning sparse conditional distribution: an efficient kernel-based approach
- Multicarving for high-dimensional post-selection inference
- StarTrek: combinatorial variable selection with false discovery rate control
- Determine the number of clusters by data augmentation
- Adaptive procedures for directional false discovery rate control
- Two-directional simultaneous inference for high-dimensional models
- A stable and adaptive polygenic signal detection method based on repeated sample splitting
- Model-Free Feature Screening and FDR Control With Knockoff Features
- A prototype knockoff filter for group selection with FDR control
- Panning for Gold: ‘Model-X’ Knockoffs for High Dimensional Controlled Variable Selection
- Split Knockoffs for Multiple Comparisons: Controlling the Directional False Discovery Rate
- Stab-GKnock: controlled variable selection for partially linear models using generalized knockoffs
- Kernel Knockoffs Selection for Nonparametric Additive Models
- Reproducible feature selection in high-dimensional accelerated failure time models
- Knockoffs with side information
- Gene hunting with hidden Markov model knockoffs
- False Discovery Rate Control Under General Dependence By Symmetrized Data Aggregation
- Reproducible learning in large-scale graphical models
- Compositional knockoff filter for high‐dimensional regression analysis of microbiome data
- Threshold Selection in Feature Screening for Error Rate Control
- Semi-supervised multiple testing
- A robust knockoff filter for sparse regression analysis of microbiome compositional data
- An ensemble learning method for variable selection: application to high-dimensional data and missing values
- Selective inference via marginal screening for high dimensional classification
- CoxKnockoff: controlled feature selection for the Cox model using knockoffs
- Statistical proof? The problem of irreproducibility
This page was built for publication: A knockoff filter for high-dimensional selective inference
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q2328050)