Valid post-selection inference
Abstract: It is common practice in statistical data analysis to perform data-driven variable selection and derive statistical inference from the resulting model. Such inference enjoys none of the guarantees that classical statistical theory provides for tests and confidence intervals when the model has been chosen a priori. We propose to produce valid "post-selection inference" by reducing the problem to one of simultaneous inference and hence suitably widening conventional confidence and retention intervals. Simultaneity is required for all linear functions that arise as coefficient estimates in all submodels. By purchasing "simultaneity insurance" for all possible submodels, the resulting post-selection inference is rendered universally valid under all possible model selection procedures. This inference is therefore generally conservative for particular selection procedures, but it is always less conservative than full Scheffé protection. Importantly, it does not depend on the truth of the selected submodel, and hence it produces valid inference even in wrong models. We describe the structure of the simultaneous inference problem and give some asymptotic results.
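The widening described in the abstract can be illustrated with a small Monte Carlo sketch. It is not the authors' implementation. It assumes Gaussian errors with known variance, so every adjusted coefficient statistic is standard normal, and it uses a made-up 6 x 3 design. The names `posi_directions` and `simulate_constants` are hypothetical. For each submodel M and each j in M, the sketch forms the unit vector of column j adjusted for the other columns in M, then estimates the 1 - alpha quantile of the largest absolute statistic over all such pairs. It compares that constant with the unadjusted normal quantile and with a simulated Scheffé constant (the same quantile of the norm of the noise projected onto the column space).

```python
import itertools
import math
import random
from statistics import NormalDist


def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


def residual(v, basis):
    """Remove from v its components along an orthonormal basis."""
    r = list(v)
    for q in basis:
        c = dot(r, q)
        r = [a - c * b for a, b in zip(r, q)]
    return r


def normalize(v):
    norm = math.sqrt(dot(v, v))
    return [a / norm for a in v]


def orthonormalize(vectors):
    """Gram-Schmidt; assumes the vectors are linearly independent."""
    basis = []
    for v in vectors:
        basis.append(normalize(residual(v, basis)))
    return basis


def posi_directions(columns):
    """Unit vectors for column j adjusted for the other columns in M,
    over every non-empty submodel M and every j in M."""
    p = len(columns)
    directions = []
    for size in range(1, p + 1):
        for model in itertools.combinations(range(p), size):
            for j in model:
                others = orthonormalize([columns[k] for k in model if k != j])
                directions.append(normalize(residual(columns[j], others)))
    return directions


def simulate_constants(columns, alpha=0.05, draws=10000, seed=0):
    """Monte Carlo estimates of the simultaneous constant over all
    submodels and of the Scheffe constant (known-variance assumption)."""
    rng = random.Random(seed)
    n = len(columns[0])
    directions = posi_directions(columns)
    col_basis = orthonormalize(columns)
    max_stats, scheffe_stats = [], []
    for _ in range(draws):
        z = [rng.gauss(0.0, 1.0) for _ in range(n)]
        # Largest |t| over all (j, M) pairs for this noise draw.
        max_stats.append(max(abs(dot(d, z)) for d in directions))
        # Norm of the noise projected onto the full column space.
        scheffe_stats.append(math.sqrt(sum(dot(q, z) ** 2 for q in col_basis)))
    k = math.ceil((1 - alpha) * draws) - 1
    return sorted(max_stats)[k], sorted(scheffe_stats)[k]


# Hypothetical design: p = 3 columns, n = 6 observations each.
columns = [
    [1.0, 1.0, 1.0, 1.0, 1.0, 1.0],
    [-2.5, -1.5, -0.5, 0.5, 1.5, 2.5],
    [1.0, -1.0, 2.0, -2.0, 0.5, -0.5],
]

alpha = 0.05
z_naive = NormalDist().inv_cdf(1 - alpha / 2)
k_posi, k_scheffe = simulate_constants(columns, alpha=alpha)
print(f"unadjusted: {z_naive:.3f}  all-submodels: {k_posi:.3f}  Scheffe: {k_scheffe:.3f}")
```

Under these assumptions, an interval of the form estimate ± k_posi × standard error is wider than the conventional one but no wider than the Scheffé interval. Every adjusted direction is a unit vector in the column space, so the largest statistic can never exceed the projected norm. The number of (j, M) pairs is p·2^(p-1), so exhaustive enumeration like this is only practical for small p.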
Recommendations
- Exact post-selection inference, with application to the Lasso
- Valid post-selection inference in model-free linear regression
- MODEL SELECTION AND INFERENCE: FACTS AND FICTION
- Selective inference after likelihood- or test-based model selection in linear models
- Conditional predictive inference post model selection
Cites work
- scientific article; zbMATH DE number 3141625
- scientific article; zbMATH DE number 4100415
- scientific article; zbMATH DE number 46309
- A Note on Quantiles in Large Samples
- Asymptotic properties of maximum likelihood estimators based on conditional specification
- CAN ONE ESTIMATE THE UNCONDITIONAL DISTRIBUTION OF POST-MODEL-SELECTION ESTIMATORS?
- Can one estimate the conditional distribution of post-model-selection estimators?
- Confidence sets based on penalized maximum likelihood estimators in Gaussian regression
- Distributional results for thresholding estimators in high-dimensional Gaussian regression models
- Frequentist Model Average Estimators
- MODEL SELECTION AND INFERENCE: FACTS AND FICTION
- Mostly harmless econometrics. An empiricist's companion.
- Note on a Conditional Property of Student's $t$
- On model uncertainty and its statistical implications. Proceedings of a workshop, held in Groningen, Netherlands, September 25-26, 1986
- On preliminary test and shrinkage M-estimation in linear models
- On the Large-Sample Minimal Coverage Probability of Confidence Intervals After Model Selection
- On the distribution of penalized maximum likelihood estimators: the LASSO, SCAD, and thresholding
- On the distribution of the adaptive LASSO estimator
- PERFORMANCE LIMITS FOR ESTIMATORS OF THE RISK OR DISTRIBUTION OF SHRINKAGE-TYPE ESTIMATORS, AND SOME GENERAL LOWER RISK-BOUND RESULTS
- Random Packings and Coverings of the Unit n-Sphere
- Sparse estimators and the oracle property, or the return of Hodges' estimator
- THE FINITE-SAMPLE DISTRIBUTION OF POST-MODEL-SELECTION ESTIMATORS AND UNIFORM VERSUS NONUNIFORM APPROXIMATIONS
- The Conditional Level of Student's $t$ Test
- The Conditional Level of the F-Test
- The Focused Information Criterion
- The distribution of a linear predictor after model selection: unconditional finite-sample distributions and asymptotic approximations
- The distribution of model averaging estimators and an impossibility result regarding its estimation
- Valid post-selection inference
Cited in
(only the first 100 items are shown)
- A bootstrap Lasso+partial ridge method to construct confidence intervals for parameters in high-dimensional sparse linear models
- Kernel Ordinary Differential Equations
- SLOPE-adaptive variable selection via convex optimization
- On various confidence intervals post-model-selection
- High-dimensional inference: confidence intervals, \(p\)-values and R-software \texttt{hdi}
- Heterogeneous heterogeneity by default: Testing categorical moderators in mixed‐effects meta‐analysis
- Inference for low‐ and high‐dimensional inhomogeneous Gibbs point processes
- Confidence intervals for tree-structured varying coefficients
- Proximal MCMC for Bayesian Inference of Constrained and Regularized Estimation
- Inference After Model Selection
- False Discovery Rate Control via Data Splitting
- High dimensional regression with many nuisance parameters: both cases of specified and unspecified parameters of interest
- Selective inference after convex clustering with \(\ell_1\) penalization
- Post-selection inference of generalized linear models based on the lasso and the elastic net
- Parametric programming-based approximate selective inference for adaptive lasso, adaptive elastic net and group lasso
- Forward-selected panel data approach for program evaluation
- Selection of mixed copula for association modeling with tied observations
- Valid Inference After Causal Discovery
- Exploration of the variability of variable selection based on distances between bootstrap sample results
- Confidently Comparing Estimates with the c-value
- Selective inference for latent block models
- On Hodges' superefficiency and merits of oracle property in model selection
- Simultaneous high-probability bounds on the false discovery proportion in structured, regression and online settings
- Distribution-free predictive inference for regression
- Distributionally robust and generalizable inference
- FANOK: knockoffs in linear time
- A conditional Bayesian approach with valid inference for high dimensional logistic regression
- Uniformly valid confidence intervals post-model-selection
- Selective inference with distributed data
- Post-selection inference for the Cox model with interval-censored data
- Markov Neighborhood Regression for High-Dimensional Inference
- The costs and benefits of uniformly valid causal inference with high-dimensional nuisance parameters
- Asymptotics of selective inference
- Post hoc confidence bounds on false positives using reference families
- An evolutionary estimation procedure for generalized semilinear regression trees
- Penalized estimation of a class of single‐index varying‐coefficient models for integrative genomic analysis
- Bootstrapping and sample splitting for high-dimensional, assumption-lean inference
- Penalized likelihood and multiple testing
- Valid post-selection inference in high-dimensional approximately sparse quantile regression models
- A knockoff filter for high-dimensional selective inference
- Projection-based Inference for High-dimensional Linear Models
- scientific article; zbMATH DE number 7750673
- Exact post-selection inference for the generalized Lasso path
- Small Tuning Parameter Selection for the Debiased Lasso
- Inferactive data analysis
- Inference for High-Dimensional Censored Quantile Regression
- Exact post-selection inference for adjusted R squared selection
- Confidence Intervals for Parameters of Unobserved Events
- Optimal configurations of lines and a statistical application
- scientific article; zbMATH DE number 7750675
- Bootstrap for inference after model selection and model averaging for likelihood models
- On the post selection inference constant under restricted isometry properties
- Optimal model averaging for divergent-dimensional Poisson regressions
- The Perils of Balance Testing in Experimental Design: Messy Analyses of Clean Data
- Selective inference for additive and linear mixed models
- A new approach to the gender pay gap decomposition by economic activity
- Variable selection in modelling clustered data via within-cluster resampling
- Ensuring valid inference for Cox hazard ratios after variable selection
- Sparse estimation in semiparametric finite mixture of varying coefficient regression models
- Logistic regression: from art to science
- Targeted Inference Involving High-Dimensional Data Using Nuisance Penalized Regression
- On the length of post-model-selection confidence intervals conditional on polyhedral constraints
- Inference for possibly misspecified generalized linear models with nonpolynomial-dimensional nuisance parameters
- Exact selective inference with randomization
- Variable Selection for Global Fréchet Regression
- Post-selection inference for \(\ell_1\)-penalized likelihood models
- Focused model selection for linear mixed models with an application to whale ecology
- High-dimensional statistical inference via DATE
- High-dimensional CLT: improvements, non-uniform extensions and large deviations
- Post-selection inference following aggregate level hypothesis testing in large-scale genomic data
- Post-model-selection inference in linear regression models: an integrated review
- Selective inference with unknown variance via the square-root Lasso
- Trade-off between predictive performance and FDR control for high-dimensional Gaussian model selection
- Asymptotic post-selection inference for regularized graphical models
- Estimation and group-feature selection in sparse mixture-of-experts with diverging number of parameters
- On the impact of model selection on predictor identification and parameter inference
- Selective inference after likelihood- or test-based model selection in linear models
- Uniformly valid confidence sets based on the Lasso
- Projection-based techniques for high-dimensional optimal transport problems
- Integrative analysis of `-omics' data using penalty functions
- Locally simultaneous inference
- Improving power by conditioning on less in post-selection inference for changepoints
- On the least-squares model averaging interval estimator
- Post-selection inference in regression models for group testing data
- Two-step estimation in ratio-of-mediator-probability weighted causal mediation analysis
- Integrative Bayesian Models Using Post-Selective Inference: A Case Study in Radiogenomics
- Empirical likelihood based tests for detecting the presence of significant predictors in marginal quantile regression
- Post-selection inference via algorithmic stability
- Cellwise outlier detection with false discovery rate control
- A structured brain‐wide and genome‐wide association study using ADNI PET images
- Generalized fused Lasso for treatment pooling in network meta-analysis
- Statistical learning and selective inference
- Model selection and inference for estimation of causal parameters
- A multi-resolution theory for approximating infinite-\(p\)-zero-\(n\): transitional inference, individualized predictions, and a world without bias-variance tradeoff
- Post-selection estimation and testing following aggregate association tests
- Carving model-free inference
- Approximate Selective Inference via Maximum Likelihood
- Unlucky Number 13? Manipulating Evidence Subject to Snooping
- Markov neighborhood regression for statistical inference of high-dimensional generalized linear models
- Benchmarking sparse variable selection methods for genomic data analyses
This page was built for publication: Valid post-selection inference (MaRDI item Q355109)