High-dimensional variable selection
From MaRDI portal
Point estimation (62F10) Asymptotic properties of parametric estimators (62F12) Linear regression; mixed models (62J05) Applications of statistics to biology and medical sciences; meta analysis (62P10) Estimation in multivariate analysis (62H12) Ridge regression; shrinkage estimators (Lasso) (62J07)
Abstract: This paper explores the following question: what kind of statistical guarantees can be given when doing variable selection in high-dimensional models? In particular, we look at the error rates and power of some multi-stage regression methods. In the first stage we fit a set of candidate models. In the second stage we select one model by cross-validation. In the third stage we use hypothesis testing to eliminate some variables. We refer to the first two stages as "screening" and the last stage as "cleaning." We consider three screening methods: the lasso, marginal regression, and forward stepwise regression. Our method gives consistent variable selection under certain conditions.
Recommendations
- Variable selection in high dimensional data analysis with applications
- Variable selection for high dimensional multivariate outcomes
- Variable selection in high-dimensional partially linear models
- A Selective Overview of Variable Selection in High Dimensional Feature Space (Invited Review Article)
- Selection of variables and dimension reduction in high-dimensional non-parametric regression
- Variable selection methods in high-dimensional regression -- a simulation study
- Simultaneous dimension reduction and variable selection in modeling high dimensional data
- Variable selection and estimation in high-dimensional partially linear models
- High Dimensional Variable Selection via Tilting
- Feature selection for high-dimensional data
Cites work
- scientific article; zbMATH DE number 5957408 (Why is no real title available?)
- scientific article; zbMATH DE number 845714 (Why is no real title available?)
- Approximation and learning by greedy algorithms
- Boosting for high-dimensional linear models
- Causation, prediction, and search. With additional material by David Heckerman, Christopher Meek, Gregory F. Cooper and Thomas Richardson.
- For most large underdetermined systems of linear equations the minimal 𝓁1‐norm solution is also the sparsest solution
- Greed is Good: Algorithmic Results for Sparse Approximation
- High-dimensional graphs and variable selection with the Lasso
- Just relax: convex programming methods for identifying sparse signals in noise
- Lasso-type recovery of sparse representations for high-dimensional data
- Least angle regression. (With discussion)
- Persistene in high-dimensional linear predictor-selection and the virtue of overparametrization
- Relaxed Lasso
- The Adaptive Lasso and Its Oracle Properties
- The Dantzig selector: statistical estimation when \(p\) is much larger than \(n\). (With discussions and rejoinder).
- The sparsity and bias of the LASSO selection in high-dimensional linear regression
- Uniform consistency in causal inference
Cited in
(only showing first 100 items - show all)- Tests for high-dimensional single-index models
- SLOPE-adaptive variable selection via convex optimization
- Conformal inference for random objects
- Derandomizing Knockoffs
- High-dimensional inference: confidence intervals, \(p\)-values and R-software \texttt{hdi}
- Variable selection for high-dimensional Cox model with error rate control
- A regularization-based adaptive test for high-dimensional GLMs
- Testing the differential network between two gaussian graphical models with false discovery rate control
- Hierarchical inference for genome-wide association studies: a view on methodology with software
- Controlling the false-discovery rate by procedures adapted to the length bias of RNA-seq
- scientific article; zbMATH DE number 7559724 (Why is no real title available?)
- Minimal conditions for consistent variable selection in high dimension
- False Discovery Rate Control via Data Splitting
- UPS delivers optimal phase diagram in high-dimensional variable selection
- Covariate assisted screening and estimation
- Two-sample spatial rank test using projection
- Variable selection for longitudinal data with high-dimensional covariates and dropouts
- Testing covariates in high dimension linear regression with latent factors
- Dimension-agnostic change point detection
- A stepwise regression algorithm for high-dimensional variable selection
- Robust Variable and Interaction Selection for Logistic Regression and General Index Models
- Sure independence screening for ultrahigh dimensional feature space. With discussion and authors' reply
- The revisited knockoffs method for variable selection in L1-penalized regressions
- Discussion: ``A significance test for the lasso
- Discussion: ``A significance test for the lasso
- Discussion: ``A significance test for the lasso
- Discussion: ``A significance test for the lasso
- Classifier variability: accounting for training and testing
- Data-adaptive binary classifiers in high dimensions using random partitioning
- A global homogeneity test for high-dimensional linear regression
- A unified theory of confidence regions and testing for high-dimensional estimating equations
- Variable selection techniques after multiple imputation in high-dimensional data
- Dynamic tilted current correlation for high dimensional variable screening
- A Two-Sample Conditional Distribution Test Using Conformal Prediction and Weighted Rank Sum
- Two-sample mean vector projection test in high-dimensional data
- Multiple hypothesis testing for variable selection
- A new data adaptive elastic net predictive model using hybridized smoothed covariance estimators with information complexity
- Forward selection and post-selection inference in factorial designs
- Selective inference with a randomized response
- Selective inference with distributed data
- Markov Neighborhood Regression for High-Dimensional Inference
- A significance test for the lasso
- Inference for high‐dimensional linear models with locally stationary error processes
- A generalized knockoff procedure for FDR control in structural change detection
- Bootstrapping and sample splitting for high-dimensional, assumption-lean inference
- Analysis of testing-based forward model selection
- A knockoff filter for high-dimensional selective inference
- Projection-based Inference for High-dimensional Linear Models
- Structure learning of exponential family graphical model with false discovery rate control
- The geometry of least squares in the 21st century
- Convex and non-convex regularization methods for spatial point processes intensity estimation
- Integrative analysis and variable selection with multiple high-dimensional data sets
- Inference under Fine-Gray competing risks model with high-dimensional covariates
- Variable screening in predicting clinical outcome with high-dimensional microarrays
- Sharp support recovery from noisy random measurements by \(\ell_1\)-minimization
- The adaptive and the thresholded Lasso for potentially misspecified models (and a lower bound for the Lasso)
- Variable screening with multiple studies
- Feature screening for network autoregression model
- Simultaneous dimension reduction and variable selection in modeling high dimensional data
- Debiasing the Lasso: optimal sample size for Gaussian designs
- Efficient test-based variable selection for high-dimensional linear models
- Exact tests via multiple data splitting
- Endogeneity in high dimensions
- Detection of gene-gene interactions using multistage sparse and low-rank regression
- Asymptotics for high dimensional regression \(M\)-estimates: fixed design results
- Cross projection test for high-dimension mean vectors
- Self-semi-supervised clustering for large scale data with massive null group
- High-dimensional projection-based ANOVA test
- Statistical Inference for High-Dimensional Models via Recursive Online-Score Estimation
- Discussion: ``A significance test for the lasso
- Optimal screening and discovery of sparse signals with applications to multistage high throughput studies
- LOL selection in high dimension
- An \(L_1\)-regularized logistic model for detecting short-term neuronal interactions
- High-dimensional variable screening and bias in subsequent inference, with an empirical comparison
- Causal learning via manifold regularization
- Inference for sparse linear regression based on the leave-one-covariate-out solution path
- High-dimensional statistical inference via DATE
- Cross-validation with confidence
- Post-model-selection inference in linear regression models: an integrated review
- Support recovery of Gaussian graphical model with false discovery rate control
- A three-stage approach to identify biomarker signatures for cancer genetic data with survival endpoints
- ``Preconditioning for feature selection and regression in high-dimensional problems
- Screening-based Bregman divergence estimation with NP-dimensionality
- Thresholding least-squares inference in high-dimensional regression models
- A metropolized adaptive subspace algorithm for high-dimensional Bayesian variable selection
- High-dimensional variable selection with heterogeneous signals: a precise asymptotic perspective
- Covariate Information Number for Feature Screening in Ultrahigh-Dimensional Supervised Problems
- On the impact of model selection on predictor identification and parameter inference
- Thresholding tests based on affine Lasso to achieve non-asymptotic nominal level and high power under sparse and dense alternatives in high dimension
- Feature screening via false discovery rate control for linear model with multivariate responses
- High-dimensional variable selection and prediction under competing risks with application to SEER-medicare linked data
- A Critical Review of LASSO and Its Derivatives for Variable Selection Under Dependence Among Covariates
- Post-selection inference in regression models for group testing data
- Collaborative targeted learning using regression shrinkage
- Estimation for high-dimensional linear mixed-effects models using \(\ell_1\)-penalization
- Honest variable selection in linear and logistic regression models via \(\ell _{1}\) and \(\ell _{1}+\ell _{2}\) penalization
- A Selective Overview of Variable Selection in High Dimensional Feature Space (Invited Review Article)
- Estimating and testing conditional sums of means in high dimensional multivariate binary data
- Cellwise outlier detection with false discovery rate control
- _1-penalized multinomial regression: estimation, inference, and prediction, with an application to risk factor identification for different dementia subtypes
This page was built for publication: High-dimensional variable selection
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q834336)