High-dimensional variable selection

DOI10.1214/08-AOS646MaRDI QIDQ834336zbMATH OpenOpenAlexWikidataFDO

Publication date 19 August 2009

Published in The Annals of Statistics (Search for Journal in Brave)

Full work available at URL https://arxiv.org/abs/0704.1139

lasso simulations sparsity stepwise regression osteoporotic fractures

Point estimation (62F10) Asymptotic properties of parametric estimators (62F12) Linear regression; mixed models (62J05) Applications of statistics to biology and medical sciences; meta analysis (62P10) Estimation in multivariate analysis (62H12) Ridge regression; shrinkage estimators (Lasso) (62J07)

Abstract: This paper explores the following question: what kind of statistical guarantees can be given when doing variable selection in high-dimensional models? In particular, we look at the error rates and power of some multi-stage regression methods. In the first stage we fit a set of candidate models. In the second stage we select one model by cross-validation. In the third stage we use hypothesis testing to eliminate some variables. We refer to the first two stages as "screening" and the last stage as "cleaning." We consider three screening methods: the lasso, marginal regression, and forward stepwise regression. Our method gives consistent variable selection under certain conditions.

Recommendations

Cites work

Cited in

(only showing first 100 items - show all)

Describes a project that uses

Uses Software

TETRAD

This page was built for publication: High-dimensional variable selection

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q834336)