A significance test for the lasso (Q2249837)

From MaRDI portal
Cites work:

- A Fast Iterative Shrinkage-Thresholding Algorithm for Linear Inverse Problems
- NESTA: A Fast and Accurate First-Order Method for Sparse Recovery
- Templates for convex cone problems with applications to sparse signal recovery
- Distributed Optimization and Statistical Learning via the Alternating Direction Method of Multipliers
- Statistical significance in high-dimensional linear models
- Near-ideal model selection by \(\ell_{1}\) minimization
- Near-Optimal Signal Recovery From Random Projections: Universal Encoding Strategies?
- Atomic Decomposition by Basis Pursuit
- Q5485944
- Compressed sensing
- How Biased is the Apparent Error Rate of a Prediction Rule?
- Least angle regression. (With discussion)
- Pathwise coordinate optimization
- Recovery of Exact Sparse Representations in the Presence of Bounded Noise
- Persistence in high-dimensional linear predictor-selection and the virtue of overparametrization
- Confidence Intervals and Hypothesis Testing for High-Dimensional Regression
- Hypothesis Testing in High-Dimensional Regression Under the Gaussian Random Design Model: Asymptotic Theory
- \(p\)-Values for High-Dimensional Regression
- A Perturbation Method for Inference on Regularized Regression Estimates
- A new approach to variable selection in least squares problems
- Scaled sparse linear regression
- Inference in adaptive regression via the Kac-Rice formula
- Validity of the expected Euler characteristic heuristic
- Q4864293
- The Lasso problem and uniqueness
- Degrees of freedom in lasso problems
- Discussion of: ``Grouping strategies and thresholding for high dimension linear models''
- Sharp Thresholds for High-Dimensional and Noisy Sparsity Recovery Using \(\ell_{1}\)-Constrained Quadratic Programming (Lasso)
- High-dimensional variable selection
- Estimation of Parameters and Larger Quantiles Based on the k Largest Observations
- Q3174050
- Regularization and Variable Selection Via the Elastic Net
- On the ``degrees of freedom'' of the lasso

Revision as of 16:57, 8 July 2024

scientific article

Language: English
Label: A significance test for the lasso
Description: scientific article

    Statements

    A significance test for the lasso (English)
    Publication date: 3 July 2014
    A linear regression model is considered, \[ y=X\beta^*+\varepsilon,\quad \varepsilon\sim N(0, \sigma^2 I), \] where \(y\in \mathbb{R}^n\) is an outcome vector, \(X\) is a design matrix, and \(\beta^*\in \mathbb{R}^p\) are unknown coefficients to be estimated. The lasso estimator \(\hat{\beta}=\hat{\beta}(\lambda)\) minimizes the objective function \[ Q(\beta; \lambda)=\frac{1}{2} \|y-X\beta\|_2^2+\lambda \|\beta\|_1,\quad \beta\in \mathbb{R}^p, \] where \(\lambda \geq 0\) is a tuning parameter controlling the level of sparsity in \(\hat{\beta}\). It is assumed that the columns of \(X\) are in general position, which ensures uniqueness of the lasso solution; see [\textit{R. J. Tibshirani}, Electron. J. Stat. 7, 1456--1490 (2013; Zbl 1337.62173)].

    The path \(\hat{\beta}(\lambda)\) is a piecewise linear function of \(\lambda\), with knots at values \(\lambda_1 \geq \lambda_2 \geq \cdots \geq \lambda_r \geq 0\). At \(\lambda=\infty\), the solution \(\hat{\beta}(\infty)\) has no active variables, and as \(\lambda\) decreases, each knot \(\lambda_k\) marks the entry of some variable into, or its removal from, the current active set. At any \(\lambda \geq 0\), the corresponding active set \(A=\operatorname{supp}(\hat{\beta}(\lambda))\) indexes a linearly independent set of predictor variables, that is, \(\operatorname{rank}(X_A)=|A|\), where \(X_A\) denotes the columns of \(X\) indexed by \(A\).

    Let \(A\) be the active set just before \(\lambda_k\), and suppose that predictor \(j\) enters at \(\lambda_k\). Denote by \(\hat{\beta}(\lambda_{k+1})\) the lasso solution at \(\lambda=\lambda_{k+1}\), using the predictors in \(A\) together with \(j\), and let \(\tilde{\beta}_A(\lambda_{k+1})\) be the lasso solution at \(\lambda=\lambda_{k+1}\) using only the active predictors \(X_A\). In the paper under review, the \textit{covariance test statistic} is proposed, \[ T_k=\frac{1}{\sigma^2}\left\langle y,\; X\hat{\beta}(\lambda_{k+1})-X_A\tilde{\beta}_A(\lambda_{k+1})\right\rangle. \]

    The main result, given in Theorem 3, states the following: under the null hypothesis that the current lasso model contains all truly active variables, \(\operatorname{supp}(\beta^*) \subseteq A\), the statistic \(T_k\) is asymptotically distributed as a standard exponential random variable, under reasonable assumptions on \(X\) and on the magnitudes of the nonzero true coefficients. This statistic can be used to test the significance of an additional variable between two nested models, even though this additional variable is not fixed in advance but has been chosen adaptively. In Section 6, this result is modified for the case of unknown \(\sigma^2\). Section 8 discusses extensions to the elastic net, generalized linear models, and the Cox proportional hazards model; the proposals there are supported by simulations, but no theory is offered.
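    The covariance statistic at the first knot can be computed in closed form: with unit-norm columns and \(A=\emptyset\), the only-\(A\) fit is identically zero and \(T_1=\lambda_1(\lambda_1-\lambda_2)/\sigma^2\). The following is a minimal numpy sketch of that first step under the global null (the data, dimensions, and seed are illustrative assumptions, not from the paper):

```python
import numpy as np

rng = np.random.default_rng(0)
n, p, sigma = 100, 10, 1.0
X = rng.standard_normal((n, p))
X /= np.linalg.norm(X, axis=0)      # unit-norm columns (general position a.s.)
y = sigma * rng.standard_normal(n)  # global null: beta* = 0

# First knot: lambda_1 = max_j |x_j' y|, attained by the entering variable j1.
c = X.T @ y
j1 = int(np.argmax(np.abs(c)))
lam1 = abs(c[j1])
s = np.sign(c[j1])

# For lam in [lam2, lam1] only j1 is active, with beta_{j1}(lam) = s*(lam1 - lam),
# so the inactive correlations are linear in lam: c_k(lam) = (c_k - rho_k*s*lam1) + rho_k*s*lam.
# The next knot lam2 is the largest lam < lam1 where some |c_k(lam)| catches up with lam.
rho = X.T @ X[:, j1]
cands = []
for k in range(p):
    if k == j1:
        continue
    for lam in ((c[k] - rho[k] * s * lam1) / (1 - rho[k] * s),
                (rho[k] * s * lam1 - c[k]) / (1 + rho[k] * s)):
        if 0 < lam < lam1:
            cands.append(lam)
lam2 = max(cands)

# Covariance statistic at the first knot: A is empty, so
#   T_1 = <y, X bhat(lam2)> / sigma^2 = lam1 * (lam1 - lam2) / sigma^2.
beta_hat = np.zeros(p)
beta_hat[j1] = s * (lam1 - lam2)
T1 = y @ (X @ beta_hat) / sigma**2
print(T1)
```

    Under the null, Theorem 3 says \(T_1\) is asymptotically standard exponential; repeating the sketch over many draws of \(y\) and comparing the empirical distribution of `T1` to Exp(1) illustrates the result numerically.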
    lasso
    least angle regression
    \(p\)-value
    significance test
