SLOPE-adaptive variable selection via convex optimization

From MaRDI portal
Publication:902886

DOI10.1214/15-AOAS842zbMATH Open1454.62212arXiv1407.3824OpenAlexW1916786071WikidataQ40166307 ScholiaQ40166307MaRDI QIDQ902886FDOQ902886

Chiara Sabatti, Ewout van den Berg, Małgorzata Bogdan, Małgorzata Bogdan, Chiara Sabatti, Weijie Su, Emmanuel J. Candès, Weijie Su, Ewout van Den Berg, Emmanuel J. Candès

Publication date: 4 January 2016

Published in: The Annals of Applied Statistics (Search for Journal in Brave)

Abstract: We introduce a new estimator for the vector of coefficients in the linear model , where X has dimensions nimesp with p possibly larger than n. SLOPE, short for Sorted L-One Penalized Estimation, is the solution to [min_{binmathbb{R}^p}frac{1}{2}Vert y-XbVert _{ell_2}^2+lambda_1vert bvert _{(1)}+lambda_2vert bvert_{(2)}+cdots+lambda_pvert bvert_{(p)},] where lambda1gelambda2gecdotsgelambdapge0 and vertbvert(1)gevertbvert(2)gecdotsgevertbvert(p) are the decreasing absolute values of the entries of b. This is a convex program and we demonstrate a solution algorithm whose computational complexity is roughly comparable to that of classical ell1 procedures such as the Lasso. Here, the regularizer is a sorted ell1 norm, which penalizes the regression coefficients according to their rank: the higher the rank - that is, stronger the signal - the larger the penalty. This is similar to the Benjamini and Hochberg [J. Roy. Statist. Soc. Ser. B 57 (1995) 289-300] procedure (BH) which compares more significant p-values with more stringent thresholds. One notable choice of the sequence lambdai is given by the BH critical values lambdamathrmBH(i)=z(1icdotq/2p), where qin(0,1) and z(alpha) is the quantile of a standard normal distribution. SLOPE aims to provide finite sample guarantees on the selected model; of special interest is the false discovery rate (FDR), defined as the expected proportion of irrelevant regressors among all selected predictors. Under orthogonal designs, SLOPE with lambdamathrmBH provably controls FDR at level q. Moreover, it also appears to have appreciable inferential properties under more general designs X while having substantial power, as demonstrated in a series of experiments running on both simulated and real data.


Full work available at URL: https://arxiv.org/abs/1407.3824





Cites Work


Cited In (83)

Uses Software






This page was built for publication: SLOPE-adaptive variable selection via convex optimization

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q902886)