Universality of regularized regression estimators in high dimensions
From MaRDI portal
Publication:6183759
Abstract: The Convex Gaussian Min-Max Theorem (CGMT) has emerged as a prominent theoretical tool for analyzing the precise stochastic behavior of various statistical estimators in the so-called high-dimensional proportional regime, where the sample size and the signal dimension are of the same order. However, a well-recognized limitation of the existing CGMT machinery rests in its stringent requirement on the exact Gaussianity of the design matrix, therefore rendering the obtained precise high-dimensional asymptotics largely a specific Gaussian theory in various important statistical models. This paper provides a structural universality framework for a broad class of regularized regression estimators that is particularly compatible with the CGMT machinery. In particular, we show that with a good enough \(\ell_\infty\) bound for the regression estimator \(\hat{\mu}_A\), any `structural property' that can be detected via the CGMT for \(\hat{\mu}_G\) (under a standard Gaussian design \(G\)) also holds for \(\hat{\mu}_A\) under a general design \(A\) with independent entries. As a proof of concept, we demonstrate our new universality framework in three key examples of regularized regression estimators: the Ridge, Lasso and regularized robust regression estimators, where new universality properties of risk asymptotics and/or distributions of regression estimators and other related quantities are proved. As a major statistical implication of the Lasso universality results, we validate inference procedures using the degrees-of-freedom adjusted debiased Lasso under general design and error distributions. We also provide a counterexample, showing that universality properties for regularized regression estimators do not extend to general isotropic designs.
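To make the statistical implication concrete, the following is a minimal numerical sketch of the degrees-of-freedom adjusted debiased Lasso under a non-Gaussian-agnostic i.i.d. design. It is not the paper's code: the ISTA solver, the choice of \(\lambda\), and the simulation sizes are illustrative assumptions, and the standard-error formula \(\hat\tau = \|y - X\hat\mu\|_2/(n - \hat{\mathrm{df}})\) follows the usual df-adjustment convention with \(\hat{\mathrm{df}}\) the Lasso support size.

```python
import numpy as np

def lasso_ista(X, y, lam, n_iter=1000):
    """Solve min_mu ||y - X mu||_2^2/(2n) + lam*||mu||_1 by ISTA
    (proximal gradient with soft-thresholding)."""
    n, p = X.shape
    L = np.linalg.norm(X, 2) ** 2 / n  # Lipschitz constant of the smooth part
    mu = np.zeros(p)
    for _ in range(n_iter):
        z = mu - (X.T @ (X @ mu - y) / n) / L
        mu = np.sign(z) * np.maximum(np.abs(z) - lam / L, 0.0)
    return mu

def debiased_lasso_df(X, y, lam):
    """Degrees-of-freedom adjusted debiased Lasso (illustrative sketch)."""
    n, p = X.shape
    mu_hat = lasso_ista(X, y, lam)
    df = np.count_nonzero(mu_hat)        # degrees of freedom = support size
    r = y - X @ mu_hat                   # Lasso residual
    mu_d = mu_hat + X.T @ r / (n - df)   # df-adjusted debiasing step
    tau = np.linalg.norm(r) / (n - df)   # per-coordinate standard error
    return mu_d, tau

# Simulate a sparse linear model with an i.i.d. (here Gaussian) design.
rng = np.random.default_rng(0)
n, p, s = 400, 200, 10
X = rng.standard_normal((n, p))
mu_star = np.zeros(p)
mu_star[:s] = 1.0
y = X @ mu_star + rng.standard_normal(n)

mu_d, tau = debiased_lasso_df(X, y, lam=0.1)
# Empirical coverage of the nominal 95% intervals mu_d[j] +/- 1.96*tau.
cover = float(np.mean(np.abs(mu_d - mu_star) <= 1.96 * tau))
print(round(cover, 2))
```

Under the universality results of the paper, the same per-coordinate normal approximation (and hence the interval coverage) is expected to persist when the Gaussian entries of `X` are replaced by other independent entries with matching moments, but not for arbitrary isotropic designs.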
Recommendations
- A unified framework for high-dimensional analysis of \(M\)-estimators with decomposable regularizers
- Debiasing the Lasso: optimal sample size for Gaussian designs
- Fundamental barriers to high-dimensional regression with convex penalties
- The Lasso with general Gaussian designs with applications to hypothesis testing
- Asymptotics for high dimensional regression \(M\)-estimates: fixed design results
Cites work
- scientific article; zbMATH DE number 4061904
- scientific article; zbMATH DE number 1273988
- scientific article; zbMATH DE number 845714
- A generalization of the Lindeberg principle
- A model of double descent for high-dimensional binary linear classification
- A modern maximum-likelihood theory for high-dimensional logistic regression
- A precise high-dimensional asymptotic theory for boosting and minimum-\(\ell_1\)-norm interpolated classifiers
- Applications of the Lindeberg Principle in Communications and Statistical Learning
- Approximate message passing algorithms for rotationally invariant matrices
- Central limit theorem and bootstrap approximation in high dimensions: near \(1/\sqrt{n}\) rates via implicit smoothing
- Confidence Intervals and Hypothesis Testing for High-Dimensional Regression
- Confidence intervals for low dimensional parameters in high dimensional linear models
- De-biasing the Lasso with degrees-of-freedom adjustment
- Debiasing convex regularized estimators and interval estimation in linear models
- Debiasing the Lasso: optimal sample size for Gaussian designs
- Does SLOPE outperform bridge regression?
- Estimation of the mean of a multivariate normal distribution
- Fundamental barriers to high-dimensional regression with convex penalties
- Fundamental limits of symmetric low-rank matrix estimation
- Generalisation error in learning with random features and the hidden manifold model
- High dimensional robust M-estimation: asymptotic variance via approximate message passing
- High-dimensional asymptotics of prediction: ridge regression and classification
- High-dimensional central limit theorems by Stein's method
- Hypothesis Testing in High-Dimensional Regression Under the Gaussian Random Design Model: Asymptotic Theory
- Learning curves of generic features maps for realistic datasets with a teacher-student model
- Mean Field Models for Spin Glasses
- Mean field asymptotics in high-dimensional statistics: from exact results to efficient algorithms
- Mean field models for spin glasses. Volume I: Basic examples.
- Nearly optimal central limit theorem and bootstrap approximations in high dimensions
- On asymptotically optimal confidence regions and tests for high-dimensional models
- On robust regression with high-dimensional predictors
- On the impact of predictor geometry on the performance of high-dimensional ridge-regularized generalized robust regression estimators
- Optimal errors and phase transitions in high-dimensional generalized linear models
- Precise Error Analysis of Regularized \(M\)-Estimators in High Dimensions
- Ridge Regression: Biased Estimation for Nonorthogonal Problems
- Ridge regression and asymptotic minimax estimation over spheres of growing dimension
- Robust Estimation of a Location Parameter
- Robust regression: Asymptotics, conjectures and Monte Carlo
- Second-order Stein: SURE for SURE and other applications in high-dimensional inference
- Surprises in high-dimensional ridgeless least squares interpolation
- The Dynamics of Message Passing on Dense Graphs, with Applications to Compressed Sensing
- The Generalization Error of Random Features Regression: Precise Asymptotics and the Double Descent Curve
- The LASSO Risk for Gaussian Matrices
- The distribution of the Lasso: uniform control over sparse balls and adaptive parameter tuning
- Universality in polytope phase transitions and message passing algorithms
- Universality laws for randomized dimension reduction, with applications
- Universality of approximate message passing algorithms
- Weak convergence and empirical processes. With applications to statistics
Cited in (4)
- Universal Regularization Methods: Varying the Power, the Smoothness and the Accuracy
- Regularization after retention in ultrahigh dimensional linear regression models
- Regularized parameter estimation of high dimensional distribution
- Approximate message passing with rigorous guarantees for pooled data and quantitative group testing