The all-or-nothing phenomenon in sparse linear regression (Q2078961)

From MaRDI portal

Jump to:navigation, search

scientific article

Language	Label	Description	Also known as
English	The all-or-nothing phenomenon in sparse linear regression	scientific article

Statements

scholarly article

0 references

The all-or-nothing phenomenon in sparse linear regression (English)

0 references

0 references

0 references

0 references

Mathematical Statistics and Learning

0 references

publication date

4 March 2022

0 references

full work available at URL

https://arxiv.org/abs/1903.05046

0 references

Summary: We study the problem of recovering a hidden binary \(k\)-sparse \(p\)-dimensional vector \(\beta\) from \(n\) noisy linear observations \(Y=X\beta+W\), where \(X_{ij}\) are i.i.d. \( \mathcal{N}(0,1)\) and \(W_i\) are i.i.d. \( \mathcal{N}(0,\sigma^2)\). A closely related hypothesis testing problem is to distinguish the pair \((X,Y)\) generated from this structured model from a corresponding null model where \((X,Y)\) consist of purely independent Gaussian entries. In the low sparsity \(k=o(\sqrt{p})\) and high signal-to-noise ratio \(k/\sigma^2 \to \infty\) regime, we establish an ``all-or-nothing'' information-theoretic phase transition at a critical sample size \(n^{\ast}=2 k\log (p/k) /\log (1+k/\sigma^2)\), resolving a conjecture of Gamarnik and Zadik (2017). Specifically, we show that if \(\liminf_{p\to \infty} n/n^{\ast}>1\), then the maximum likelihood estimator almost perfectly recovers the hidden vector with high probability and moreover the true hypothesis can be detected with a vanishing error probability. Conversely, if \(\limsup_{p\to \infty} n/n^{\ast}<1\), then it becomes information-theoretically impossible even to recover an arbitrarily small but fixed fraction of the hidden vector support, or to test hypotheses strictly better than random guess. Our proof of the impossibility result builds upon two key techniques, which could be of independent interest. First, we use a conditional second moment method to upper bound the Kullback-Leibler (KL) divergence between the structured and the null model. Second, inspired by the celebrated area theorem, we establish a lower bound to the minimum mean squared estimation error of the hidden vector in terms of the KL divergence between the two models.

0 references

zbMATH Keywords

sparse regression

0 references

second moment method

0 references

area theorem

0 references

describes a project that uses

0 references

MaRDI profile type

MaRDI publication profile

0 references

Information Theoretic Bounds for Compressed Sensing

0 references

Shannon-Theoretic Limits on Noisy Compressive Sampling

0 references

The Distribution of a Quadratic Form of Normal Random Variables

0 references

Information-Theoretic Bounds and Phase Transitions in Clustering, Sparse PCA, and Submatrix Localization

0 references

Approximate Message-Passing Decoder and Capacity Achieving Sparse Superposition Codes

0 references

Mutual Information and Optimality of Approximate Message-Passing in Random Linear Estimation

0 references

Variable selection with Hamming loss

0 references

Decoding by Linear Programming

0 references

Optimal sparsity testing in linear regression model

0 references

Atomic Decomposition by Basis Pursuit

0 references

Compressed sensing

0 references

Necessary and Sufficient Conditions for Sparsity Pattern Recovery

0 references

Randomly Spread CDMA: Asymptotics Via Statistical Physics

0 references

Detection boundary in sparse regression

0 references

Limits on Support Recovery of Sparse Signals via Multiple-Access Communication Techniques

0 references

Least Squares Superposition Codes of Moderate Dictionary Size Are Reliable at Rates up to Capacity

0 references

Fast Sparse Superposition Codes Have Near Exponential Error Probability for <formula formulatype="inline"><tex Notation="TeX">$R&lt;{\cal C}$</tex></formula>

0 references

Reed–Muller Codes Achieve Capacity on Erasure Channels

0 references

Adaptive estimation of a quadratic functional by model selection.

0 references

Maxwell Construction: The Hidden Bridge Between Iterative and Maximum<i>a Posteriori</i>Decoding

0 references

0 references

Reconstruction and estimation in the planted partition model

0 references

Optimal Variable Selection and Adaptive Noisy Compressed Sensing

0 references

Statistical limits of spiked tensor models

0 references

Nearly Sharp Sufficient Conditions on Exact Sparsity Pattern Recovery

0 references

The Sampling Rate-Distortion Tradeoff for Sparsity Pattern Recovery in Compressed Sensing

0 references

Approximate Sparsity Pattern Recovery: Information-Theoretic Lower Bounds

0 references

The all-or-nothing phenomenon in sparse linear regression

0 references

Capacity-Achieving Sparse Superposition Codes via Approximate Message Passing Decoding

0 references

Limits on Support Recovery With Probabilistic Models: An Information-Theoretic Framework

0 references

A statistical-mechanics approach to large-system analysis of CDMA multiuser detectors

0 references

Minimax risks for sparse regressions: ultra-high dimensional phenomenons

0 references

Information-Theoretic Limits on Sparsity Recovery in the High-Dimensional and Noisy Setting

0 references

Sharp Thresholds for High-Dimensional and Noisy Sparsity Recovery Using $\ell _{1}$-Constrained Quadratic Programming (Lasso)

0 references

Information-Theoretic Limits on Sparse Signal Recovery: Dense versus Sparse Measurement Matrices

0 references

Identifiers

zbMATH Open document ID

0 references

0 references

Mathematics Subject Classification ID

0 references

0 references

0 references

0 references

zbMATH DE Number

0 references

0 references

0 references

Sitelinks

Mathematics(1 entry)

mardi Publication:2078961

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Item:Q2078961&oldid=37130939"