Significance testing in non-sparse high-dimensional linear models
From MaRDI portal
Abstract: In high-dimensional linear models, the sparsity assumption is typically made, stating that most of the parameters are equal to zero. Under the sparsity assumption, estimation and, recently, inference have been well studied. However, in practice, sparsity assumption is not checkable and more importantly is often violated; a large number of covariates might be expected to be associated with the response, indicating that possibly all, rather than just a few, parameters are non-zero. A natural example is a genome-wide gene expression profiling, where all genes are believed to affect a common disease marker. We show that existing inferential methods are sensitive to the sparsity assumption, and may, in turn, result in the severe lack of control of Type-I error. In this article, we propose a new inferential method, named CorrT, which is robust to model misspecification such as heteroscedasticity and lack of sparsity. CorrT is shown to have Type I error approaching the nominal level for extit{any} models and Type II error approaching zero for sparse and many dense models. In fact, CorrT is also shown to be optimal in a variety of frameworks: sparse, non-sparse and hybrid models where sparse and dense signals are mixed. Numerical experiments show a favorable performance of the CorrT test compared to the state-of-the-art methods.
Recommendations
- Linear hypothesis testing in dense high-dimensional linear models
- Testability of high-dimensional linear models with nonsparse structures
- Statistical significance in high-dimensional linear models
- Testing regression coefficients in high-dimensional and sparse settings
- Testing a single regression coefficient in high dimensional linear models
Cites work
- scientific article; zbMATH DE number 6378086 (Why is no real title available?)
- scientific article; zbMATH DE number 3169866 (Why is no real title available?)
- scientific article; zbMATH DE number 3723610 (Why is no real title available?)
- scientific article; zbMATH DE number 3249395 (Why is no real title available?)
- A Bernstein type inequality and moderate deviations for weakly dependent sequences
- A Heteroskedasticity-Consistent Covariance Matrix Estimator and a Direct Test for Heteroskedasticity
- A general theory of hypothesis tests and confidence regions for sparse high dimensional models
- A lava attack on the recovery of sums of dense and sparse signals
- Adapting to Unknown Smoothness via Wavelet Shrinkage
- Analysis of Semiparametric Regression Models for Repeated Outcomes in the Presence of Missing Data
- Asymptotic Statistics
- Comments on: ``High-dimensional simultaneous inference with the bootstrap
- Concentration inequalities. A nonasymptotic theory of independence
- Confidence Intervals and Hypothesis Testing for High-Dimensional Regression
- Confidence intervals for high-dimensional linear regression: minimax rates and adaptivity
- Confidence intervals for low dimensional parameters in high dimensional linear models
- Debiasing the Lasso: optimal sample size for Gaussian designs
- Detection boundary in sparse regression
- Doubly Robust and Efficient Estimators for Heteroscedastic Partially Linear Single-Index Models Allowing high Dimensional Covariates
- Doubly robust learning for estimating individualized treatment with censored data
- EigenPrism: inference for high dimensional signal-to-noise ratios
- Estimates for the distribution of sums and maxima of sums of random variables without the Cramér condition
- Estimating Regression Models with Multiplicative Heteroscedasticity
- Estimation of Regression Coefficients When Some Regressors Are Not Always Observed
- Hypothesis Testing in High-Dimensional Regression Under the Gaussian Random Design Model: Asymptotic Theory
- Inference on treatment effects after selection among high-dimensional controls
- Linear programming. Foundations and extensions
- Minimax Rates of Estimation for High-Dimensional Linear Regression Over $\ell_q$-Balls
- Minimax estimation of linear and quadratic functionals on sparsity classes
- Minimax risk over \(l_ p\)-balls for \(l_ q\)-error
- On asymptotically optimal confidence regions and tests for high-dimensional models
- Optimal adaptive estimation of linear functionals under sparsity
- Pivotal Estimation in High-Dimensional Regression via Linear Programming
- Reconstruction From Anisotropic Random Measurements
- Ridge regression and asymptotic minimax estimation over spheres of growing dimension
- Scaled sparse linear regression
- Semiparametric Efficiency in Multivariate Regression Models with Missing Data
- Semiparametric Regression for Repeated Outcomes with Nonignorable Nonresponse
- Semiparametric theory for causal mediation analysis: efficiency bounds, multiple robustness and sensitivity analysis
- Sharp adaptation for inverse problems with random noise
- Simultaneous analysis of Lasso and Dantzig selector
- Square-root lasso: pivotal recovery of sparse signals via conic programming
- Statistics for high-dimensional data. Methods, theory and applications.
- Testing Statistical Hypotheses
- The Asymptotic Variance of Semiparametric Estimators
- Unified methods for censored longitudinal data and causality
Cited in
(24)- Global testing under sparse alternatives: ANOVA, multiple comparisons and the higher criticism
- Permutation testing in high-dimensional linear models: an empirical investigation
- In defense of the indefensible: a very naïve approach to high-dimensional inference
- A significance test for the elastic net and its asymptotic distribution with general predictors
- Comments on: ``High-dimensional simultaneous inference with the bootstrap
- Rejoinder on: ``High-dimensional simultaneous inference with the bootstrap
- Sign-based test for mean vector in high-dimensional and sparse settings
- Double-estimation-friendly inference for high-dimensional misspecified models
- Linear hypothesis testing in dense high-dimensional linear models
- Estimation and Inference for High-Dimensional Generalized Linear Models with Knowledge Transfer
- Optimal sparsity testing in linear regression model
- Statistical significance in high-dimensional linear models
- Testing a single regression coefficient in high dimensional linear models
- A unified theory of confidence regions and testing for high-dimensional estimating equations
- A significance test for the lasso
- CorrT
- scientific article; zbMATH DE number 7370643 (Why is no real title available?)
- Maximization of ESI. Jaynes principle in testing significant inputs of linear model
- Testability of high-dimensional linear models with nonsparse structures
- Relaxing the assumptions of knockoffs by conditioning
- Two-sample testing of high-dimensional linear regression coefficients via complementary sketching
- Distribution and correlation-free two-sample test of high-dimensional means
- Projection-based Inference for High-dimensional Linear Models
- De-biasing the Lasso with degrees-of-freedom adjustment
This page was built for publication: Significance testing in non-sparse high-dimensional linear models
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q1616315)