Significance testing in non-sparse high-dimensional linear models

DOI10.1214/18-EJS1443MaRDI QIDQ1616315zbMATH OpenOpenAlexWikidataFDO

Authors Yinchu Zhu, Jelena Bradic

Publication date 1 November 2018

Published in Electronic Journal of Statistics (Search for Journal in Brave)

Full work available at URL https://arxiv.org/abs/1610.02122, https://projecteuclid.org/euclid.ejs/1538791404

zbMATH Keywords

moment conditions CorrT test importance of variables restructured regression

Mathematics Subject Classification ID

Linear regression; mixed models (62J05) Hypothesis testing in multivariate analysis (62H15)

Abstract: In high-dimensional linear models, the sparsity assumption is typically made, stating that most of the parameters are equal to zero. Under the sparsity assumption, estimation and, recently, inference have been well studied. However, in practice, sparsity assumption is not checkable and more importantly is often violated; a large number of covariates might be expected to be associated with the response, indicating that possibly all, rather than just a few, parameters are non-zero. A natural example is a genome-wide gene expression profiling, where all genes are believed to affect a common disease marker. We show that existing inferential methods are sensitive to the sparsity assumption, and may, in turn, result in the severe lack of control of Type-I error. In this article, we propose a new inferential method, named CorrT, which is robust to model misspecification such as heteroscedasticity and lack of sparsity. CorrT is shown to have Type I error approaching the nominal level for extit{any} models and Type II error approaching zero for sparse and many dense models. In fact, CorrT is also shown to be optimal in a variety of frameworks: sparse, non-sparse and hybrid models where sparse and dense signals are mixed. Numerical experiments show a favorable performance of the CorrT test compared to the state-of-the-art methods.

Recommendations

Cites work

Cited in

(24)

Describes a project that uses

Uses Software

This page was built for publication: Significance testing in non-sparse high-dimensional linear models

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q1616315)