Impacts of high dimensionality in finite samples
From MaRDI portal
Abstract: High-dimensional data sets are commonly collected in many contemporary applications arising in various fields of scientific research. We present two views of finite samples in high dimensions: a probabilistic one and a nonprobabilistic one. With the probabilistic view, we establish the concentration property and robust spark bound for large random design matrices generated from elliptical distributions, with the former related to the sure screening property and the latter related to sparse model identifiability. An interesting concentration phenomenon in high dimensions is revealed. With the nonprobabilistic view, we derive general bounds on dimensionality under a distance constraint on sparse models. These results provide new insights into the impacts of high dimensionality in finite samples.
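As background for the identifiability claim in the abstract: the robust spark generalizes the classical spark of a design matrix, a notion from the sparse-recovery literature cited below (e.g. the entry on optimally sparse representation via ℓ₁ minimization). The following is a standard formulation of the classical definition and the uniqueness condition it yields, not necessarily the exact statement used in the paper:

```latex
% Spark of a design matrix X in R^{n x p}: the smallest number of
% linearly dependent columns of X.
\operatorname{spark}(X) \;=\; \min \bigl\{ \|v\|_0 \;:\; Xv = 0,\; v \neq 0 \bigr\}.

% Standard identifiability condition: if y = X\beta with
%   \|\beta\|_0 < \operatorname{spark}(X)/2,
% then \beta is the unique sparsest solution of X b = y,
% so the sparse model is identifiable from the data.
```

Lower bounds on the (robust) spark of a random design matrix therefore translate directly into guarantees that sufficiently sparse models are identifiable, which is the role the robust spark bound plays in the probabilistic view above.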
Cites work
- scientific article; zbMATH DE number 3886886
- scientific article; zbMATH DE number 45785
- scientific article; zbMATH DE number 2174437
- A Selective Overview of Variable Selection in High Dimensional Feature Space (Invited Review Article)
- A characterization of the distributions that imply mean-variance utility functions
- A unified approach to model selection and sparse recovery using regularized least squares
- Differential geometry of Grassmann manifolds
- Effect of heavy tails on ultra high dimensional variable ranking methods
- Feature screening via distance correlation learning
- Geometric Representation of High Dimension, Low Sample Size Data
- High-dimensional classification using features annealed independence rules
- High-dimensional variable screening and bias in subsequent inference, with an empirical comparison
- Least angle regression. (With discussion)
- Model-free feature screening for ultrahigh-dimensional data
- Nonparametric independence screening in sparse ultra-high-dimensional additive models
- Normal Multivariate Analysis and the Orthogonal Group
- On the conditions used to prove oracle results for the Lasso
- One-step sparse estimates in nonconcave penalized likelihood models
- Optimally sparse representation in general (nonorthogonal) dictionaries via ℓ₁ minimization
- Packing Lines, Planes, etc.: Packings in Grassmannian Spaces
- Simultaneous analysis of Lasso and Dantzig selector
- Statistical challenges with high dimensionality: feature selection in knowledge discovery
- Sure independence screening and compressed random sensing
- The Geometry of Algorithms with Orthogonality Constraints
- The Kolmogorov filter for variable screening in high-dimensional binary classification
- The concentration of measure phenomenon
- Tilting methods for assessing the influence of components in a classifier
Cited in (7)
- A survey of high dimension low sample size asymptotics
- IPAD: stable interpretable forecasting with knockoffs inference
- RANK: Large-Scale Inference With Graphical Nonlinear Knockoffs
- Nonsparse learning with latent variables
- Greedy forward regression for variable screening
- Statistical insights into deep neural network learning in subspace classification
- A fundamental bias in calculating dimensions from finite data sets