How close is the sample covariance matrix to the actual covariance matrix?

DOI: 10.1007/S10959-010-0338-Z; zbMATH Open: 1365.62208; arXiv: 1004.3484; OpenAlex: W2094644779; MaRDI QID: Q715740; FDO: Q715740

Publication date: 1 November 2012

Published in: Journal of Theoretical Probability

Abstract: Given a probability distribution in $\mathbb{R}^n$ with general (non-white) covariance, a classical estimator of the covariance matrix is the sample covariance matrix obtained from a sample of $N$ independent points. What is the optimal sample size $N = N(n)$ that guarantees estimation with a fixed accuracy in the operator norm? Suppose the distribution is supported in a centered Euclidean ball of radius $\sqrt{n}$. We conjecture that the optimal sample size is $N = O(n)$ for all distributions with finite fourth moment, and we prove this up to an iterated logarithmic factor. This problem is motivated by the optimal theorem of Rudelson, which states that $N = O(n \log n)$ for distributions with finite second moment, and a recent result of Adamczak, Litvak, Pajor and Tomczak-Jaegermann, which guarantees that $N = O(n)$ for sub-exponential distributions.
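
The question posed in the abstract can be explored numerically. The following is a minimal sketch, not taken from the paper: it assumes one particular distribution (uniform on the sphere of radius $\sqrt{n}$, which is isotropic, so the true covariance is the identity), treats the mean as known to be zero, and uses hypothetical function names (`sample_sphere`, `operator_norm_error`, `run_experiment`). It measures the operator-norm distance between the sample covariance and the true covariance as $N$ grows in multiples of $n$.

```python
import numpy as np


def sample_sphere(num_points, dim, rng):
    """Draw points uniformly from the sphere of radius sqrt(dim) in R^dim."""
    g = rng.standard_normal((num_points, dim))
    # Normalizing Gaussian vectors gives a uniform direction on the sphere.
    return np.sqrt(dim) * g / np.linalg.norm(g, axis=1, keepdims=True)


def operator_norm_error(points, true_cov):
    """Spectral-norm distance between the sample covariance and true_cov."""
    # Assumption: the distribution is centered, so no mean is subtracted.
    sample_cov = points.T @ points / points.shape[0]
    return np.linalg.norm(sample_cov - true_cov, ord=2)


def run_experiment(dim=50, multiples=(1, 4, 16, 64), seed=0):
    """Return {c: error} for sample sizes N = c * dim."""
    rng = np.random.default_rng(seed)
    true_cov = np.eye(dim)  # covariance of the uniform law on this sphere
    errors = {}
    for c in multiples:
        points = sample_sphere(c * dim, dim, rng)
        errors[c] = operator_norm_error(points, true_cov)
    return errors


if __name__ == "__main__":
    for c, err in run_experiment().items():
        print(f"N = {c} * n: operator-norm error = {err:.3f}")
```

Running the script prints one error per sample size; for this well-behaved distribution the error should shrink as the ratio $N/n$ increases. A single simulation of this kind says nothing about the heavy-tailed distributions the conjecture targets.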

Full work available at URL: https://arxiv.org/abs/1004.3484

zbMATH Keywords

estimation of covariance matrices; sample covariance matrices; random matrices with independent columns

Mathematics Subject Classification ID

Estimation in multivariate analysis (62H12); Random matrices (probabilistic aspects) (60B20)
