How close is the sample covariance matrix to the actual covariance matrix?
From MaRDI portal
(Redirected from Publication:715740)
Abstract: Given a probability distribution in R^n with general (non-white) covariance, a classical estimator of the covariance matrix is the sample covariance matrix obtained from a sample of N independent points. What is the optimal sample size N = N(n) that guarantees estimation with a fixed accuracy in the operator norm? Suppose the distribution is supported in a centered Euclidean ball of radius sqrt{n}. We conjecture that the optimal sample size is N = O(n) for all distributions with finite fourth moment, and we prove this up to an iterated logarithmic factor. This problem is motivated by the optimal theorem of Rudelson which states that N = O(n log n) for distributions with finite second moment, and a recent result of Adamczak, Litvak, Pajor and Tomczak-Jaegermann which guarantees that N = O(n) for sub-exponential distributions.
Recommendations
- Sample size determination in estimating a covariance matrix
- Sample covariance matrix for random vectors with heavy tails
- Covariance Matrix Estimation From Linearly-Correlated Gaussian Samples
- On the covariance between the sample mean and variance
- Comparison between two types of large sample covariance matrices
- Sample Covariance Matrices of Heavy-Tailed Distributions
- Estimating covariance matrices
- Estimating the covariance of random matrices
Cites work
- scientific article; zbMATH DE number 49190 (Why is no real title available?)
- scientific article; zbMATH DE number 1302647 (Why is no real title available?)
- scientific article; zbMATH DE number 1149836 (Why is no real title available?)
- Approximating the moments of marginals of high-dimensional distributions
- Asymptotic theory of finite dimensional normed spaces. With an appendix by M. Gromov: Isoperimetric inequalities in Riemannian manifolds
- Concentration of mass on convex bodies
- Euclidean structure in finite dimensional normed spaces
- Frame expansions with erasures: an approach through the non-commutative operator theory
- Generalized thresholding of large covariance matrices
- Limit of the smallest eigenvalue of a large dimensional sample covariance matrix
- Non-asymptotic theory of random matrices: extreme singular values
- Optimization of a convex program with a polynomial perturbation
- Partial estimation of covariance matrices
- Quantitative estimates of the convergence of the empirical covariance matrix in log-concave ensembles
- RANDOM POINTS IN ISOTROPIC UNCONDITIONAL CONVEX BODIES
- Random vectors in the isotropic position
- Random walks and anO*(n5) volume algorithm for convex bodies
- Sampling convex bodies: a random matrix approach
- Sharp bounds on the rate of convergence of the empirical covariance matrix
- Some estimates of norms of random matrices
- Spectral norm of products of random and deterministic matrices
- The Expected Norm of Random Matrices
- Weak convergence and empirical processes. With applications to statistics
Cited in
(57)- Covariance estimation for distributions with \({2+\varepsilon}\) moments
- The method of perpendiculars of finding estimates from below for minimal singular eigenvalues of random matrices
- Exploring the toolkit of Jean Bourgain
- Mahalanobis metric based clustering for fixed effects model
- Sub-Gaussian estimators of the mean of a random vector
- What Should Be Done When an Estimated Between-Group Covariance Matrix Is Not Nonnegative Definite?
- A simple tool for bounding the deviation of random matrices on geometric sets
- Affine invariant integrated rank-weighted statistical depth: properties and finite sample analysis
- Multivariate factorizable expectile regression with application to fMRI data
- Marcinkiewicz-type discretization of \(L^p\)-norms under the Nikolskii-type inequality assumption
- The power of adaptivity in source identification with time queries on the path
- Multilevel maximum likelihood estimation with application to covariance matrices
- Robust high-dimensional factor models with applications to statistical machine learning
- On the interval of fluctuation of the singular values of random matrices
- Convergence-enhanced subspace channel estimation for MIMO-OFDM systems with virtual carriers
- Robust long-term aircraft heavy maintenance check scheduling optimization under uncertainty
- Streaming principal component analysis from incomplete data
- Modeling High-Dimensional Time Series: A Factor Model With Dynamically Dependent Factors and Diverging Eigenvalues
- Bayesian beta regression for bounded responses with unknown supports
- UNIFORM-IN-SUBMODEL BOUNDS FOR LINEAR REGRESSION IN A MODEL-FREE FRAMEWORK
- Fast convergence on blind and semi-blind channel estimation for MIMO-OFDM systems
- Linear system identifiability from single-cell data
- On the predictive risk in misspecified quantile regression
- Multiscale geometric methods for data sets. I: Multiscale SVD, noise and curvature.
- Likelihood ratio tests for a large directed acyclic graph
- On generic chaining and the smallest singular value of random matrices with heavy tails
- Distributed estimation in heterogeneous reduced rank regression: with application to order determination in sufficient dimension reduction
- A time-distance trade-off for GDD with preprocessing: instantiating the DLW heuristic
- Generalized canonical correlation analysis for classification
- Convergence and finite sample approximations of entropic regularized Wasserstein distances in Gaussian and RKHS settings
- Portfolio construction by mitigating error amplification: the bounded-noise portfolio
- On the finite-sample analysis of \(\Theta\)-estimators
- Row products of random matrices
- Bernstein-von Mises theorems for functionals of the covariance matrix
- Sampling discretization and related problems
- On the finite-sample analysis of \(\Theta\)-estimators
- Preconditioning filter bank decomposition using structured normalized tight frames
- Exponential-Family Embedding With Application to Cell Developmental Trajectories for Single-Cell RNA-Seq Data
- Bootstrap consistency for quadratic forms of sample averages with increasing dimension
- Factorisable multitask quantile regression
- Quantitative estimates of the convergence of the empirical covariance matrix in log-concave ensembles
- Folded concave penalized sparse linear regression: sparsity, statistical performance, and algorithmic theory for local solutions
- Covariance estimation under one-bit quantization
- Restricted isometry property for random matrices with heavy-tailed columns
- From low- to high-dimensional moments without magic
- Estimation of a multiplicative correlation structure in the large dimensional case
- Optimal variable selection in multi-group sparse discriminant analysis
- Estimating covariance and precision matrices along subspaces
- Identification of alterations in the Jacobian of biochemical reaction networks from steady state covariance data at two conditions
- Partial estimation of covariance matrices
- Principal component analysis of hybrid functional and vector data
- Ridge estimation of covariance matrix from data in two classes.
- Asymptotic geometric analysis: achievements and perspective
- Optimal modeling of nonlinear systems: method of variable injections
- Covariance estimation under missing observations and \(L_4 - L_2\) moment equivalence
- Fast random vector transforms in terms of pseudo-inverse within the Wiener filtering paradigm
- The famous American economist H. Markowitz and mathematical overview of his portfolio selection theory
This page was built for publication: How close is the sample covariance matrix to the actual covariance matrix?
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q715740)