Asymptotic properties of the first principal component and equality tests of covariance matrices in high-dimension, low-sample-size context
Publication:899373
DOI: 10.1016/J.JSPI.2015.10.007; zbMATH Open: 1381.62146; arXiv: 1503.07302; OpenAlex: W2963212318; MaRDI QID: Q899373; FDO: Q899373
Authors: Aki Ishii, Kazuyoshi Yata, Makoto Aoshima
Publication date: 28 December 2015
Published in: Journal of Statistical Planning and Inference
Abstract: A common feature of high-dimensional data is that the data dimension is high while the sample size is relatively low. We call such data high-dimension, low-sample-size (HDLSS) data. In this paper, we study asymptotic properties of the first principal component (PC) in the HDLSS context and apply them to equality tests of covariance matrices for high-dimensional data sets. We consider HDLSS asymptotic theory as the dimension grows, both when the sample size is fixed and when the sample size goes to infinity. We introduce an eigenvalue estimator based on the noise-reduction methodology and provide asymptotic distributions of the largest eigenvalue in the HDLSS context. We construct a confidence interval for the first contribution ratio. We also give asymptotic properties of the first PC direction and PC score. We apply the findings to equality tests of two covariance matrices in the HDLSS context. We provide numerical results and discuss the performance of both the estimates of the first PC and the equality tests of two covariance matrices.
Full work available at URL: https://arxiv.org/abs/1503.07302
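The noise-reduction eigenvalue estimator mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not state the formula, so the form used here is an assumption: the j-th eigenvalue of the n x n dual sample covariance matrix is corrected by subtracting the average of the remaining eigenvalue mass, (tr(S_D) - sum of the first j eigenvalues) / (n - 1 - j). The function name `noise_reduced_eigenvalues` and the simulated single-spike data are hypothetical and serve only as illustration.

```python
import numpy as np


def noise_reduced_eigenvalues(X):
    """Sketch of a noise-reduction eigenvalue estimate (assumed form).

    X is a d x n data matrix whose columns are observations (d >> n).
    Returns (lam_hat, lam_tilde): the n ordinary sample eigenvalues and
    the n - 2 noise-reduced eigenvalues for j = 1, ..., n - 2.
    """
    d, n = X.shape
    # Center each variable (row) across the n observations.
    Xc = X - X.mean(axis=1, keepdims=True)
    # Dual sample covariance matrix: n x n instead of d x d.
    S_D = Xc.T @ Xc / (n - 1)
    # Eigenvalues in decreasing order.
    lam_hat = np.sort(np.linalg.eigvalsh(S_D))[::-1]
    j = np.arange(1, n - 1)  # j = 1, ..., n - 2
    # Remaining eigenvalue mass after removing the first j eigenvalues.
    remaining = np.trace(S_D) - np.cumsum(lam_hat)[: n - 2]
    lam_tilde = lam_hat[: n - 2] - remaining / (n - 1 - j)
    return lam_hat, lam_tilde


# Hypothetical HDLSS example: d = 2000 variables, n = 10 observations,
# with one strongly spiked direction along the first coordinate.
rng = np.random.default_rng(0)
d, n = 2000, 10
X = rng.standard_normal((d, n))
X[0, :] *= np.sqrt(d)  # population variance d in the first coordinate
lam_hat, lam_tilde = noise_reduced_eigenvalues(X)
print("largest sample eigenvalue:       ", lam_hat[0])
print("noise-reduced largest eigenvalue:", lam_tilde[0])
```

Working with the n x n dual matrix rather than the d x d sample covariance matrix keeps the computation cheap when d is much larger than n, since both share the same nonzero eigenvalues.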
Recommendations
- PCA and eigen-inference for a spiked covariance model with largest eigenvalues of same asymptotic order
- PCA consistency in high dimension, low sample size context
- Statistical inference for high-dimension, low-sample-size data
- Boundary behavior in high dimension, low sample size asymptotics of PCA
- The statistics and mathematics of high dimension low sample size asymptotics
Cites Work
- PCA consistency in high dimension, low sample size context
- A two-sample test for high-dimensional data with applications to gene-set testing
- Title not available
- Geometric Representation of High Dimension, Low Sample Size Data
- PCA consistency for the power spiked model in high-dimensional settings
- The high-dimension, low-sample-size geometric representation holds under mild conditions
- PCA Consistency for Non-Gaussian Data in High Dimension, Low Sample Size Context
- Effective PCA for high-dimension, low-sample-size data with noise reduction via geometric representations
- Boundary behavior in high dimension, low sample size asymptotics of PCA
- Testing the equality of several covariance matrices with fewer observations than the dimension
- Asymptotic normality for inference on multisample, high-dimensional mean vectors under mild conditions
- Two-stage procedures for high-dimensional data
- Effective PCA for high-dimension, low-sample-size data with singular value decomposition of cross data matrix
Cited In (14)
- A test of sphericity for high-dimensional data and its application for detection of divergently spiked noise
- A High-Dimensional Two-Sample Test for Non-Gaussian Data under a Strongly Spiked Eigenvalue Model
- Reconstruction of a high-dimensional low-rank matrix
- A survey of high dimension low sample size asymptotics
- Statistical inference under the strongly spiked eigenvalue model
- Shrinkage priors for single-spiked covariance models
- Comparison of Correction Factors and Sample Size Required to Test the Equality of the Smallest Eigenvalues in Principal Component Analysis
- Projected tests for high-dimensional covariance matrices
- Equality tests of high-dimensional covariance matrices under the strongly spiked eigenvalue model
- Inference on high-dimensional mean vectors under the strongly spiked eigenvalue model
- Hypothesis tests for high-dimensional covariance structures
- Test for high-dimensional outliers with principal component analysis
- A classifier under the strongly spiked eigenvalue model in high-dimension, low-sample-size context
- Title not available