Consistency of AIC and BIC in estimating the number of significant components in high-dimensional principal component analysis (Q1650069): Difference between revisions

From MaRDI portal
Importer (talk | contribs)
Created a new Item
 
Set OpenAlex properties.
 
(5 intermediate revisions by 4 users not shown)
Property / author
 
Property / author: Zhi-Dong Bai / rank
Normal rank
 
Property / author
 
Property / author: Zhi-Dong Bai / rank
 
Normal rank
Property / MaRDI profile type
 
Property / MaRDI profile type: MaRDI publication profile / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4769776 / rank
 
Normal rank
Property / cites work
 
Property / cites work: No eigenvalues outside the support of the limiting spectral distribution of large-dimensional sample covariance matrices / rank
 
Normal rank
Property / cites work
 
Property / cites work: On sample eigenvalues in a generalized spiked population model / rank
 
Normal rank
Property / cites work
 
Property / cites work: Limit of the smallest eigenvalue of a large dimensional sample covariance matrix / rank
 
Normal rank
Property / cites work
 
Property / cites work: Selection of components in principal component analysis: A comparison of methods / rank
 
Normal rank
Property / cites work
 
Property / cites work: High-dimensional consistency of rank estimation criteria in multivariate linear model / rank
 
Normal rank
Property / cites work
 
Property / cites work: Consistency of high-dimensional AIC-type and \(C_p\)-type criteria in multivariate linear regression / rank
 
Normal rank
Property / cites work
 
Property / cites work: Multivariate Statistics / rank
 
Normal rank
Property / cites work
 
Property / cites work: Asymptotic distribution of the LR statistic for equality of the smallest eigenvalues in high-dimensional principal component analysis / rank
 
Normal rank
Property / cites work
 
Property / cites work: On estimating the dimensionality in canonical correlation analysis / rank
 
Normal rank
Property / cites work
 
Property / cites work: On the distribution of the largest eigenvalue in principal components analysis / rank
 
Normal rank
Property / cites work
 
Property / cites work: On Consistency and Sparsity for Principal Components Analysis in High Dimensions / rank
 
Normal rank
Property / cites work
 
Property / cites work: Principal component analysis. / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q5405154 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Asymptotic properties of criteria for selection of variables in multiple regression / rank
 
Normal rank
Property / cites work
 
Property / cites work: Strong consistency of the information criterion for model selection in multivariate analysis / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q5449216 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4230824 / rank
 
Normal rank
Property / cites work
 
Property / cites work: A high-dimensional test for the equality of the smallest eigenvalues of a covariance matrix / rank
 
Normal rank
Property / cites work
 
Property / cites work: Estimating the dimension of a model / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4344404 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Selection of the order of an autoregressive model by Akaike's information criterion / rank
 
Normal rank
Property / cites work
 
Property / cites work: Strong convergence of the empirical distribution of eigenvalues of large dimensional random matrices / rank
 
Normal rank
Property / cites work
 
Property / cites work: A consistency property of the AIC for multivariate linear models when the dimension and the sample size are large / rank
 
Normal rank
Property / cites work
 
Property / cites work: Can the strengths of AIC and BIC be shared? A conflict between model indentification and regression estimation / rank
 
Normal rank
Property / cites work
 
Property / cites work: On detection of the number of signals in presence of white noise / rank
 
Normal rank
Property / OpenAlex ID
 
Property / OpenAlex ID: W2799861551 / rank
 
Normal rank
links / mardi / namelinks / mardi / name
 

Latest revision as of 11:29, 30 July 2024

scientific article
Language Label Description Also known as
English
Consistency of AIC and BIC in estimating the number of significant components in high-dimensional principal component analysis
scientific article

    Statements

    Consistency of AIC and BIC in estimating the number of significant components in high-dimensional principal component analysis (English)
    0 references
    0 references
    0 references
    0 references
    0 references
    29 June 2018
    0 references
    This paper deals with the problem of estimating the number of significant components in principal component analysis (PCA), which is known as the dimensionality in PCA. Specifically, let \(y_{1}\),\dots,\(y_{n}\) be a random sample of size \(n\) from a \(p\)-dimensional population with mean \(\mu\) and covariance matrix \(\Sigma\). The problem of estimating the dimensionality is considered as a problem of selecting an appropriate model from the set \(\{M_{0}, M_{1},\dots,M_{p-1}\}\), where \[ M_{k}=\lambda_{k}>\lambda_{k+1}=\dots=\lambda_{p}=\lambda, \] with \(\lambda_{1}\geqq\dots\geqq\lambda_{p}\) the population eigenvalues of the covariance matrix \(\Sigma\). In this context, the authors consider two estimation criteria, AIC [\textit{H. Akaike}, in: 2nd International Symposium on Information Theory, Tsahkadsor 1971, 267--281 (1973; Zbl 0283.62006)] and BIC [\textit{G. Schwarz}, Ann. Stat. 6, 461--464 (1978; Zbl 0379.62005)], and their purpose is to examine the consistency of the estimation criteria under a high-dimensional framework where \(p,n\rightarrow \infty\) such that \(p/n\rightarrow c>0\). It is assumed that the number of significant components, say \(k\), is fixed; that the number of candidate models is greater than \(k\) and that the fourth population moment is finite. Both the cases of \(p<n\) (\(0<c<1\)) and \(p>n\) (\(c>1\)) are discussed. In this last case, modified AIC and BIC criteria given on p.~1060 are considered. The main results of the paper are obtained by techniques from random matrix theory and are summarized as follows: {\parindent=6mm \begin{itemize}\item[a)] For \(0<c<1\), if \(\lambda_1\) is bounded then under the so-called gap condition (C3) given on p.~1057 of the paper, AIC is strongly consistent, but BIC is not. Furthermore, if \(\lambda_k\rightarrow \infty\) AIC is always strongly consistent regardless of whether the gap condition holds, while if \(\lambda_k/\log n\rightarrow \infty\) then BIC is strongly consistent. \item[b)] For \(c>1\), if \(\lambda_1\) is bounded then under the so-called modified gap condition (C5) given on p.~1060 of the paper, the modified AIC is strongly consistent, but the modified BIC is not. Furthermore, if \(\lambda_k\rightarrow \infty\) the modified AIC is always strongly consistent regardless of whether the modified gap condition holds, while if \(\lambda_k/\log n\rightarrow \infty\) then the modified BIC is strongly consistent. \end{itemize}} Finally, simulation studies show that the sufficient conditions given are essential.
    0 references
    0 references
    principal component analysis
    0 references
    dimensionality
    0 references
    AIC
    0 references
    BIC
    0 references
    consistency
    0 references
    high-dimensional asymptotic framework
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references