Convergence and prediction of principal component scores in high-dimensional settings

From MaRDI portal
Publication:620562

DOI10.1214/10-AOS821zbMATH Open1204.62097arXiv1211.2970WikidataQ42702618 ScholiaQ42702618MaRDI QIDQ620562FDOQ620562

Seunggeun Lee, Fred A. Wright, Fei Zou

Publication date: 19 January 2011

Published in: The Annals of Statistics (Search for Journal in Brave)

Abstract: A number of settings arise in which it is of interest to predict Principal Component (PC) scores for new observations using data from an initial sample. In this paper, we demonstrate that naive approaches to PC score prediction can be substantially biased toward 0 in the analysis of large matrices. This phenomenon is largely related to known inconsistency results for sample eigenvalues and eigenvectors as both dimensions of the matrix increase. For the spiked eigenvalue model for random matrices, we expand the generality of these results, and propose bias-adjusted PC score prediction. In addition, we compute the asymptotic correlation coefficient between PC scores from sample and population eigenvectors. Simulation and real data examples from the genetics literature show the improved bias and numerical properties of our estimators.


Full work available at URL: https://arxiv.org/abs/1211.2970





Cites Work


Cited In (19)

Uses Software


   Recommendations





This page was built for publication: Convergence and prediction of principal component scores in high-dimensional settings

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q620562)