Number of relevant directions in Principal Component Analysis and Wishart random matrices
Publication:6229869
arXiv: 1112.5391 · MaRDI QID: Q6229869 · FDO: Q6229869
Publication date: 22 December 2011
Abstract: We compute analytically, for large $N$, the probability $\mathcal{P}(N_+,N)$ that an $N\times N$ Wishart random matrix has $N_+$ eigenvalues exceeding a threshold $N\zeta$, including its large deviation tails. This probability plays a benchmark role when performing the Principal Component Analysis of a large empirical dataset. We find that $\mathcal{P}(N_+,N)\approx\exp\left(-\beta N^2 \psi_\zeta(N_+/N)\right)$, where $\beta$ is the Dyson index of the ensemble and $\psi_\zeta(\kappa)$ is a rate function that we compute explicitly in the full range $0\leq\kappa\leq 1$ and for any $\zeta$. The rate function $\psi_\zeta(\kappa)$ displays a quadratic behavior modulated by a logarithmic singularity close to its minimum $\kappa^\star(\zeta)$. This is shown to be a consequence of a phase transition in an associated Coulomb gas problem. The variance $\Delta(N)$ of the number of relevant components is also shown to grow universally (independent of $\zeta$) as $\Delta(N)\sim(\beta\pi^2)^{-1}\ln N$ for large $N$.
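The counting statistic described in the abstract can be explored numerically. The following is a minimal Monte Carlo sketch, not the authors' analytical method: it assumes real Gaussian data (Dyson index 1), builds a Wishart matrix as the product of a random data matrix with its transpose, and counts the eigenvalues above a threshold proportional to the matrix size. The function names `count_above_threshold` and `sample_counts`, the parameter names, and the chosen sizes are all hypothetical and picked for illustration only.

```python
import numpy as np


def count_above_threshold(n, zeta, rng, m=None):
    """Count eigenvalues of one n x n real Wishart matrix that exceed n * zeta.

    Assumption: the Wishart matrix is W = X^T X, where X is an m x n matrix
    of independent standard normal entries (m defaults to n).
    """
    m = n if m is None else m
    x = rng.standard_normal((m, n))
    w = x.T @ x
    # eigvalsh is appropriate because W is real symmetric.
    eigenvalues = np.linalg.eigvalsh(w)
    return int(np.sum(eigenvalues > n * zeta))


def sample_counts(n, zeta, trials, seed=0):
    """Repeat the count over independent Wishart samples."""
    rng = np.random.default_rng(seed)
    return np.array(
        [count_above_threshold(n, zeta, rng) for _ in range(trials)]
    )


if __name__ == "__main__":
    n = 50
    counts = sample_counts(n=n, zeta=1.0, trials=200)
    # Empirical mean fraction and sample variance of the count.
    print("mean fraction above threshold:", counts.mean() / n)
    print("sample variance of the count:", counts.var(ddof=1))
```

Direct sampling of this kind only probes typical fluctuations of the count around its mean. The large deviation tails discussed in the abstract are exponentially rare in the square of the matrix size, so they are out of reach for a naive simulation at any appreciable size.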