Bi-cross-validation for factor analysis
Abstract: Factor analysis is over a century old, but it is still problematic to choose the number of factors for a given data set. The scree test is popular but subjective. The best performing objective methods are recommended on the basis of simulations. We introduce a method based on bi-cross-validation, using randomly held-out submatrices of the data to choose the number of factors. We find it performs better than the leading methods of parallel analysis (PA) and Kaiser's rule. Our performance criterion is based on recovery of the underlying factor-loading (signal) matrix rather than identifying the true number of factors. Like previous comparisons, our work is simulation based. Recent advances in random matrix theory provide principled choices for the number of factors when the noise is homoscedastic, but not for the heteroscedastic case. The simulations we choose are designed using guidance from random matrix theory. In particular, we include factors too small to detect, factors large enough to detect but not large enough to improve the estimate, and two classes of factors large enough to be useful. Much of the advantage of bi-cross-validation comes from cases with factors large enough to detect but too small to be well estimated. We also find that a form of early stopping regularization improves the recovery of the signal matrix.
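The abstract describes bi-cross-validation only at a high level (randomly held-out submatrices used to score candidate numbers of factors). Below is a minimal sketch of that idea, assuming the basic block-holdout scheme of Owen and Perry style BCV with a plain truncated-SVD fit on the retained block; it is not the authors' implementation (their esaBcv R package additionally uses early-stopping alternation for heteroscedastic noise), and all function and variable names here are hypothetical.

```python
# Sketch of bi-cross-validation (BCV) for choosing the number of factors.
# Illustrative only: partitions X into blocks [[A, B], [C, D]], holds out A,
# fits a rank-k truncated SVD to D, predicts A as B D_k^+ C, and picks the
# rank with the smallest average held-out squared error.

import numpy as np

def bcv_rank(X, max_k=10, n_reps=20, holdout_frac=0.25, rng=None):
    """Return the candidate rank (0..max_k) minimizing BCV prediction error."""
    rng = np.random.default_rng(rng)
    n, p = X.shape
    errors = np.zeros(max_k + 1)           # accumulated error per candidate rank

    for _ in range(n_reps):
        # Randomly select held-out rows and columns.
        ho_rows = rng.random(n) < holdout_frac
        ho_cols = rng.random(p) < holdout_frac
        A = X[np.ix_(ho_rows, ho_cols)]     # held-out block
        B = X[np.ix_(ho_rows, ~ho_cols)]
        C = X[np.ix_(~ho_rows, ho_cols)]
        D = X[np.ix_(~ho_rows, ~ho_cols)]   # retained block

        U, s, Vt = np.linalg.svd(D, full_matrices=False)
        for k in range(max_k + 1):
            if k == 0:
                A_hat = np.zeros_like(A)    # rank-0 model predicts zero
            else:
                # Pseudo-inverse of the rank-k truncation of D: V_k S_k^{-1} U_k^T.
                D_k_pinv = (Vt[:k].T / s[:k]) @ U[:, :k].T
                A_hat = B @ D_k_pinv @ C
            errors[k] += np.sum((A - A_hat) ** 2)

    return int(np.argmin(errors))

# Toy usage: rank-3 signal plus heteroscedastic noise.
rng = np.random.default_rng(0)
signal = rng.normal(size=(100, 3)) @ rng.normal(size=(3, 40))
noise = rng.normal(size=(100, 40)) * rng.uniform(0.5, 2.0, size=40)
print(bcv_rank(signal + noise, max_k=8, rng=1))
```

Note that selecting the rank by held-out prediction error targets recovery of the underlying signal matrix rather than identification of the true number of factors, which is the performance criterion the abstract emphasizes.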
Recommendations
- Bi-cross-validation of the SVD and the nonnegative matrix factorization
- Determining the number of factors in approximate factor models by twice K-fold cross validation
- Factor Analysis Revisited – How Many Factors are There?
- Determining the number of factors when the number of factors can increase with sample size
- Model selection for factor analysis: some new criteria and performance comparisons
Cites work
- scientific article; zbMATH DE number 3092167 (no title available)
- A Testing Procedure for Determining the Number of Factors in Approximate Factor Models With Large Datasets
- A general framework for multiple testing dependence
- A rationale and test for the number of factors in factor analysis
- A review of signal subspace speech enhancement and its application to noise robust speech recognition
- Asymptotic analysis of the squared estimation error in misspecified factor models
- Asymptotics of sample eigenstructure for a large dimensional spiked covariance model
- Asymptotics of the principal components estimator of large factor models with weakly influential factors
- Bi-cross-validation for factor analysis
- Bi-cross-validation of the SVD and the nonnegative matrix factorization
- Boosting as a regularized path to a maximum margin classifier
- Boosting with early stopping: convergence and consistency
- Detection of signals by information theoretic criteria: general asymptotic performance analysis
- Determining the Number of Factors in Approximate Factor Models
- Determining the Number of Factors in the General Dynamic Factor Model
- Determining the number of components from the matrix of partial correlations
- Eigenvalue ratio test for the number of factors
- Eigenvalues of large sample covariance matrices of spiked population models
- Equivalence of regularization and truncated iteration in the solution of ill-posed image reconstruction problems
- Factor modeling for high-dimensional time series: inference for the number of factors
- Finite sample approximation results for principal component analysis: A matrix perturbation approach
- How many principal components? Stopping rules for determining the number of non-trivial axes revisited
- Improved penalization for determining the number of factors in approximate factor models
- Latent variable graphical model selection via convex optimization
- Multiple hypothesis testing adjusted for latent variables, with an application to the AGEMAP gene expression data
- Networks, crowds and markets. Reasoning about a highly connected world.
- On a Heuristic Method of Test Construction and its use in Multivariate Analysis
- On early stopping in gradient descent learning
- OptShrink: An Algorithm for Improved Low-Rank Signal Matrix Denoising by Optimal, Data-Driven Singular Value Shrinkage
- Principal component analysis.
- Sample Eigenvalue Based Detection of High-Dimensional Signals in White Noise Using Relatively Few Samples
- Selecting the number of principal components: estimation of the true rank of a noisy matrix
- Statistical analysis of factor models of high dimension
- Tests of Significance for the Latent Roots of Covariance and Correlation Matrices
- The Generalized Dynamic Factor Model
- The Optimal Hard Threshold for Singular Values is 4/√3
- The singular values and vectors of low rank perturbations of large rectangular random matrices
Cited in (21 documents)
- Prediction in functional regression with discretely observed and noisy covariates
- Preprocessing noisy functional data: a multivariate perspective
- Deterministic parallel analysis: an improved method for selecting factors and principal components
- Exploratory bi-factor analysis: the oblique case
- Sparse latent factor regression models for genome-wide and epigenome-wide association studies
- Hypothesis tests for principal component analysis when variables are standardized
- A central limit theorem for the Benjamini-Hochberg false discovery proportion under a factor model
- Safety signal detection with control of latent factors
- esaBcv
- Bayesian generalized linear low rank regression models for the detection of vaccine-adverse event associations
- scientific article; zbMATH DE number 7415123 (no title available)
- Bi-cross-validation of the SVD and the nonnegative matrix factorization
- Unifying and generalizing methods for removing unwanted variation based on negative controls
- Estimating and Accounting for Unobserved Covariates in High-Dimensional Correlated Data
- scientific article; zbMATH DE number 7370637 (no title available)
- Structured latent factor analysis for large-scale data: identifiability, estimability, and their implications
- Bi-cross-validation for factor analysis
- Heteroskedastic PCA: algorithm, optimality, and applications
- Consistently recovering the signal from noisy functional data
- A Matrix-Free Likelihood Method for Exploratory Factor Analysis of High-Dimensional Gaussian Data
- Identifying Effects of Multiple Treatments in the Presence of Unmeasured Confounding