Scaled Torus Principal Component Analysis

Publication:6180732

DOI: 10.1080/10618600.2022.2119985, arXiv: 2110.04758, MaRDI QID: Q6180732, FDO: Q6180732


Authors: Eduardo García-Portugués, J. S. Marron


Publication date: 22 January 2024

Published in: Journal of Computational and Graphical Statistics

Abstract: A particularly challenging context for dimensionality reduction is multivariate circular data, i.e., data supported on a torus. Such data appear, e.g., in the analysis of various phenomena in ecology and astronomy, as well as in molecular structures. This paper introduces Scaled Torus Principal Component Analysis (ST-PCA), a novel approach to dimensionality reduction for toroidal data. ST-PCA finds a data-driven map from a torus to a sphere of the same dimension and a certain radius. The map is constructed with multidimensional scaling to minimize the discrepancy between pairwise geodesic distances in both spaces. ST-PCA then resorts to principal nested spheres to obtain a nested sequence of subspheres that best fits the data, which can afterwards be inverted back to the torus. Numerical experiments illustrate how ST-PCA can be used to achieve meaningful dimensionality reduction on low-dimensional tori, particularly for the purpose of cluster separation, while two data applications, in astronomy (on a three-dimensional torus) and molecular biology (on a seven-dimensional torus), show that ST-PCA outperforms existing methods for the investigated datasets.
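
The distance-matching step mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes the unscaled flat-torus metric on angles given in radians, and it substitutes classical (Euclidean) multidimensional scaling for the sphere-targeted scaling that ST-PCA uses. The function names `torus_geodesic_distances` and `classical_mds` are hypothetical and chosen here for illustration only.

```python
import numpy as np


def torus_geodesic_distances(theta):
    """Pairwise geodesic distances on the flat torus.

    theta: (n, d) array of angles in radians (assumed convention).
    Returns an (n, n) symmetric distance matrix.
    """
    theta = np.asarray(theta, dtype=float)
    # Coordinate-wise differences between every pair of observations.
    diff = theta[:, None, :] - theta[None, :, :]
    # Wrap each difference to [-pi, pi) so the shorter arc is used.
    wrapped = (diff + np.pi) % (2 * np.pi) - np.pi
    return np.sqrt((wrapped ** 2).sum(axis=-1))


def classical_mds(dist, k):
    """Classical (Euclidean) MDS: a stand-in, not the spherical variant.

    dist: (n, n) distance matrix; k: target dimension.
    Returns an (n, k) configuration whose Euclidean distances
    approximate the entries of dist.
    """
    n = dist.shape[0]
    # Double-centre the squared distances to obtain a Gram matrix.
    centering = np.eye(n) - np.ones((n, n)) / n
    gram = -0.5 * centering @ (dist ** 2) @ centering
    evals, evecs = np.linalg.eigh(gram)
    # Keep the k largest eigenvalues; clip negatives, since toroidal
    # distances are generally not exactly Euclidean-embeddable.
    order = np.argsort(evals)[::-1][:k]
    scale = np.sqrt(np.clip(evals[order], 0.0, None))
    return evecs[:, order] * scale


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    angles = rng.uniform(-np.pi, np.pi, size=(50, 3))  # synthetic 3-torus sample
    distances = torus_geodesic_distances(angles)
    embedding = classical_mds(distances, k=3)
    print(distances.shape, embedding.shape)
```

Wrapping each angular difference before squaring is what separates the toroidal distance from a naive Euclidean one: two angles near pi and -pi are treated as close rather than far apart.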


Full work available at URL: https://arxiv.org/abs/2110.04758












