Angle-based joint and individual variation explained
From MaRDI portal
(Redirected from Publication:130628)
Abstract: Integrative analysis of disparate data blocks measured on a common set of experimental subjects is a major challenge in modern data analysis. This data structure naturally motivates the simultaneous exploration of the joint and individual variation within each data block resulting in new insights. For instance, there is a strong desire to integrate the multiple genomic data sets in The Cancer Genome Atlas to characterize the common and also the unique aspects of cancer genetics and cell biology for each source. In this paper we introduce Angle-Based Joint and Individual Variation Explained capturing both joint and individual variation within each data block. This is a major improvement over earlier approaches to this challenge in terms of a new conceptual understanding, much better adaption to data heterogeneity and a fast linear algebra computation. Important mathematical contributions are the use of score subspaces as the principal descriptors of variation structure and the use of perturbation theory as the guide for variation segmentation. This leads to an exploratory data analysis method which is insensitive to the heterogeneity among data blocks and does not require separate normalization. An application to cancer data reveals different behaviors of each type of signal in characterizing tumor subtypes. An application to a mortality data set reveals interesting historical lessons. Software and data are available at GitHub <https://github.com/MeileiJiang/AJIVE_Project>.
Recommendations
- Joint and individual variation explained (JIVE) for integrated analysis of multiple data types
- Covariate‐driven factorization by thresholding for multiblock data
- Incorporating covariates into integrated factor analysis of multi-view data
- Structural learning and integrative decomposition of multi-view data
- Integrative factorization of bidimensionally linked matrices
Cites work
- scientific article; zbMATH DE number 47363 (Why is no real title available?)
- A flag representation for finite collections of subspaces of mixed dimensions
- A penalized matrix decomposition, with applications to sparse principal components and canonical correlation analysis
- Canonical analysis of several sets of variables
- Canonical ridge and econometrics of joint production
- Joint and individual variation explained (JIVE) for integrated analysis of multiple data types
- Multivariate T-Distributions and Their Applications
- Numerical Methods for Computing Angles Between Linear Subspaces
- On principal angles between subspaces in \(\mathbb{R}^n\)
- Overview of object oriented data analysis
- Perturbation bounds in connection with singular value decomposition
- Quantifying the association between gene expressions and DNA-markers by penalized canonical correlation analysis
- RELATIONS BETWEEN TWO SETS OF VARIATES
- Rate-optimal perturbation bounds for singular subspaces with applications to high-dimensional statistics
- Relations among \(m\) sets of measures
- Sparse canonical correlation analysis with application to genomic data integration
Cited in
(27)- Hierarchical nuclear norm penalization for multi-view data integration
- Comments on ``Data science, big data and statistics
- Jackstraw inference for AJIVE data integration
- Joint and individual variation explained (JIVE) for integrated analysis of multiple data types
- High-Dimensional Factor Regression for Heterogeneous Subpopulations
- Joint and individual analysis of breast cancer histologic images and genomic covariates
- Persistent topology of protein space
- A survey of high dimension low sample size asymptotics
- Functional random effects modeling of brain shape and connectivity
- CDPA: common and distinctive pattern analysis between high-dimensional datasets
- Double-Matched Matrix Decomposition for Multi-View Data
- Simultaneous non-Gaussian component analysis (SING) for data integration in neuroimaging
- Decomposition of Variation of Mixed Variables by a Latent Mixed Gaussian Copula Model
- Multiview cluster aggregation and splitting, with an application to multiomic breast cancer data
- AJIVE
- Joint association and classification analysis of multi‐view data
- D-CCA: A Decomposition-Based Canonical Correlation Analysis for High-Dimensional Datasets
- Group linear non-Gaussian component analysis with applications to neuroimaging
- Covariate‐driven factorization by thresholding for multiblock data
- sJIVE: supervised joint and individual variation explained
- RaJIVE
- scientific article; zbMATH DE number 7370626 (Why is no real title available?)
- Perturbed factor analysis: accounting for group differences in exposure profiles
- Analysis of joint shape variation from multi-object complexes
- scientific article; zbMATH DE number 7626765 (Why is no real title available?)
- Sparse and integrative principal component analysis for multiview data
- Percolate: an exponential family JIVE model to design DNA-based predictors of drug response
This page was built for publication: Angle-based joint and individual variation explained
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q130628)