Resistant multiple sparse canonical correlation
From MaRDI portal
Abstract: Canonical Correlation Analysis (CCA) is a multivariate technique that takes two datasets and forms the most highly correlated possible pairs of linear combinations between them. Each subsequent pair of linear combinations is orthogonal to the preceding pair, meaning that new information is gleaned from each pair. By looking at the magnitude of coefficient values, we can find out which variables can be grouped together, thus better understanding multiple interactions that are otherwise difficult to compute or grasp intuitively. CCA appears to have quite powerful applications to high throughput data, as we can use it to discover, for example, relationships between gene expression and gene copy number variation. One of the biggest problems of CCA is that the number of variables (often upwards of 10,000) makes biological interpretation of linear combinations nearly impossible. To limit variable output, we have employed a method known as Sparse Canonical Correlation Analysis (SCCA), while adding estimation which is resistant to extreme observations or other types of deviant data. In this paper, we have demonstrated the success of resistant estimation in variable selection using SCCA. Additionally, we have used SCCA to find multiple canonical pairs for extended knowledge about the datasets at hand. Again, using resistant estimators provided more accurate estimates than standard estimators in the multiple canonical correlation setting. R code is available and documented at https://github.com/hardin47/rmscca.
Recommendations
- Sparse canonical correlation analysis with application to genomic data integration
- Sparse canonical correlation analysis from a predictive point of view
- Extensions of sparse canonical correlation analysis with applications to genomic data
- An iterative penalized least squares approach to sparse canonical correlation analysis
- Sparse canonical covariance analysis for high-throughput data
Cites work
- scientific article; zbMATH DE number 3464727 (Why is no real title available?)
- scientific article; zbMATH DE number 1975294 (Why is no real title available?)
- scientific article; zbMATH DE number 845714 (Why is no real title available?)
- A note on oligonucleotide expression values not being normally distributed
- A penalized matrix decomposition, with applications to sparse principal components and canonical correlation analysis
- Extensions of sparse canonical correlation analysis with applications to genomic data
- Inferring gene-gene interactions and functional modules using sparse canonical correlation analysis
- Least Median of Squares Regression
- Minimax estimation in sparse canonical correlation analysis
- Projection pursuit
- RELATIONS BETWEEN TWO SETS OF VARIATES
- Robust canonical correlations: a comparative study
- Sparse canonical correlation analysis with application to genomic data integration
Cited in
(4)
This page was built for publication: Resistant multiple sparse canonical correlation
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q306679)