Genome-wide association studies with high-dimensional phenotypes
From MaRDI portal
Publication:2344239
Abstract: High-dimensional phenotypes hold promise for richer findings in association studies, but testing several phenotype traits aggravates the grand challenge of association studies: multiple testing. Several methods have recently been proposed for jointly testing all traits in a high-dimensional vector of phenotypes, with the prospect of increased power to detect small effects that would be missed if the traits were tested individually. However, the methods have rarely been compared thoroughly enough to assess their relative merits and to establish guidelines on which method to use and how to use it. We compare the methods on simulated data and on a real metabolomics data set comprising 137 highly correlated variables and approximately 550,000 SNPs. Applying the methods to genome-wide data with hundreds of thousands of markers inevitably requires dividing the problem into manageable parts that facilitate parallel processing, for example parts corresponding to individual genetic variants, pathways, or genes. Here we use a straightforward formulation in which the genome is divided into blocks of nearby correlated genetic markers, and each block is tested jointly for association with the phenotypes. This formulation is computationally feasible, reduces the number of tests, and lets the methods combine information over several correlated variables not only on the phenotype side but also on the genotype side. Our experiments show that canonical correlation analysis has higher power than the alternative methods while remaining computationally tractable for routine use in the GWAS setting, provided the number of samples is sufficient compared to the numbers of phenotype and genotype variables tested. Sparse canonical correlation analysis and regression models with latent confounding factors show promising performance when the number of samples is small.
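The block-wise test described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes classical canonical correlation analysis computed by QR and SVD, scored with Wilks' lambda and Bartlett's chi-square approximation, and it assumes that both matrices have full column rank and that the number of samples clearly exceeds the number of genotype plus phenotype variables. The function names, the simulated data, and the effect sizes are hypothetical and chosen only for illustration.

```python
import numpy as np


def canonical_correlations(X, Y):
    """Canonical correlations between the columns of X (n x p) and Y (n x q).

    Assumes both matrices have full column rank after centering.
    """
    Xc = X - X.mean(axis=0)
    Yc = Y - Y.mean(axis=0)
    # Orthonormal bases of the two centered column spaces.
    Qx, _ = np.linalg.qr(Xc)
    Qy, _ = np.linalg.qr(Yc)
    # The singular values of Qx^T Qy are the canonical correlations.
    s = np.linalg.svd(Qx.T @ Qy, compute_uv=False)
    return np.clip(s, 0.0, 1.0)


def bartlett_statistic(X, Y):
    """Bartlett's approximation to the test of no association.

    Returns the statistic and its degrees of freedom (p * q); under the
    null hypothesis the statistic is approximately chi-square distributed.
    """
    n, p = X.shape
    q = Y.shape[1]
    rho = canonical_correlations(X, Y)
    wilks_lambda = np.prod(1.0 - rho ** 2)
    stat = -(n - 1 - (p + q + 1) / 2.0) * np.log(wilks_lambda)
    return stat, p * q


# Hypothetical simulated example: one block of 5 SNPs, 10 correlated traits.
rng = np.random.default_rng(0)
n = 500
snp_block = rng.binomial(2, 0.3, size=(n, 5)).astype(float)  # 0/1/2 coding
latent = rng.normal(size=(n, 1))                 # shared factor across traits
traits = latent + rng.normal(size=(n, 10))       # correlated phenotypes
traits[:, :4] += 0.3 * snp_block[:, [0]]         # small effect of the first SNP

stat, dof = bartlett_statistic(snp_block, traits)
```

A p-value for one block would follow from the upper tail of a chi-square distribution with `dof` degrees of freedom, for example with `scipy.stats.chi2.sf(stat, dof)`. In a genome-wide run the same function would be applied independently to every block, which is what makes the formulation easy to parallelize.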
Recommendations
- Multiple phenotype association tests using summary statistics in genome-wide association studies
- Sparse canonical correlation analysis with application to genomic data integration
- Fast and accurate genome-wide association test of multiple quantitative traits
- Efficient and accurate multiple-phenotypes regression method for high dimensional data considering population structure
- Combining high-dimensional classification and multiple hypotheses testing for the analysis of big data in genetics
Cites work
- scientific article; zbMATH DE number 3673370
- scientific article; zbMATH DE number 845714
- A sparse PLS for variable selection when integrating omics data
- Bayesian canonical correlation analysis
- Canonical Correlation Analysis: An Overview with Application to Learning Methods
- Extensions of sparse canonical correlation analysis with applications to genomic data
- Quantifying the association between gene expressions and DNA-markers by penalized canonical correlation analysis
- RELATIONS BETWEEN TWO SETS OF VARIATES
- Sparse canonical correlation analysis with application to genomic data integration
- THE STATISTICAL SIGNIFICANCE OF CANONICAL CORRELATIONS
Cited in (12)
- Efficient and accurate multiple-phenotypes regression method for high dimensional data considering population structure
- Complex phylogenetic profiling reveals fundamental genotype-phenotype associations
- Candidate genes associated with susceptibility for SARS-coronavirus
- Phenotyping genetic diseases using an extension of \(\mu\)-scores for multivariate data
- Analysis of phenotype-genotype associations using genomic informational field theory (GIFT)
- Exploratory failure time analysis in large scale genomics
- Combining high-dimensional classification and multiple hypotheses testing for the analysis of big data in genetics
- Genomic control, a new approach to genetic-based association studies.
- Construction and three-way ordination of the Wheat Phenome Atlas
- Analysis of multiple diverse phenotypes via semiparametric canonical correlation analysis
- Fast and accurate genome-wide association test of multiple quantitative traits
- Cross-Trait Prediction Accuracy of Summary Statistics in Genome-Wide Association Studies