Bayesian variable selection regression for genome-wide association studies and other large-scale problems
Abstract: We consider applying Bayesian Variable Selection Regression, or BVSR, to genome-wide association studies and similar large-scale regression problems. Currently, typical genome-wide association studies measure hundreds of thousands, or millions, of genetic variants (SNPs), in thousands or tens of thousands of individuals, and attempt to identify regions harboring SNPs that affect some phenotype or outcome of interest. This goal can naturally be cast as a variable selection regression problem, with the SNPs as the covariates in the regression. Characteristic features of genome-wide association studies include the following: (i) a focus primarily on identifying relevant variables, rather than on prediction; and (ii) many relevant covariates may have tiny effects, making it effectively impossible to confidently identify the complete "correct" subset of variables. Taken together, these factors put a premium on having interpretable measures of confidence for individual covariates being included in the model, which we argue is a strength of BVSR compared with alternatives such as penalized regression methods. Here we focus primarily on analysis of quantitative phenotypes, and on appropriate prior specification for BVSR in this setting, emphasizing the idea of considering what the priors imply about the total proportion of variance in outcome explained by relevant covariates. We also emphasize the potential for BVSR to estimate this proportion of variance explained, and hence shed light on the issue of "missing heritability" in genome-wide association studies.
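The abstract's point about "what the priors imply about the total proportion of variance in outcome explained" can be illustrated with a small simulation. The sketch below is not the authors' implementation. It assumes a simple sparse prior (each effect is nonzero with probability `pi` and nonzero effects are drawn from N(0, `sigma_a`^2)), uses Gaussian stand-ins for standardized genotypes, and takes the proportion of variance explained (PVE) to be the signal variance divided by signal plus residual variance. All function and parameter names are hypothetical.

```python
import random
import statistics


def draw_pve(n=100, p=40, pi=0.1, sigma_a=0.5, sigma_e=1.0, seed=0):
    """Draw one coefficient vector from a sparse prior and return its PVE."""
    rng = random.Random(seed)
    # Hypothetical covariates: Gaussian stand-ins for standardized genotypes.
    X = [[rng.gauss(0.0, 1.0) for _ in range(p)] for _ in range(n)]
    # Assumed sparse prior: each effect is nonzero with probability pi,
    # and nonzero effects are drawn from N(0, sigma_a^2).
    beta = [rng.gauss(0.0, sigma_a) if rng.random() < pi else 0.0
            for _ in range(p)]
    # Signal for each individual: the inner product of a row of X with beta.
    signal = [sum(x * b for x, b in zip(row, beta)) for row in X]
    var_signal = statistics.pvariance(signal)
    # PVE: signal variance over signal-plus-residual variance.
    return var_signal / (var_signal + sigma_e ** 2)


def prior_pve_summary(n_draws=100, **kwargs):
    """Return the mean and median PVE implied by the prior settings."""
    draws = [draw_pve(seed=s, **kwargs) for s in range(n_draws)]
    return statistics.mean(draws), statistics.median(draws)


if __name__ == "__main__":
    # Compare what two prior settings imply about PVE before seeing any data.
    for pi, sigma_a in [(0.05, 0.2), (0.2, 0.5)]:
        mean_pve, median_pve = prior_pve_summary(pi=pi, sigma_a=sigma_a)
        print(f"pi={pi}, sigma_a={sigma_a}: "
              f"mean PVE={mean_pve:.3f}, median PVE={median_pve:.3f}")
```

Under these assumptions, raising either `pi` or `sigma_a` shifts the implied PVE upward, so seemingly innocuous choices for the sparsity and effect-size hyperparameters can encode a strong prior belief about how much of the phenotype the covariates explain.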
Recommendations
- Scalable variational inference for Bayesian variable selection in regression, and its accuracy in genetic association studies
- Bayesian large-scale multiple regression with summary statistics from genome-wide association studies
- Fast model-fitting of Bayesian variable selection regression using the iterative complex factorization algorithm
- Assessing a spatial boost model for quantitative trait GWAS
- Multiple loci mapping via model-free variable selection
Cites work
- scientific article; zbMATH DE number 4070082
- scientific article; zbMATH DE number 1906319
- scientific article; zbMATH DE number 845714
- A review of Bayesian variable selection methods: what, how and which
- Bayes Model Averaging with Selection of Regressors
- Bayesian Analysis of Binary and Polychotomous Response Data
- Bayesian Model Averaging for Linear Regression Models
- Bayesian Variable Selection in Linear Regression
- Least angle regression. (With discussion)
- Mixtures of g Priors for Bayesian Variable Selection
- Monte Carlo sampling methods using Markov chains and their applications
- Nonparametric regression using Bayesian variable selection
- Optimal predictive model selection.
- Rao-Blackwellisation of sampling schemes
- Regularization and Variable Selection Via the Elastic Net
- Small-world MCMC and convergence to multi-modal distributions: from slow mixing to fast mixing
- Variable Selection via Nonconcave Penalized Likelihood and its Oracle Properties
Cited in (43)
- HDBRR: a statistical package for high-dimensional Bayesian ridge regression without MCMC
- On the null distribution of Bayes factors in linear regression
- Bayesian variable selection using Knockoffs with applications to genomics
- Variable selection in Bayesian generalized linear‐mixed models: An illustration using candidate gene case‐control association studies
- Consistent skinny Gibbs in probit regression
- Bayesian large-scale multiple regression with summary statistics from genome-wide association studies
- Approximate large-scale Bayesian spatial modeling with application to quantitative magnetic resonance imaging
- A Bayesian graphical model for genome-wide association studies (GWAS)
- Bayesian model selection in complex linear systems, as illustrated in genetic association studies
- Debiased lasso for generalized linear models with a diverging number of covariates
- Neuronized Priors for Bayesian Sparse Linear Regression
- Heritability estimation in high dimensional sparse linear mixed models
- Bayesian clustering of spatial functional data with application to a human mobility study during COVID-19
- A global-local approach for detecting hotspots in multiple-response regression
- Scalable variational inference for Bayesian variable selection in regression, and its accuracy in genetic association studies
- Bayesian beta regression for bounded responses with unknown supports
- An Integrative Bayesian Modeling Approach to Imaging Genetics
- Correlation between relatives given complete genotypes: from identity by descent to identity by function
- A novel Bayesian approach for variable selection in linear regression models
- pi-MASS
- Bayesian Variable Selection for Gaussian Copula Regression Models
- Filtering the Rejection Set While Preserving False Discovery Rate Control
- Application of whole-genome prediction methods for genome-wide association studies: a Bayesian approach
- Comparison between the SSVS and the LASSO for genome wide association studies
- Detection boundary and higher criticism approach for rare and weak genetic effects
- Skinny Gibbs: a consistent and scalable Gibbs sampler for model selection
- Variable prioritization in nonlinear black box methods: a genetic association case study
- Bayesian Model Averaging: A Systematic Review and Conceptual Classification
- scientific article; zbMATH DE number 7370554
- Simultaneous Bayesian analysis of contingency tables in genetic association studies
- Sticky PDMP samplers for sparse and local inference problems
- Assessing a spatial boost model for quantitative trait GWAS
- Tree-based quantitative trait mapping in the presence of external covariates
- Bayesian variable selection for post-analytic interrogation of susceptibility loci
- Fast model-fitting of Bayesian variable selection regression using the iterative complex factorization algorithm
- Accelerating a Gibbs sampler for variable selection on genomics data with summarization and variable pre-selection combining an array DBMS and R
- MMVBVS
- Variable selection for nonparametric Gaussian process priors: Models and computational strategies
- A Bayesian Partially Observable Online Change Detection Approach with Thompson Sampling
- Incorporating biological information into linear models: a Bayesian approach to the selection of pathways and genes
- Improving the efficiency of genomic selection
- A Bayesian method for estimating gene‐level polygenicity under the framework of transcriptome‐wide association study
- A Scalable Empirical Bayes Approach to Variable Selection in Generalized Linear Models
MaRDI item Q141819