Contrastive latent variable modeling with application to case-control sequencing experiments
Publication:2170381
DOI: 10.1214/21-AOAS1534, zbMATH Open: 1498.62228, arXiv: 2102.06731, OpenAlex: W3131858063, MaRDI QID: Q2170381, FDO: Q2170381
Publication date: 5 September 2022
Published in: The Annals of Applied Statistics
Abstract: High-throughput RNA-sequencing (RNA-seq) technologies are powerful tools for understanding cellular state. Often it is of interest to quantify and summarize changes in cell state that occur between experimental or biological conditions. Differential expression is typically assessed using univariate tests to measure gene-wise shifts in expression. However, these methods largely ignore changes in transcriptional correlation. Furthermore, there is a need to identify the low-dimensional structure of the gene expression shift to identify collections of genes that change between conditions. Here, we propose contrastive latent variable models designed for count data to create a richer portrait of differential expression in sequencing data. These models disentangle the sources of transcriptional variation in different conditions, in the context of an explicit model of variation at baseline. Moreover, we develop a model-based hypothesis testing framework that can test for global and gene subset-specific changes in expression. We test our model through extensive simulations and analyses with count-based gene expression data from perturbation and observational sequencing experiments. We find that our methods can effectively summarize and quantify complex transcriptional changes in case-control experimental sequencing data.
Full work available at URL: https://arxiv.org/abs/2102.06731
Recommendations
- Detecting differential expression in RNA-sequence data using quasi-likelihood with shrunken dispersion estimates
- A semi-parametric Bayesian approach for differential expression analysis of RNA-seq data
- Large scale maximum average power multiple inference on time-course count data with application to RNA-seq analysis
- A two-stage Poisson model for testing RNA-Seq data
- BNP-seq: Bayesian nonparametric differential expression analysis of sequencing count data
Applications of statistics to biology and medical sciences; meta analysis (62P10)
Hypothesis testing in multivariate analysis (62H15)
Cites Work
- A general framework for multiple testing dependence
- Two sample tests for high-dimensional covariance matrices
- Bayes Factors
- Two-Sample Covariance Matrix Testing and Support Recovery in High-Dimensional and Sparse Settings
- Testing differential networks with applications to the detection of gene-gene interactions
- Multivariate analysis and Jacobi ensembles: largest eigenvalue, Tracy-Widom limits and rates of convergence
- Testing the equality of several covariance matrices with fewer observations than the dimension
- Two-sample tests for high-dimension, strongly spiked eigenvalue models
- Testing high-dimensional covariance matrices, with application to detecting schizophrenia risk genes
- Equality tests of high-dimensional covariance matrices under the strongly spiked eigenvalue model