Modeling persistent trends in distributions
From MaRDI portal
Publication:4559709
Abstract: We present a nonparametric framework to model a short sequence of probability distributions that vary both due to underlying effects of sequential progression and confounding noise. To distinguish between these two types of variation and estimate the sequential-progression effects, our approach leverages an assumption that these effects follow a persistent trend. This work is motivated by the recent rise of single-cell RNA-sequencing experiments over a brief time course, which aim to identify genes relevant to the progression of a particular biological process across diverse cell populations. While classical statistical tools focus on scalar-response regression or order-agnostic differences between distributions, it is desirable in this setting to consider both the full distributions as well as the structure imposed by their ordering. We introduce a new regression model for ordinal covariates where responses are univariate distributions and the underlying relationship reflects consistent changes in the distributions over increasing levels of the covariate. This concept is formalized as a "trend" in distributions, which we define as an evolution that is linear under the Wasserstein metric. Implemented via a fast alternating projections algorithm, our method exhibits numerous strengths in simulations and analyses of single-cell gene expression data.
Recommendations
- Wasserstein gradients for the temporal evolution of probability distributions
- A compositional model to assess expression changes from single-cell RNA-seq data
- Distributional (Single) Index Models
- Contrastive latent variable modeling with application to case-control sequencing experiments
- Modelling of repeated ordered measurements by isotonic sequential regression
Cites work
- scientific article; zbMATH DE number 5708571 (Why is no real title available?)
- scientific article; zbMATH DE number 3973706 (Why is no real title available?)
- Comparing the continuous representation of time-series expression profiles to identify differentially expressed genes
- Correctness of Kruskal's algorithms for monotone regression with ties
- Estimation of conditional densities and sensitivity measures in nonlinear dynamical systems
- Methods for Estimating a Conditional Distribution Function
- Noncrossing quantile regression curve estimation
- Permutation tests. A practical guide to resampling methods for testing hypotheses
- Quantile Estimation in Dependent Sequences
- Resampling-based multiple testing for microarray data analysis (With comments)
- Small-Sample Quantile Estimators in a Large Nonparametric Model
- The bootstrap: To smooth or not to smooth?
This page was built for publication: Modeling persistent trends in distributions
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q4559709)