Gamma-based clustering via ordered means with application to gene-expression analysis
From MaRDI portal
(Redirected from Publication:620546)
Classification and discrimination; cluster analysis (statistical aspects) (62H30) Applications of statistics to biology and medical sciences; meta analysis (62P10) Applications of mathematical programming (90C90) Genetics and epigenetics (92D10) Inequalities; stochastic orderings (60E15) Dynamic programming (90C39)
Abstract: Discrete mixture models provide a well-known basis for effective clustering algorithms, although technical challenges have limited their scope. In the context of gene-expression data analysis, a model is presented that mixes over a finite catalog of structures, each one representing equality and inequality constraints among latent expected values. Computations depend on the probability that independent gamma-distributed variables attain each of their possible orderings. Each ordering event is equivalent to an event in independent negative-binomial random variables, and this finding guides a dynamic-programming calculation. The structuring of mixture-model components according to constraints among latent means leads to strict concavity of the mixture log likelihood. In addition to its beneficial numerical properties, the clustering method shows promising results in an empirical study.
Recommendations
- scientific article; zbMATH DE number 1928681
- An optimal hierarchical clustering algorithm for gene expression data
- scientific article; zbMATH DE number 1945188
- A sequential clustering algorithm with applications to gene expression data
- Some statistical properties of gene expression clustering for array data
- Clustering gene expression data using a posterior split-merge-birth procedure
- Statistical inference for simultaneous clustering of gene expression data
- scientific article; zbMATH DE number 5274665
Cites work
- scientific article; zbMATH DE number 3942813 (Why is no real title available?)
- scientific article; zbMATH DE number 47310 (Why is no real title available?)
- scientific article; zbMATH DE number 194758 (Why is no real title available?)
- scientific article; zbMATH DE number 1911984 (Why is no real title available?)
- scientific article; zbMATH DE number 2202354 (Why is no real title available?)
- A Unified Approach for Simultaneous Gene Clustering and Differential Expression Identification
- Bayesian testing of many hypotheses \(\times \) many genes: a study of sleep apnea
- Compound gamma bivariate distributions
- Dealing With Label Switching in Mixture Models
- Detecting differential gene expression with a semiparametric hierarchical mixture method
- Factor graphs and the sum-product algorithm
- Finite mixture models
- Gaga: a parsimonious and flexible model for differential expression analysis
- Hidden Markov Models for Microarray Time Course Data in Multiple Biological Conditions
- Identifiability of Finite Mixtures of Elliptical Distributions
- Limit theorems for hybridization reactions on oligonucleotide microarrays
- Linear Models and Empirical Bayes Methods for Assessing Differential Expression in Microarray Experiments
- Mixture Densities, Maximum Likelihood and the EM Algorithm
- Mixture Modeling for Genome‐Wide Localization of Transcription Factors
- Model-Based Clustering, Discriminant Analysis, and Density Estimation
- On the Identifiability of Finite Mixtures
- Random-set methods identify distinct aspects of the enrichment signal in gene-set analysis
- Semiparametric models and likelihood -- the power of ranks
- Statistical Methods for Expression Quantitative Trait Loci (eQTL) Mapping
- Statistical analysis of finite mixture distributions
- Statistical significance for genomewide studies
- The 500th Anniversary of the Sharing Problem (The Oldest Problem in the Theory of Probability)
- The analysis of gene expression data. Methods and software
- The elements of statistical learning. Data mining, inference, and prediction
- The gamma distribution and weighted multimodal gamma distributions as models of population abundance
Cited in
(4)- Robust clustering using exponential power mixtures
- Clustering for multivariate continuous and discrete longitudinal data
- High-dimensional count data clustering based on an exponential approximation to the multinomial beta-Liouville distribution
- Explaining mixture models through semantic pattern mining and banded matrix visualization
This page was built for publication: Gamma-based clustering via ordered means with application to gene-expression analysis
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q620546)