Computing the probability of gene trees concordant with the species tree in the multispecies coalescent
From MaRDI portal
(Redirected from Publication:2054865)
Abstract: The multispecies coalescent process models the genealogical relationships of genes sampled from several species, enabling useful predictions about phenomena such as the discordance between the gene tree and the species phylogeny due to incomplete lineage sorting. Conversely, knowledge of large collections of gene trees can inform us about several aspects of the species phylogeny, such as its topology and ancestral population sizes. A fundamental open problem in this context is how to efficiently compute the probability of a gene tree topology, given the species phylogeny. Although a number of algorithms for this task have been proposed, they either produce approximate results, or, when they are exact, they do not scale to large data sets. In this paper, we present some progress towards exact and efficient computation of the probability of a gene tree topology. We provide a new algorithm that, given a species tree and the number of genes sampled for each species, calculates the probability that the gene tree topology will be concordant with the species tree. Moreover, we provide an algorithm that computes the probability of any specific gene tree topology concordant with the species tree. Both algorithms run in polynomial time and have been implemented in Python. Experiments show that they are able to analyse data sets where thousands of genes are sampled, in a matter of minutes to hours.
Recommendations
- Split probabilities and species tree inference under the multispecies coalescent model
- Determining species tree topologies from clade probabilities under the coalescent
- Identifying the rooted species tree from the distribution of unrooted gene trees under the coalescent
- The gene evolution model and computing its associated probabilities
- The probability of topological concordance of gene trees and species trees.
Cites work
- scientific article; zbMATH DE number 2126323 (Why is no real title available?)
- scientific article; zbMATH DE number 3323598 (Why is no real title available?)
- A new scaling and squaring algorithm for the matrix exponential
- Enumeration of compact coalescent histories for matching gene trees and species trees
- Introduction to algorithms
- Line-of-descent and genealogical processes, and their applications in population genetics models
- On the number of non-equivalent ancestral configurations for matching gene trees and species trees
- Properties of phylogenetic trees generated by Yule-type speciation models
- Statistical Inference of Phylogenies
- The probability distribution of ranked gene trees on a species tree
- The probability of topological concordance of gene trees and species trees.
Cited in
(11)- On the number of non-equivalent ancestral configurations for matching gene trees and species trees
- Enumeration of coalescent histories for caterpillar species trees and \(p\)-pseudocaterpillar gene trees
- Site pattern probabilities under the multispecies coalescent and a relaxed molecular clock: theory and applications
- Statistically consistent rooting of species trees under the multispecies coalescent model
- Polynomial-Time Statistical Estimation of Species Trees Under Gene Duplication and Loss
- Coalescent histories for discordant gene trees and species trees
- Determining species tree topologies from clade probabilities under the coalescent
- The probability of topological concordance of gene trees and species trees.
- The distributions under two species-tree models of the total number of ancestral configurations for matching gene trees and species trees
- Algorithmic improvements to species delimitation and phylogeny estimation under the multispecies coalescent
- Split probabilities and species tree inference under the multispecies coalescent model
This page was built for publication: Computing the probability of gene trees concordant with the species tree in the multispecies coalescent
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q2054865)