Pairwise sequence alignment at arbitrarily large evolutionary distance

From MaRDI portal
Publication:6406035

arXiv2207.12543MaRDI QIDQ6406035FDOQ6406035


Authors: Brandon Legried, Sebastien Roch Edit this on Wikidata


Publication date: 25 July 2022

Abstract: Ancestral sequence reconstruction is a key task in computational biology. It consists in inferring a molecular sequence at an ancestral species of a known phylogeny, given descendant sequences at the tip of the tree. In addition to its many biological applications, it has played a key role in elucidating the statistical performance of phylogeny estimation methods. Here we establish a formal connection to another important bioinformatics problem, multiple sequence alignment, where one attempts to best align a collection of molecular sequences under some mismatch penalty score by inserting gaps. Our result is counter-intuitive: we show that perfect pairwise sequence alignment with high probability is possible in principle at arbitrary large evolutionary distances - provided the phylogeny is known and dense enough. We use techniques from ancestral sequence reconstruction in the taxon-rich setting together with the probabilistic analysis of sequence evolution models involving insertions and deletions.













This page was built for publication: Pairwise sequence alignment at arbitrarily large evolutionary distance

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q6406035)