The combinatorics of tandem duplication

From MaRDI portal
Publication:494419

DOI10.1016/J.DAM.2015.05.014zbMATH Open1319.05009arXiv1402.0104OpenAlexW1528552012MaRDI QIDQ494419FDOQ494419


Authors: Luca Penso-Dolfin, Taoyang Wu, Chris D. Greenman Edit this on Wikidata


Publication date: 1 September 2015

Published in: Discrete Applied Mathematics (Search for Journal in Brave)

Abstract: Tandem duplication is an evolutionary process whereby a segment of DNA is replicated and proximally inserted. The different configurations that can arise from this process give rise to some interesting combinatorial questions. Firstly, we introduce an algebraic formalism to represent this process as a word producing automaton. The number of words arising from n tandem duplications can then be recursively derived. Secondly, each single word accounts for multiple evolutions. With the aid of a bi-coloured 2d- tree, a Hasse diagram corresponding to a partially ordered set is constructed, from which we can count the number of evolutions corresponding to a given word. Thirdly, we implement some subtree prune and graft operations on this structure to show that the total number of possible evolutions arising from n tandem duplications is prodk=1n(4k(2k+1)). The space of structures arising from tandem duplication thus grows at a super-exponential rate with leading order term mathcalO(4frac12n2).


Full work available at URL: https://arxiv.org/abs/1402.0104




Recommendations




Cites Work


Cited In (9)





This page was built for publication: The combinatorics of tandem duplication

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q494419)