Waiting for regulatory sequences to appear
From MaRDI portal
Publication:2467108
Applications of Markov chains and discrete-time Markov processes on general state spaces (social mobility, learning theory, industrial processes, etc.) (60J20) Genetics and epigenetics (92D10) Problems related to evolution (92D15) Protein sequences, DNA sequences (92D20) Central limit and other weak theorems (60F05) Computational methods for problems pertaining to biology (92-08)
Abstract: One possible explanation for the substantial organismal differences between humans and chimpanzees is that there have been changes in gene regulation. Given what is known about transcription factor binding sites, this motivates the following probability question: given a 1000 nucleotide region in our genome, how long does it take for a specified six to nine letter word to appear in that region in some individual? Stone and Wray [Mol. Biol. Evol. 18 (2001) 1764--1770] computed 5,950 years as the answer for six letter words. Here, we will show that for words of length 6, the average waiting time is 100,000 years, while for words of length 8, the waiting time has mean 375,000 years when there is a 7 out of 8 letter match in the population consensus sequence (an event of probability roughly 5/16) and has mean 650 million years when there is not. Fortunately, in biological reality, the match to the target word does not have to be perfect for binding to occur. If we model this by saying that a 7 out of 8 letter match is good enough, the mean reduces to about 60,000 years.
Recommendations
- On the waiting time until coordinated mutations get fixed in regulatory sequences
- A stochastic model for the evolution of transcription factor binding site abundance
- Waiting for \(m\) mutations
- Waiting times for clumps of patterns and for structured motifs in random sequences
- On expected waiting time until given words appear in random sequence
Cites work
- scientific article; zbMATH DE number 5819433 (Why is no real title available?)
- scientific article; zbMATH DE number 3540347 (Why is no real title available?)
- scientific article; zbMATH DE number 2070281 (Why is no real title available?)
- scientific article; zbMATH DE number 1761225 (Why is no real title available?)
- Lower bounds for covering times for reversible Markov chains and random walks on graphs
- Mathematical population genetics. I: Theoretical introduction.
- Probability approximations via the Poisson clumping heuristic
- Two moments suffice for Poisson approximations: The Chen-Stein method
Cited in
(5)- Using statistical methods to model the fine-tuning of molecular machines and systems
- On the waiting time until coordinated mutations get fixed in regulatory sequences
- OPTIMAL CONTROL OF GENETIC DIVERSITY IN THE MORAN MODEL WITH POPULATION GROWTH
- A stochastic model for the evolution of transcription factor binding site abundance
- A waiting time problem arising from the study of multi-stage carcinogenesis
This page was built for publication: Waiting for regulatory sequences to appear
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q2467108)