Coupling hidden Markov models for the discovery of Cis-regulatory modules in multiple species

From MaRDI portal
Publication:995727

DOI10.1214/07-AOAS103zbMATH Open1129.62111arXiv0708.4318OpenAlexW2021723272MaRDI QIDQ995727FDOQ995727


Authors: Qing Zhou, Wing H. Wong Edit this on Wikidata


Publication date: 10 September 2007

Published in: The Annals of Applied Statistics (Search for Journal in Brave)

Abstract: Cis-regulatory modules (CRMs) composed of multiple transcription factor binding sites (TFBSs) control gene expression in eukaryotic genomes. Comparative genomic studies have shown that these regulatory elements are more conserved across species due to evolutionary constraints. We propose a statistical method to combine module structure and cross-species orthology in de novo motif discovery. We use a hidden Markov model (HMM) to capture the module structure in each species and couple these HMMs through multiple-species alignment. Evolutionary models are incorporated to consider correlated structures among aligned sequence positions across different species. Based on our model, we develop a Markov chain Monte Carlo approach, MultiModule, to discover CRMs and their component motifs simultaneously in groups of orthologous sequences from multiple species. Our method is tested on both simulated and biological data sets in mammals and Drosophila, where significant improvement over other motif and module discovery methods is observed.


Full work available at URL: https://arxiv.org/abs/0708.4318




Recommendations





Cited In (9)





This page was built for publication: Coupling hidden Markov models for the discovery of Cis-regulatory modules in multiple species

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q995727)