Literature DB >> 18333759

Computation-based discovery of cis-regulatory modules by hidden Markov model.

Jing Wu1, Jun Xie.   

Abstract

A key component in genome sequence analysis is the identification of regions of the genome that contain regulatory information. In higher eukaryotes, this information is organized into modular units called cis-regulatory modules. Each module contains multiple binding sites for a specific combination of several transcription factors. In this article, we propose a hidden Markov model (HMM) to identify transcription factor binding sites (TFBSs) and cis-regulatory modules (CRMs). For a given genomic sequence, we first select potential TFBSs from a large database (e.g., TRANSFAC), then construct an HMM where the TFBSs are only counted when they occur within a specialized CRM state. The novel features of the proposed method include that it does not assume a small set of TFBSs for a given gene; on the other hand, the method utilizes information from a large collection of well-characterized TFBSs and therefore is computationally more efficient and robust than the de novo methods. Our approach is applied to three data sets with experimentally evaluated TFBSs. The method shows better specificity and sensitivity than other similar computational tools in identifying CRMs and TFBSs. The executable codes of our programs and module predictions across the fly Drosophila genome are available at www.stat.purdue.edu/~ jingwu/module/.

Entities:  

Mesh:

Substances:

Year:  2008        PMID: 18333759     DOI: 10.1089/cmb.2008.0024

Source DB:  PubMed          Journal:  J Comput Biol        ISSN: 1066-5277            Impact factor:   1.479


  3 in total

1.  Genome-wide analysis of binding sites and direct target genes of the orphan nuclear receptor NR2F1/COUP-TFI.

Authors:  Celina Montemayor; Oscar A Montemayor; Alex Ridgeway; Feng Lin; David A Wheeler; Scott D Pletcher; Fred A Pereira
Journal:  PLoS One       Date:  2010-01-27       Impact factor: 3.240

2.  Prediction of clustered RNA-binding protein motif sites in the mammalian genome.

Authors:  Chaolin Zhang; Kuang-Yung Lee; Maurice S Swanson; Robert B Darnell
Journal:  Nucleic Acids Res       Date:  2013-05-18       Impact factor: 16.971

3.  The orientation of transcription factor binding site motifs in gene promoter regions: does it matter?

Authors:  Monika Lis; Dirk Walther
Journal:  BMC Genomics       Date:  2016-03-03       Impact factor: 3.969

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.