Literature DB >> 14668220

Combining phylogenetic data with co-regulated genes to identify regulatory motifs.

Ting Wang1, Gary D Stormo.   

Abstract

MOTIVATION: Discovery of regulatory motifs in unaligned DNA sequences remains a fundamental problem in computational biology. Two categories of algorithms have been developed to identify common motifs from a set of DNA sequences. The first can be called a 'multiple genes, single species' approach. It proposes that a degenerate motif is embedded in some or all of the otherwise unrelated input sequences and tries to describe a consensus motif and identify its occurrences. It is often used for co-regulated genes identified through experimental approaches. The second approach can be called 'single gene, multiple species'. It requires orthologous input sequences and tries to identify unusually well conserved regions by phylogenetic footprinting. Both approaches perform well, but each has some limitations. It is tempting to combine the knowledge of co-regulation among different genes and conservation among orthologous genes to improve our ability to identify motifs.
RESULTS: Based on the Consensus algorithm previously established by our group, we introduce a new algorithm called PhyloCon (Phylogenetic Consensus) that takes into account both conservation among orthologous genes and co-regulation of genes within a species. This algorithm first aligns conserved regions of orthologous sequences into multiple sequence alignments, or profiles, then compares profiles representing non-orthologous sequences. Motifs emerge as common regions in these profiles. Here we present a novel statistic to compare profiles of DNA sequences and a greedy approach to search for common subprofiles. We demonstrate that PhyloCon performs well on both synthetic and biological data. AVAILABILITY: Software available upon request from the authors. http://ural.wustl.edu/softwares.html

Mesh:

Year:  2003        PMID: 14668220     DOI: 10.1093/bioinformatics/btg329

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  128 in total

1.  CisModule: de novo discovery of cis-regulatory modules by hierarchical mixture modeling.

Authors:  Qing Zhou; Wing H Wong
Journal:  Proc Natl Acad Sci U S A       Date:  2004-08-05       Impact factor: 11.205

2.  Known and novel post-transcriptional regulatory sequences are conserved across plant families.

Authors:  Justin N Vaughn; Sally R Ellingson; Flavio Mignone; Albrecht von Arnim
Journal:  RNA       Date:  2012-01-11       Impact factor: 4.942

3.  Fast, sensitive discovery of conserved genome-wide motifs.

Authors:  Nnamdi E Ihuegbu; Gary D Stormo; Jeremy Buhler
Journal:  J Comput Biol       Date:  2012-02       Impact factor: 1.479

4.  Motif analysis unveils the possible co-regulation of chloroplast genes and nuclear genes encoding chloroplast proteins.

Authors:  Ying Wang; Jun Ding; Henry Daniell; Haiyan Hu; Xiaoman Li
Journal:  Plant Mol Biol       Date:  2012-06-26       Impact factor: 4.076

5.  Novel sequence-based method for identifying transcription factor binding sites in prokaryotic genomes.

Authors:  Gurmukh Sahota; Gary D Stormo
Journal:  Bioinformatics       Date:  2010-08-31       Impact factor: 6.937

6.  Genome-wide identification of cis-regulatory motifs and modules underlying gene coregulation using statistics and phylogeny.

Authors:  Hervé Rouault; Khalil Mazouni; Lydie Couturier; Vincent Hakim; François Schweisguth
Journal:  Proc Natl Acad Sci U S A       Date:  2010-07-29       Impact factor: 11.205

7.  An alignment-free method to identify candidate orthologous enhancers in multiple Drosophila genomes.

Authors:  Manonmani Arunachalam; Karthik Jayasurya; Pavel Tomancak; Uwe Ohler
Journal:  Bioinformatics       Date:  2010-07-11       Impact factor: 6.937

8.  Imogene: identification of motifs and cis-regulatory modules underlying gene co-regulation.

Authors:  Hervé Rouault; Marc Santolini; François Schweisguth; Vincent Hakim
Journal:  Nucleic Acids Res       Date:  2014-03-25       Impact factor: 16.971

9.  De novo prediction of DNA-binding specificities for Cys2His2 zinc finger proteins.

Authors:  Anton V Persikov; Mona Singh
Journal:  Nucleic Acids Res       Date:  2013-10-03       Impact factor: 16.971

10.  Discovery, validation, and genetic dissection of transcription factor binding sites by comparative and functional genomics.

Authors:  Jason Gertz; Linda Riles; Peter Turnbaugh; Su-Wen Ho; Barak A Cohen
Journal:  Genome Res       Date:  2005-08       Impact factor: 9.043

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.