Literature DB >> 10481030

Comparison of five methods for finding conserved sequences in multiple alignments of gene regulatory regions.

N Stojanovic1, L Florea, C Riemer, D Gumucio, J Slightom, M Goodman, W Miller, R Hardison.   

Abstract

Conserved segments in DNA or protein sequences are strong candidates for functional elements and thus appropriate methods for computing them need to be developed and compared. We describe five methods and computer programs for finding highly conserved blocks within previously computed multiple alignments, primarily for DNA sequences. Two of the methods are already in common use; these are based on good column agreement and high information content. Three additional methods find blocks with minimal evolutionary change, blocks that differ in at most k positions per row from a known center sequence and blocks that differ in at most k positions per row from a center sequence that is unknown a priori. The center sequence in the latter two methods is a way to model potential binding sites for known or unknown proteins in DNA sequences. The efficacy of each method was evaluated by analysis of three extensively analyzed regulatory regions in mammalian beta-globin gene clusters and the control region of bacterial arabinose operons. Although all five methods have quite different theoretical underpinnings, they produce rather similar results on these data sets when their parameters are adjusted to best approximate the experimental data. The optimal parameters for the method based on information content varied little for different regulatory regions of the beta-globin gene cluster and hence may be extrapolated to many other regulatory regions. The programs based on maximum allowed mismatches per row have simple parameters whose values can be chosen a priori and thus they may be more useful than the other methods when calibration against known functional sites is not available.

Entities:  

Mesh:

Substances:

Year:  1999        PMID: 10481030      PMCID: PMC148654          DOI: 10.1093/nar/27.19.3899

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


  26 in total

1.  Generation and comparative analysis of approximately 3.3 Mb of mouse genomic sequence orthologous to the region of human chromosome 7q11.23 implicated in Williams syndrome.

Authors:  Udaya DeSilva; Laura Elnitski; Jacquelyn R Idol; Johannah L Doyle; Weiniu Gan; James W Thomas; Scott Schwartz; Nicole L Dietrich; Stephen M Beckstrom-Sternberg; Jennifer C McDowell; Robert W Blakesley; Gerard G Bouffard; Pamela J Thomas; Jeffrey W Touchman; Webb Miller; Eric D Green
Journal:  Genome Res       Date:  2002-01       Impact factor: 9.043

2.  A predictive model for regulatory sequences directing liver-specific transcription.

Authors:  W Krivan; W W Wasserman
Journal:  Genome Res       Date:  2001-09       Impact factor: 9.043

3.  Web-based visualization tools for bacterial genome alignments.

Authors:  L Florea; C Riemer; S Schwartz; Z Zhang; N Stojanovic; W Miller; M McClelland
Journal:  Nucleic Acids Res       Date:  2000-09-15       Impact factor: 16.971

4.  Identification and characterization of multi-species conserved sequences.

Authors:  Elliott H Margulies; Mathieu Blanchette; David Haussler; Eric D Green
Journal:  Genome Res       Date:  2003-12       Impact factor: 9.043

5.  CONREAL: conserved regulatory elements anchored alignment algorithm for identification of transcription factor binding sites by phylogenetic footprinting.

Authors:  Eugene Berezikov; Victor Guryev; Ronald H A Plasterk; Edwin Cuppen
Journal:  Genome Res       Date:  2003-12-12       Impact factor: 9.043

6.  Analysis of multiple genomic sequence alignments: a web resource, online tools, and lessons learned from analysis of mammalian SCL loci.

Authors:  Michael A Chapman; Ian J Donaldson; James Gilbert; Darren Grafham; Jane Rogers; Anthony R Green; Berthold Göttgens
Journal:  Genome Res       Date:  2004-01-12       Impact factor: 9.043

Review 7.  Cross-species sequence comparisons: a review of methods and available resources.

Authors:  Kelly A Frazer; Laura Elnitski; Deanna M Church; Inna Dubchak; Ross C Hardison
Journal:  Genome Res       Date:  2003-01       Impact factor: 9.043

8.  Definition of the tempo of sequence diversity across an alignment and automatic identification of sequence motifs: Application to protein homologous families and superfamilies.

Authors:  Alex C W May
Journal:  Protein Sci       Date:  2002-12       Impact factor: 6.725

9.  EnteriX 2003: Visualization tools for genome alignments of Enterobacteriaceae.

Authors:  Liliana Florea; Michael McClelland; Cathy Riemer; Scott Schwartz; Webb Miller
Journal:  Nucleic Acids Res       Date:  2003-07-01       Impact factor: 16.971

10.  Regulatory potential scores from genome-wide three-way alignments of human, mouse, and rat.

Authors:  Diana Kolbe; James Taylor; Laura Elnitski; Pallavi Eswara; Jia Li; Webb Miller; Ross Hardison; Francesca Chiaromonte
Journal:  Genome Res       Date:  2004-04       Impact factor: 9.043

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.