Literature DB >> 8211139

Detecting subtle sequence signals: a Gibbs sampling strategy for multiple alignment.

C E Lawrence1, S F Altschul, M S Boguski, J S Liu, A F Neuwald, J C Wootton.   

Abstract

A wealth of protein and DNA sequence data is being generated by genome projects and other sequencing efforts. A crucial barrier to deciphering these sequences and understanding the relations among them is the difficulty of detecting subtle local residue patterns common to multiple sequences. Such patterns frequently reflect similar molecular structures and biological properties. A mathematical definition of this "local multiple alignment" problem suitable for full computer automation has been used to develop a new and sensitive algorithm, based on the statistical method of iterative sampling. This algorithm finds an optimized local alignment model for N sequences in N-linear time, requiring only seconds on current workstations, and allows the simultaneous detection and optimization of multiple patterns and pattern repeats. The method is illustrated as applied to helix-turn-helix proteins, lipocalins, and prenyltransferases.

Mesh:

Substances:

Year:  1993        PMID: 8211139     DOI: 10.1126/science.8211139

Source DB:  PubMed          Journal:  Science        ISSN: 0036-8075            Impact factor:   47.728


  425 in total

1.  Discovering regulatory elements in non-coding sequences by analysis of spaced dyads.

Authors:  J van Helden; A F Rios; J Collado-Vides
Journal:  Nucleic Acids Res       Date:  2000-04-15       Impact factor: 16.971

2.  Reevaluation of the determinants of tyrosine sulfation.

Authors:  H B Nicholas; S S Chan; G L Rosenquist
Journal:  Endocrine       Date:  1999-12       Impact factor: 3.633

3.  Building a dictionary for genomes: identification of presumptive regulatory sites by statistical analysis.

Authors:  H J Bussemaker; H Li; E D Siggia
Journal:  Proc Natl Acad Sci U S A       Date:  2000-08-29       Impact factor: 11.205

4.  DbClustal: rapid and reliable global multiple alignments of protein sequences detected by database searches.

Authors:  J D Thompson; F Plewniak; J Thierry; O Poch
Journal:  Nucleic Acids Res       Date:  2000-08-01       Impact factor: 16.971

5.  GeneMarkS: a self-training method for prediction of gene starts in microbial genomes. Implications for finding sequence motifs in regulatory regions.

Authors:  J Besemer; A Lomsadze; M Borodovsky
Journal:  Nucleic Acids Res       Date:  2001-06-15       Impact factor: 16.971

6.  Bayesian haplotype inference for multiple linked single-nucleotide polymorphisms.

Authors:  Tianhua Niu; Zhaohui S Qin; Xiping Xu; Jun S Liu
Journal:  Am J Hum Genet       Date:  2001-11-26       Impact factor: 11.025

7.  PlantCARE, a database of plant cis-acting regulatory elements and a portal to tools for in silico analysis of promoter sequences.

Authors:  Magali Lescot; Patrice Déhais; Gert Thijs; Kathleen Marchal; Yves Moreau; Yves Van de Peer; Pierre Rouzé; Stephane Rombauts
Journal:  Nucleic Acids Res       Date:  2002-01-01       Impact factor: 16.971

8.  The evolution of DNA regulatory regions for proteo-gamma bacteria by interspecies comparisons.

Authors:  Nikolaus Rajewsky; Nicholas D Socci; Martin Zapotocky; Eric D Siggia
Journal:  Genome Res       Date:  2002-02       Impact factor: 9.043

9.  PROSPECT improves cis-acting regulatory element prediction by integrating expression profile data with consensus pattern searches.

Authors:  W Fujibuchi; J S Anderson; D Landsman
Journal:  Nucleic Acids Res       Date:  2001-10-01       Impact factor: 16.971

10.  A computational analysis of sequence features involved in recognition of short introns.

Authors:  L P Lim; C B Burge
Journal:  Proc Natl Acad Sci U S A       Date:  2001-09-25       Impact factor: 11.205

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.