Literature DB >> 11254392

Pairwise sequence alignment below the twilight zone.

J D Blake1, F E Cohen.   

Abstract

Improved sequence alignment at low pairwise identity is important for identifying potential remote homologues in database searches and for obtaining accurate alignments as a prelude to modeling structures by homology. Our work is motivated by two observations: structural data provide superior training examples for developing techniques to improve the alignment of remote homologues; and general substitution patterns for remote homologues differ from those of closely related proteins. We introduce a new set of amino acid residue interchange matrices built from structural superposition data. These matrices exploit known structural homology as a means of characterizing the effect evolution has on residue-substitution profiles. Given their origin, it is not surprising that the individual residue-residue interchange frequencies are chemically sensible. The structural interchange matrices show a significant increase both in pairwise alignment accuracy and in functional annotation/fold recognition accuracy across distantly related sequences. We demonstrate improved pairwise alignment by using superpositions of homologous domains extracted from a structural database as a gold standard and go on to show an increase in fold recognition accuracy using a database of homologous fold families. This was applied to the unassigned open reading frames from the genome of Helicobacter pylori to identify five matches, two of which are not represented by new annotations in the sequence databases. In addition, we describe a new cyclic permutation strategy to identify distant homologues that experienced gene duplication and subsequent deletions. Using this method, we have identified a potential homologue to one additional previously unassigned open reading frame from the H. pylori genome. Copyright 2001 Academic Press.

Entities:  

Mesh:

Substances:

Year:  2001        PMID: 11254392     DOI: 10.1006/jmbi.2001.4495

Source DB:  PubMed          Journal:  J Mol Biol        ISSN: 0022-2836            Impact factor:   5.469


  41 in total

1.  Persistently conserved positions in structurally similar, sequence dissimilar proteins: roles in preserving protein fold and function.

Authors:  Iddo Friedberg; Hanah Margalit
Journal:  Protein Sci       Date:  2002-02       Impact factor: 6.725

2.  Detection of homologous proteins by an intermediate sequence search.

Authors:  Bino John; Andrej Sali
Journal:  Protein Sci       Date:  2004-01       Impact factor: 6.725

Review 3.  On the evolution of structure in aminoacyl-tRNA synthetases.

Authors:  Patrick O'Donoghue; Zaida Luthey-Schulten
Journal:  Microbiol Mol Biol Rev       Date:  2003-12       Impact factor: 11.056

4.  Sequence conserved for subcellular localization.

Authors:  Rajesh Nair; Burkhard Rost
Journal:  Protein Sci       Date:  2002-12       Impact factor: 6.725

Review 5.  Structural genomics: computational methods for structure analysis.

Authors:  Sharon Goldsmith-Fischman; Barry Honig
Journal:  Protein Sci       Date:  2003-09       Impact factor: 6.725

6.  Alignment of protein sequences by their profiles.

Authors:  Marc A Marti-Renom; M S Madhusudhan; Andrej Sali
Journal:  Protein Sci       Date:  2004-04       Impact factor: 6.725

7.  Automatic generation and evaluation of sparse protein signatures for families of protein structural domains.

Authors:  Matthew J Blades; Jon C Ison; Ranjeeva Ranasinghe; John B C Findlay
Journal:  Protein Sci       Date:  2005-01       Impact factor: 6.725

8.  Fold recognition by combining sequence profiles derived from evolution and from depth-dependent structural alignment of fragments.

Authors:  Hongyi Zhou; Yaoqi Zhou
Journal:  Proteins       Date:  2005-02-01

9.  An information theoretic approach to macromolecular modeling: I. Sequence alignments.

Authors:  Tiba Aynechi; Irwin D Kuntz
Journal:  Biophys J       Date:  2005-11       Impact factor: 4.033

Review 10.  Advances in homology protein structure modeling.

Authors:  Zhexin Xiang
Journal:  Curr Protein Pept Sci       Date:  2006-06       Impact factor: 3.272

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.