Literature DB >> 11827492

Within the twilight zone: a sensitive profile-profile comparison tool based on information theory.

Golan Yona1, Michael Levitt.   

Abstract

This paper presents a novel approach to profile-profile comparison. The method compares two input profiles (like those that are generated by PSI-BLAST) and assigns a similarity score to assess their statistical similarity. Our profile-profile comparison tool, which allows for gaps, can be used to detect weak similarities between protein families. It has also been optimized to produce alignments that are in very good agreement with structural alignments. Tests show that the profile-profile alignments are indeed highly correlated with similarities between secondary structure elements and tertiary structure. Exhaustive evaluations show that our method is significantly more sensitive in detecting distant homologies than the popular profile-based search programs PSI-BLAST and IMPALA. The relative improvement is the same order of magnitude as the improvement of PSI-BLAST relative to BLAST. Our new tool often detects similarities that fall within the twilight zone of sequence similarity. Copyright 2002 Elsevier Science Ltd.

Entities:  

Mesh:

Substances:

Year:  2002        PMID: 11827492     DOI: 10.1006/jmbi.2001.5293

Source DB:  PubMed          Journal:  J Mol Biol        ISSN: 0022-2836            Impact factor:   5.469


  107 in total

1.  More for less in structural genomics.

Authors:  A Heger; L Holm
Journal:  J Struct Funct Genomics       Date:  2003

2.  The SUPERFAMILY database in 2004: additions and improvements.

Authors:  Martin Madera; Christine Vogel; Sarah K Kummerfeld; Cyrus Chothia; Julian Gough
Journal:  Nucleic Acids Res       Date:  2004-01-01       Impact factor: 16.971

3.  Finding weak similarities between proteins by sequence profile comparison.

Authors:  Anna R Panchenko
Journal:  Nucleic Acids Res       Date:  2003-01-15       Impact factor: 16.971

4.  ORFeus: Detection of distant homology using sequence profiles and predicted secondary structure.

Authors:  Krzysztof Ginalski; Jakub Pas; Lucjan S Wyrwicz; Marcin von Grotthuss; Janusz M Bujnicki; Leszek Rychlewski
Journal:  Nucleic Acids Res       Date:  2003-07-01       Impact factor: 16.971

5.  Three monophyletic superfamilies account for the majority of the known glycosyltransferases.

Authors:  Jing Liu; Arcady Mushegian
Journal:  Protein Sci       Date:  2003-07       Impact factor: 6.725

6.  Scoring profile-to-profile sequence alignments.

Authors:  Guoli Wang; Roland L Dunbrack
Journal:  Protein Sci       Date:  2004-06       Impact factor: 6.725

7.  Frequency of gaps observed in a structurally aligned protein pair database suggests a simple gap penalty function.

Authors:  Nalin C W Goonesekere; Byungkook Lee
Journal:  Nucleic Acids Res       Date:  2004-05-20       Impact factor: 16.971

8.  Detecting distant homology with Meta-BASIC.

Authors:  Krzysztof Ginalski; Marcin von Grotthuss; Nick V Grishin; Leszek Rychlewski
Journal:  Nucleic Acids Res       Date:  2004-07-01       Impact factor: 16.971

9.  Wurst: a protein threading server with a structural scoring function, sequence profiles and optimized substitution matrices.

Authors:  Andrew E Torda; James B Procter; Thomas Huber
Journal:  Nucleic Acids Res       Date:  2004-07-01       Impact factor: 16.971

10.  Alignment of protein sequences by their profiles.

Authors:  Marc A Marti-Renom; M S Madhusudhan; Andrej Sali
Journal:  Protein Sci       Date:  2004-04       Impact factor: 6.725

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.