Literature DB >> 8831796

Multiple sequence information for threading algorithms.

T R Defay1, F E Cohen.   

Abstract

Threading algorithms attempt to solve the inverse protein folding problem: given a group of structures and a sequence, identify the structure that is most compatible with this sequence. A recent study of this class of algorithms by S. J. Wodak and colleagues suggests that while threading algorithms are capable of recognizing many folding motifs, their performance in truly blind predictions is disappointing, and the underlying alignments upon which the selections are based are frequently errant. To help overcome this problem we have developed a Test of Optimal Mutagenesis algorithm (TOM) that exploits information inherent in the variation between several homologues in a multiple sequence alignment. This information is used to help select the correct structural motif for the sequence from a database of known structures. A total of 305 high-resolution structures were selected to represent the set of known folds; 56 proteins were chosen that had at least one close structural match in this set. To test TOM, we attempted to determine which of the 305 folds was a match to each of the 56 protein sequences. TOM correctly predicts a close structural match for 45% of these proteins. THREADER, an algorithm chosen as a literature standard, correctly matched 20% of the test set. By comparing the performance of TOM, THREADER, and TOM NOVAR (a version of TOM without variability information), we conclude that the tendency of an amino acid to be buried or exposed is the dominant determinant of the success of threading algorithms. In addition, the structural alignments produced by TOM suggest that the exact alignment of just 30 to 50% of the residues in a sequence with the correct fold is necessary to select it as the highest scoring match in a set of folds.

Mesh:

Year:  1996        PMID: 8831796     DOI: 10.1006/jmbi.1996.0515

Source DB:  PubMed          Journal:  J Mol Biol        ISSN: 0022-2836            Impact factor:   5.469


  7 in total

1.  Feasibility in the inverse protein folding protocol.

Authors:  M Ota; K Nishikawa
Journal:  Protein Sci       Date:  1999-05       Impact factor: 6.725

2.  The directional atomic solvation energy: an atom-based potential for the assignment of protein sequences to known folds.

Authors:  Parag Mallick; Robert Weiss; David Eisenberg
Journal:  Proc Natl Acad Sci U S A       Date:  2002-12-02       Impact factor: 11.205

3.  Protein fold recognition without Boltzmann statistics or explicit physical basis.

Authors:  T Huber; A E Torda
Journal:  Protein Sci       Date:  1998-01       Impact factor: 6.725

4.  Analysis of the "thermodynamic information content" of a Homo sapiens structural database reveals hierarchical thermodynamic organization.

Authors:  Scott A Larson; Vincent J Hilser
Journal:  Protein Sci       Date:  2004-07       Impact factor: 6.725

5.  JEvTrace: refinement and variations of the evolutionary trace in JAVA.

Authors:  Marcin P Joachimiak; Fred E Cohen
Journal:  Genome Biol       Date:  2002-11-26       Impact factor: 13.583

6.  An approach to large scale identification of non-obvious structural similarities between proteins.

Authors:  Artem Cherkasov; Steven J M Jones
Journal:  BMC Bioinformatics       Date:  2004-05-17       Impact factor: 3.169

7.  Structural characterization of genomes by large scale sequence-structure threading.

Authors:  Artem Cherkasov; Steven J M Jones
Journal:  BMC Bioinformatics       Date:  2004-04-03       Impact factor: 3.169

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.