Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Multiple sequence information for threading algorithms.

Literature DB >> 8831796

Multiple sequence information for threading algorithms.

Abstract

Threading algorithms attempt to solve the inverse protein folding problem: given a group of structures and a sequence, identify the structure that is most compatible with this sequence. A recent study of this class of algorithms by S. J. Wodak and colleagues suggests that while threading algorithms are capable of recognizing many folding motifs, their performance in truly blind predictions is disappointing, and the underlying alignments upon which the selections are based are frequently errant. To help overcome this problem we have developed a Test of Optimal Mutagenesis algorithm (TOM) that exploits information inherent in the variation between several homologues in a multiple sequence alignment. This information is used to help select the correct structural motif for the sequence from a database of known structures. A total of 305 high-resolution structures were selected to represent the set of known folds; 56 proteins were chosen that had at least one close structural match in this set. To test TOM, we attempted to determine which of the 305 folds was a match to each of the 56 protein sequences. TOM correctly predicts a close structural match for 45% of these proteins. THREADER, an algorithm chosen as a literature standard, correctly matched 20% of the test set. By comparing the performance of TOM, THREADER, and TOM NOVAR (a version of TOM without variability information), we conclude that the tendency of an amino acid to be buried or exposed is the dominant determinant of the success of threading algorithms. In addition, the structural alignments produced by TOM suggest that the exact alignment of just 30 to 50% of the residues in a sequence with the correct fold is necessary to select it as the highest scoring match in a set of folds.

Mesh：

Year: 1996 PMID： 8831796 DOI： 10.1006/jmbi.1996.0515

Source DB: PubMed Journal: J Mol Biol ISSN： 0022-2836 Impact factor: 5.469

Keyword Cloud
Cited

7 in total

1. Feasibility in the inverse protein folding protocol.

Authors: M Ota; K Nishikawa
Journal: Protein Sci Date: 1999-05 Impact factor: 6.725

2. The directional atomic solvation energy: an atom-based potential for the assignment of protein sequences to known folds.

Authors: Parag Mallick; Robert Weiss; David Eisenberg
Journal: Proc Natl Acad Sci U S A Date: 2002-12-02 Impact factor: 11.205

3. Protein fold recognition without Boltzmann statistics or explicit physical basis.

Authors: T Huber; A E Torda
Journal: Protein Sci Date: 1998-01 Impact factor: 6.725

4. Analysis of the "thermodynamic information content" of a Homo sapiens structural database reveals hierarchical thermodynamic organization.

Authors: Scott A Larson; Vincent J Hilser
Journal: Protein Sci Date: 2004-07 Impact factor: 6.725

Multiple sequence information for threading algorithms.

1. Feasibility in the inverse protein folding protocol.

2. The directional atomic solvation energy: an atom-based potential for the assignment of protein sequences to known folds.

3. Protein fold recognition without Boltzmann statistics or explicit physical basis.

4. Analysis of the "thermodynamic information content" of a Homo sapiens structural database reveals hierarchical thermodynamic organization.

5. JEvTrace: refinement and variations of the evolutionary trace in JAVA.

6. An approach to large scale identification of non-obvious structural similarities between proteins.

7. Structural characterization of genomes by large scale sequence-structure threading.