Literature DB >> 9485502

Fold assignments for amino acid sequences of the CASP2 experiment.

D W Rice1, D Fischer, R Weiss, D Eisenberg.   

Abstract

New and newly extended methods for fold assignment were tested for their abilities to assign folds to amino acid target sequences of unknown three-dimensional structure. These target sequences, released through the CASP2 experiment, are not obviously related to any sequence of known three-dimensional (3D) structure. We assigned 3D folds to target sequences and filed these predictions with CASP2 before their 3D structures were released. The methods tested were (1) Environmental 3D profiles of Bowie and colleagues [Bowie, J.U., Luthy, R., Eisenberg, D. Science 253:164-170, 1991]; (2) A variation of this is termed Directional Profiles; (3) The H3P2 five-dimensional sequence-structure substitution matrix of Rice and Eisenberg [Rice, D., Eisenberg, D.J. Mol. Biol. 267:1026-1037, 1997]; and (4) The Sequence Derived Property methods of Fischer and Eisenberg [Fischer, D., Eisenberg, D. Prot. Sci. 5:947-955, 1996]. When the 3D structures of the sequences were released, 17 of our predictions were evaluated. Of these 17, we assigned high probabilities to 11, of which 9 were correct. Five of these correct predictions were of known 3D structures similar to the targets and four of these were of new folds. The evaluation demonstrated that our methods were effective in assigning the proper fold to more than half of the CASP2 targets with known folds (5/9) and also were able to detect half of the sequences that corresponded to no known folds (4/8). Even when the correct fold is assigned to a sequence, proper alignment of the sequence to the structure remains a challenge. Our methods were able to produce accurate alignments (< 1.2 mean residue shift error from the structural alignment) for four of the targets, including the particularly difficult alignment (only 7% residue identity in the structurally aligned regions) of the ferrochelatase sequence to the fold of a periplasmic binding protein.

Entities:  

Mesh:

Substances:

Year:  1997        PMID: 9485502     DOI: 10.1002/(sici)1097-0134(1997)1+<113::aid-prot15>3.3.co;2-3

Source DB:  PubMed          Journal:  Proteins        ISSN: 0887-3585


  4 in total

1.  Factors limiting the performance of prediction-based fold recognition methods.

Authors:  X de la Cruz; J M Thornton
Journal:  Protein Sci       Date:  1999-04       Impact factor: 6.725

2.  Use of residue pairs in protein sequence-sequence and sequence-structure alignments.

Authors:  J Jung; B Lee
Journal:  Protein Sci       Date:  2000-08       Impact factor: 6.725

Review 3.  Structural genomics: computational methods for structure analysis.

Authors:  Sharon Goldsmith-Fischman; Barry Honig
Journal:  Protein Sci       Date:  2003-09       Impact factor: 6.725

4.  Assigning folds to the proteins encoded by the genome of Mycoplasma genitalium.

Authors:  D Fischer; D Eisenberg
Journal:  Proc Natl Acad Sci U S A       Date:  1997-10-28       Impact factor: 11.205

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.