Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 SnapDRAGON: a method to delineate protein structural domains from sequence data.

Literature DB >> 11866536

SnapDRAGON: a method to delineate protein structural domains from sequence data.

Abstract

We describe a method to identify protein domain boundaries from sequence information alone based on the assumption that hydrophobic residues cluster together in space. SnapDRAGON is a suite of programs developed to predict domain boundaries based on the consistency observed in a set of alternative ab initio three-dimensional (3D) models generated for a given protein multiple sequence alignment. This is achieved by running a distance geometry-based folding technique in conjunction with a 3D-domain assignment algorithm. The overall accuracy of our method in predicting the number of domains for a non-redundant data set of 414 multiple alignments, representing 185 single and 231 multiple-domain proteins, is 72.4 %. Using domain linker regions observed in the tertiary structures associated with each query alignment as the standard of truth, inter-domain boundary positions are delineated with an accuracy of 63.9 % for proteins comprising continuous domains only, and 35.4 % for proteins with discontinuous domains. Overall, domain boundaries are delineated with an accuracy of 51.8 %. The prediction accuracy values are independent of the pair-wise sequence similarities within each of the alignments. These results demonstrate the capability of our method to delineate domains in protein sequences associated with a wide variety of structural domain organisation. Copyright 2002 Elsevier Science Ltd.

Entities: Species

Mesh：

Substances：
Proteins

Year: 2002 PMID： 11866536 DOI： 10.1006/jmbi.2001.5387

Source DB: PubMed Journal: J Mol Biol ISSN： 0022-2836 Impact factor: 5.469

Keyword Cloud
Cited

36 in total

SnapDRAGON: a method to delineate protein structural domains from sequence data.

1. Characteristics and prediction of domain linker sequences in multi-domain proteins.

2. Rapid protein domain assignment from amino acid sequence using predicted secondary structure.

3. Sequence-based prediction of protein domains.

4. Bayesian data mining of protein domains gives an efficient predictive algorithm and new insight.

5. Prediction of protein domain boundaries from sequence alone.

6. A topological algorithm for identification of structural domains of proteins.

7. HangOut: generating clean PSI-BLAST profiles for domains with long insertions.

Review 8. Molecular physiology of SPAK and OSR1: two Ste20-related protein kinases regulating ion transport.

9. A modular kernel approach for integrative analysis of protein domain boundaries.

10. Ab initio and homology based prediction of protein domains by recursive neural networks.