Cross-validation of protein structural class prediction using statistical clustering and neural networks.

B A Metfessel, P N Saurugger, D P Connelly, S S Rich.

Abstract

We present an approach to predicting protein structural class that uses amino acid composition and hydrophobic pattern frequency information as input to two types of neural networks: (1) a three-layer back-propagation network and (2) a learning vector quantization network. The results of these methods are compared to those obtained from a modified Euclidean statistical clustering algorithm. The protein sequence data used to drive these algorithms consist of the normalized frequency of up to 20 amino acid types and six hydrophobic amino acid patterns. From these frequency values the structural class predictions for each protein (all-alpha, all-beta, or alpha-beta classes) are derived. Examples consisting of 64 previously classified proteins were randomly divided into multiple training (56 proteins) and test (8 proteins) sets. The best performing algorithm on the test sets was the learning vector quantization network using 17 inputs, obtaining a prediction accuracy of 80.2%. The Matthews correlation coefficients are statistically significant for all algorithms and all structural classes. The differences between algorithms are in general not statistically significant. These results show that information exists in protein primary sequences that is easily obtainable and useful for the prediction of protein structural class by neural networks as well as by standard statistical clustering algorithms.
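The evaluation pipeline described in the abstract can be illustrated with a minimal Python sketch. It shows three pieces under stated assumptions: computing normalized amino acid frequencies for a sequence, randomly dividing 64 examples into 56 training and 8 test items, and computing a Matthews correlation coefficient for one class against the rest. The function names (`composition`, `random_split`, `matthews_cc`) are hypothetical and not taken from the paper. The six hydrophobic pattern features and the classifiers themselves are omitted because the abstract does not specify them.

```python
import math
import random
from collections import Counter

# The 20 standard amino acids as one-letter codes (assumed alphabet).
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def composition(sequence):
    """Return the normalized frequency of each of the 20 amino acids."""
    counts = Counter(ch for ch in sequence.upper() if ch in AMINO_ACIDS)
    total = sum(counts.values())
    if total == 0:
        raise ValueError("sequence contains no standard amino acids")
    return [counts[aa] / total for aa in AMINO_ACIDS]


def random_split(items, n_test, rng):
    """Randomly divide items into (training, test) lists."""
    shuffled = list(items)
    rng.shuffle(shuffled)
    return shuffled[n_test:], shuffled[:n_test]


def matthews_cc(true_labels, predicted_labels, positive):
    """Matthews correlation coefficient for one class versus all others."""
    tp = tn = fp = fn = 0
    for truth, guess in zip(true_labels, predicted_labels):
        if guess == positive:
            if truth == positive:
                tp += 1
            else:
                fp += 1
        else:
            if truth == positive:
                fn += 1
            else:
                tn += 1
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    # Convention assumed here: report 0.0 when the coefficient is undefined.
    return 0.0 if denom == 0 else (tp * tn - fp * fn) / denom


if __name__ == "__main__":
    # Placeholder indices stand in for the 64 classified proteins.
    train, test = random_split(range(64), n_test=8, rng=random.Random(0))
    print(len(train), len(test))
```

Repeating `random_split` with different seeds would give the multiple training and test sets the abstract mentions; how many repetitions the authors used is not stated there.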

Substances: Proteins

Year: 1993 PMID: 8358300 PMCID: PMC2142422 DOI: 10.1002/pro.5560020712

Source DB: PubMed Journal: Protein Sci ISSN: 0961-8368 Impact factor: 6.725

