| Literature DB >> 18267306 |
S Muthukrishnan1, Aarti Garg, G P S Raghava.
Abstract
This study describes a method for predicting and classifying oxygen-binding proteins. Firstly, support vector machine (SVM) modules were developed using amino acid composition and dipeptide composition for predicting oxygen-binding proteins, and achieved maximum accuracy of 85.5% and 87.8%, respectively. Secondly, an SVM module was developed based on amino acid composition, classifying the predicted oxygen-binding proteins into six classes with accuracy of 95.8%, 97.5%, 97.5%, 96.9%, 99.4%, and 96.0% for erythrocruorin, hemerythrin, hemocyanin, hemoglobin, leghemoglobin, and myoglobin proteins, respectively. Finally, an SVM module was developed using dipeptide composition for classifying the oxygen-binding proteins, and achieved maximum accuracy of 96.1%, 98.7%, 98.7%, 85.6%, 99.6%, and 93.3% for the above six classes, respectively. All modules were trained and tested by five-fold cross validation. Based on the above approach, a web server Oxypred was developed for predicting and classifying oxygen-binding proteins (available from http://www.imtech.res.in/raghava/oxypred/).Entities:
Mesh:
Substances:
Year: 2007 PMID: 18267306 PMCID: PMC5054225 DOI: 10.1016/S1672-0229(08)60012-1
Source DB: PubMed Journal: Genomics Proteomics Bioinformatics ISSN: 1672-0229 Impact factor: 7.691
Fig. 1Average (AVG) amino acid composition of six different classes of oxygen-binding proteins. Amino acids are denoted by their single letter codes.
Performance of SVM modules for classifying oxygen-binding proteins
| Protein class | Accuracy (%) | |
|---|---|---|
| Amino acid composition | Dipeptide composition | |
| Erythrocruorin | 95.8 | 96.1 |
| Hemerythrin | 97.5 | 98.7 |
| Hemocyanin | 97.5 | 98.7 |
| Hemoglobin | 96.9 | 85.6 |
| Leghemoglobin | 99.4 | 99.6 |
| Myoglobin | 96.0 | 93.3 |
| Average | 97.2 | 95.3 |