| Literature DB >> 17531801 |
Sudipto Saha1, Jyoti Zack, Balvinder Singh, G P S Raghava.
Abstract
This study describes methods for predicting and classifying voltage-gated ion channels. Firstly, a standard support vector machine (SVM) method was developed for predicting ion channels by using amino acid composition and dipeptide composition, with an accuracy of 82.89% and 85.56%, respectively. The accuracy of this SVM method was improved from 85.56% to 89.11% when combined with PSI-BLAST similarity search. Then we developed an SVM method for classifying ion channels (potassium, sodium, calcium, and chloride) by using dipeptide composition and achieved an overall accuracy of 96.89%. We further achieved a classification accuracy of 97.78% by using a hybrid method that combines dipeptide-based SVM and hidden Markov model methods. A web server VGIchan has been developed for predicting and classifying voltage-gated ion channels using the above approaches.Entities:
Mesh:
Substances:
Year: 2006 PMID: 17531801 PMCID: PMC5054079 DOI: 10.1016/S1672-0229(07)60006-0
Source DB: PubMed Journal: Genomics Proteomics Bioinformatics ISSN: 1672-0229 Impact factor: 7.691
Performance of Various Methods on Prediction of Voltage-Gated Ion Channels
| Method | ACC (%) | MCC | ROC |
|---|---|---|---|
| Amino acid-based SVM (A) | 82.89 | 0.66 | 0.89 |
| Dipeptide-based SVM (B) | 85.56 | 0.71 | 0.93 |
| PSI-BLAST (C) | 84.22 | – | – |
| Hybrid (B+C) | 89.11 | 0.78 | – |
RBF kernel, ᵞ=60; C=100; j=0.1; threshold value=0.3.
RBF kernel, ᵞ=40; C=10; j=1; threshold value=0.4.
E-value=0.01. ACC, Accuracy; MCC, Matthew’s correlation coefficient; ROC, receiver operating characteristic.
Fig. 1The overall performance of the SVM module using amino acid composition and dipeptide composition in predicting voltage-gated ion channels. The ROC plot was obtained between sensitivity (Y-axis) and 1—specificity (X-axis) at different thresholds.
Performance of Various Methods on Classification of Voltage-Gated Ion Channels
| Method | Potassium | Sodium | Calcium | Chloride | Overall ACC (%) | ||||
|---|---|---|---|---|---|---|---|---|---|
| ACC (%) | MCC | ACC (%) | MCC | ACC (%) | MCC | ACC (%) | MCC | ||
| Amino acid-based | 100 | 0.86 | 80.00 | 0.88 | 80.00 | 0.86 | 73.33 | 0.84 | 93.78 |
| SVM (A) | |||||||||
| Dipeptide-based | 100 | 0.95 | 88.00 | 0.91 | 92.00 | 0.93 | 86.67 | 0.91 | 96.89 |
| SVM (B) | |||||||||
| PSI-BLAST | 65.62 | – | 92.00 | – | 76.00 | – | 60.00 | – | 69.33 |
| HMM | 98.12 | – | 96.00 | – | 96.00 | – | 86.17 | – | 96.86 |
| SVM (B) + HMM | 99.38 | 0.96 | 96.00 | 0.93 | 96.00 | 0.98 | 86.67 | 0.92 | 97.78 |
Amino acid composition as input vector; RBF kernel, ᵞ=500; C=10; j=0.1.
Dipeptide composition as input vector; RBF kernel, ᵞ=50; C=10; j=1.
E-value=0.01.
E-value=1. ACC, Accuracy; MCC, Matthew’s correlation coefficient.
Fig. 2Snapshot of the input page of VGIchan server.
Fig. 3Snapshot of the results obtained after the analysis of submission.