| Literature DB >> 28497044 |
Wang Xianfang1, Wang Junmei1, Wang Xiaolei1, Zhang Yue1.
Abstract
The conotoxin proteins are disulfide-rich small peptides. Predicting the types of ion channel-targeted conotoxins has great value in the treatment of chronic diseases, epilepsy, and cardiovascular diseases. To solve the problem of information redundancy existing when using current methods, a new model is presented to predict the types of ion channel-targeted conotoxins based on AVC (Analysis of Variance and Correlation) and SVM (Support Vector Machine). First, the F value is used to measure the significance level of the feature for the result, and the attribute with smaller F value is filtered by rough selection. Secondly, redundancy degree is calculated by Pearson Correlation Coefficient. And the threshold is set to filter attributes with weak independence to get the result of the refinement. Finally, SVM is used to predict the types of ion channel-targeted conotoxins. The experimental results show the proposed AVC-SVM model reaches an overall accuracy of 91.98%, an average accuracy of 92.17%, and the total number of parameters of 68. The proposed model provides highly useful information for further experimental research. The prediction model will be accessed free of charge at our web server.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28497044 PMCID: PMC5401747 DOI: 10.1155/2017/2929807
Source DB: PubMed Journal: Biomed Res Int Impact factor: 3.411
Figure 1Transferring the raw protein sequence to 400 features.
Figure 2The flow chart for prediction of ion channel types of conotoxins by AVC-SVM model.
Figure 3Scatter plot of F values for all dipeptides before feature selection.
Figure 4Scatter plot of the F value distribution for the portion dipeptides after rough selection.
Figure 5Scatter plot of F values for the portion dipeptides after correlation analysis.
Results of comparison of different feature selection methods.
| Methods | SnK (%) | SnCa (%) | SnNa (%) | AA (%) | OA (%) |
|---|---|---|---|---|---|
| AVC-SVM | 93.14 | 89.21 | 94.17 | 92.17 | 91.98 |
| ANOVA-SVM | 89.28 | 92.54 | 87.79 | 89.87 | 89.25 |
| BiDi-SVM [ | 83.3 | 83.7 | 93.3 | 86.8 | 87.5 |
| ReliefF-SVM | 87.11 | 85.55 | 76.61 | 83.08 | 82.25 |
| ReCorre-SVM | 78.67 | 73.38 | 82.62 | 78.22 | 77.71 |
Results of efficiency comparison using different feature selection methods.
| Methods | Running time (s) | Dimensions |
|---|---|---|
| AVC-SVM | 0.085 | 68 |
| ANOVA-SVM | 9.350 | 163 |
| BiDi-SVM | 11.939 | 167 |
| ReliefF-SVM | 9.478 | 304 |
| ReCorre-SVM | 7.547 | 99 |
Results of comparison using different prediction algorithms.
| Methods | SnK (%) | SnCa (%) | SnNa (%) | AA (%) | OA (%) |
|---|---|---|---|---|---|
| AVC-SVM | 93.14 | 89.21 | 94.17 | 92.17 | 91.98 |
| AVC-Bayes | 66.67 | 88.89 | 81.82 | 79.12 | 82.61 |
| AVC-ELM | 59.05 | 79.00 | 90.22 | 76.09 | 78.70 |
| AVC-RF | 75.95 | 79.27 | 79.33 | 78.19 | 76.80 |
| AVC-RBF | 64.67 | 59.91 | 73.59 | 66.05 | 66.09 |
Results of comparison using different models.
| Methods | SnK (%) | SnCa (%) | SnNa (%) | AA (%) | OA (%) | Dimensions | Running time (s) |
|---|---|---|---|---|---|---|---|
| AVC-SVM | 93.1 | 89.2 | 94.2 | 92.2 | 92.0 | 68 | 0.085 |
| BiDi-RBF [ | 91.7 | 88.4 | 88.9 | 89.7 | 89.3 | 70 | 11.258 |
| iCTX-Type [ | 83.3 | 97.8 | 89.8 | 90.3 | 91.1 | 50 | 8.743 |
|
| 91.7 | 95.3 | 95.6 | 94.2 | 94.6 | 180 | 10.594 |