| Literature DB >> 28672838 |
Fu-Ying Dao1, Hui Yang2, Zhen-Dong Su3, Wuritu Yang4,5, Yun Wu6, Ding Hui7, Wei Chen8,9, Hua Tang10, Hao Lin11.
Abstract
Conotoxins are disulfide-rich small peptides, which are invaluable peptides that target ion channel and neuronal receptors. Conotoxins have been demonstrated as potent pharmaceuticals in the treatment of a series of diseases, such as Alzheimer's disease, Parkinson's disease, and epilepsy. In addition, conotoxins are also ideal molecular templates for the development of new drug lead compounds and play important roles in neurobiological research as well. Thus, the accurate identification of conotoxin types will provide key clues for the biological research and clinical medicine. Generally, conotoxin types are confirmed when their sequence, structure, and function are experimentally validated. However, it is time-consuming and costly to acquire the structure and function information by using biochemical experiments. Therefore, it is important to develop computational tools for efficiently and effectively recognizing conotoxin types based on sequence information. In this work, we reviewed the current progress in computational identification of conotoxins in the following aspects: (i) construction of benchmark dataset; (ii) strategies for extracting sequence features; (iii) feature selection techniques; (iv) machine learning methods for classifying conotoxins; (v) the results obtained by these methods and the published tools; and (vi) future perspectives on conotoxin classification. The paper provides the basis for in-depth study of conotoxins and drug therapy research.Entities:
Keywords: conotoxin; ion channel; machine learning method; superfamily
Mesh:
Substances:
Year: 2017 PMID: 28672838 PMCID: PMC6152242 DOI: 10.3390/molecules22071057
Source DB: PubMed Journal: Molecules ISSN: 1420-3049 Impact factor: 4.411
Figure 1A structural schematic illustration to show the classification of conotoxins in superfamily and ion channel-target. Sixteen major conotoxin superfamilies are A, D, I1, I2, I3, J, L, M, O1, O2, O3, P, S, T, V, and Y. They are also categorized into calcium channel-targeted, sodium channel-targeted, and potassium channel-targeted conotoxins according to their functions.
Figure 2The process framework of conotoxin classification with machine learning methods.
The benchmark datasets of conotoxin superfamily and ion channel-targeted conotoxin.
| S1 | 25 | 13 | 16 | 17 | 116 | [ |
| S2 | 63 | 48 | 95 | 55 | 216 | [ |
| I1 | 24 | 43 | 45 | 112 | [ | |
| I2 | 26 | 49 | 70 | 145 | [ | |
A list of published results for conotoxin superfamilies and ion channel-targeted conotoxin classifications.
| S1 | Multi-class SVMs | 0.840 | 0.923 | 0.869 | 0.941 | 0.893 | 0.881 | [ |
| IDQD | 0.960 | 0.923 | 0.820 | 0.940 | 0.911 | 0.883 | [ | |
| SVM-Freescore | 0.960 | 0.984 | 0.984 | 1 | 0.982 | 0.974 | [ | |
| Toxin-AAM | 0.957 | 0.966 | 0.891 | 0.966 | 0.945 | 0.966 | [ | |
| S2 | PredCFS | 0.960 | 0.984 | 0.984 | 1 | 0.982 | 0.903 | [ |
| dHKNN | 0.957 | 0.966 | 0.891 | 0.966 | 0.945 | 0.919 | [ | |
| I1 | RBF network | 0.917 | 0.884 | 0.889 | 0.897 | 0.893 | [ | |
| iCTX-Type | 0.833 | 0.978 | 0.898 | 0.903 | 0.911 | [ | ||
| Fscore-SVM | 0.917 | 0.953 | 0.953 | 0.942 | 0.946 | [ | ||
| AVC-SVM | 0.931 | 0.942 | 0.892 | 0.922 | 0.920 | [ | ||
| I2 | ICTCPred | 1 | 0.919 | 1 | 0.973 | 0.957 | [ | |
A list of the published prediction tools for conotoxin classification.
| Name | Prediction Type | URL | Reference |
|---|---|---|---|
| PredCSF | Superfamily | [ | |
| ConoDictor | Superfamily | [ | |
| iCTX-Type | ion channel-target | [ |