| Literature DB >> 24991545 |
Hui Ding1, En-Ze Deng1, Lu-Feng Yuan1, Li Liu2, Hao Lin3, Wei Chen4, Kuo-Chen Chou5.
Abstract
Conotoxins are small disulfide-rich neurotoxic peptides, which can bind to ion channels with very high specificity and modulate their activities. Over the last few decades, conotoxins have been the drug candidates for treating chronic pain, epilepsy, spasticity, and cardiovascular diseases. According to their functions and targets, conotoxins are generally categorized into three types: potassium-channel type, sodium-channel type, and calcium-channel types. With the avalanche of peptide sequences generated in the postgenomic age, it is urgent and challenging to develop an automated method for rapidly and accurately identifying the types of conotoxins based on their sequence information alone. To address this challenge, a new predictor, called iCTX-Type, was developed by incorporating the dipeptide occurrence frequencies of a conotoxin sequence into a 400-D (dimensional) general pseudoamino acid composition, followed by the feature optimization procedure to reduce the sample representation from 400-D to 50-D vector. The overall success rate achieved by iCTX-Type via a rigorous cross-validation was over 91%, outperforming its counterpart (RBF network). Besides, iCTX-Type is so far the only predictor in this area with its web-server available, and hence is particularly useful for most experimental scientists to get their desired results without the need to follow the complicated mathematics involved.Entities:
Mesh:
Substances:
Year: 2014 PMID: 24991545 PMCID: PMC4058692 DOI: 10.1155/2014/286419
Source DB: PubMed Journal: Biomed Res Int Impact factor: 3.411
Figure 1A ribbon drawing to show the human potassium (K) channel. Reproduced from Chou [6] with permission.
Figure 2A ribbon drawing to show the human sodium (Na) channel. Reproduced from Chou [6] with permission.
Figure 3A ribbon drawing to show the calcium (Ca) channel from hepatitis C virus. Reproduced from [4] with permission.
Figure 4A plot to show the IFS curve, where the abscissa and ordinate axis denote the number of features and the overall accuracy, respectively. As shown in the figure, the value of the overall accuracy reached its peak (91.1%) when the top-ranked 50 dipeptide features were taken into account.
List of the 50 optimal features or dipeptides derived according to (7)–(9) as elaborated in the Section 2.3.
| AA | AS | CC | CH | CS | DH | DN | EN | GA | GH |
| GL | GT | GY | HA | HL | HS | IY | KD | KK | KM |
| KP | LN | LV | MC | MY | ND | NQ | NS | PI | QK |
| QT | RC | RD | RF | RN | RT | RW | SC | SG | TE |
| TF | TT | VV | WG | WI | YD | YH | YL | YT | YY |
Comparison of the current method with the one in [7] by the jackknife test on the same benchmark dataset (Supporting Information S1) according to the metrics defined in (11)-(12).
| Method | Number of features counted | ΛK (%) | ΛNa (%) | ΛCa (%) | AA (%) | OA (%) |
|---|---|---|---|---|---|---|
| RBF networka | 70 | 91.7 | 88.4 | 88.9 | 89.7 | 89.3 |
| iCTX-Typeb | 50 | 83.3 | 97.8 | 89.8 | 90.3 | 91.1 |
aSee [7].
bThis paper.
Figure 5A screenshot to show the top page of the iCTX-Type web server. Its website address is http://lin.uestc.edu.cn/server/iCTX-Type.