| Literature DB >> 26713618 |
Yongchun Zuo1, Yang Lv1, Zhuying Wei1, Lei Yang2, Guangpeng Li1, Guoliang Fan3.
Abstract
Defensins as one of the most abundant classes of antimicrobial peptides are an essential part of the innate immunity that has evolved in most living organisms from lower organisms to humans. To identify specific defensins as interesting antifungal leads, in this study, we constructed a more rigorous benchmark dataset and the iDPF-PseRAAAC server was developed to predict the defensin family and subfamily. Using reduced dipeptide compositions were used, the overall accuracy of proposed method increased to 95.10% for the defensin family, and 98.39% for the vertebrate subfamily, which is higher than the accuracy from other methods. The jackknife test shows that more than 4% improvement was obtained comparing with the previous method. A free online server was further established for the convenience of most experimental scientists at http://wlxy.imu.edu.cn/college/biostation/fuwu/iDPF-PseRAAAC/index.asp. A friendly guide is provided to describe how to use the web server. We anticipate that iDPF-PseRAAAC may become a useful high-throughput tool for both basic research and drug design.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26713618 PMCID: PMC4694767 DOI: 10.1371/journal.pone.0145541
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Violin plots show the length distribution of five defensin families.
Fig 2The predictive overall accuracy of defensins families based on different N-peptide composition with S size alphabet (N, S).
Fig 3The heatmap shows the adjacent correlation of 13 reduced amino acids for five different defensin families.
Results obtained by iDPF-PseRAAAC in identifying defensin peptide families with dipeptide case(N = 2).
|
| ||||||||
|---|---|---|---|---|---|---|---|---|
| Family | Subset | Metrics | (2, 20) | (2, 13) | (2, 11) | (2, 9) | (2, 8) | (2, 5) |
| 400 |
| 121 | 81 | 64 | 25 | |||
| Insect |
| Sn(%) | 88.33 |
| 83.33 | 76.67 | 81.67 | 58.33 |
| Sp(%) | 98.53 |
| 96.34 | 95.60 | 96.70 | 91.94 | ||
| MCC | 0.89 |
| 0.80 | 0.73 | 0.79 | 0.51 | ||
| Invertebrate |
| Sn(%) | 64.71 |
| 61.76 | 64.71 | 55.88 | 38.24 |
| Sp(%) | 96.32 |
| 96.99 | 95.32 | 97.66 | 95.99 | ||
| MCC | 0.62 |
| 0.62 | 0.59 | 0.60 | 0.39 | ||
| Plant |
| Sn(%) | 83.33 |
| 90.48 | 80.95 | 64.29 | 57.14 |
| Sp(%) | 99.66 |
| 97.94 | 98.63 | 98.63 | 97.25 | ||
| MCC | 0.89 |
| 0.87 | 0.83 | 0.72 | 0.61 | ||
| Unclassified |
| Sn(%) | 45.00 |
| 47.50 | 42.50 | 17.50 | 17.50 |
| Sp(%) | 96.59 |
| 95.90 | 95.90 | 96.93 | 95.90 | ||
| MCC | 0.49 |
| 0.49 | 0.44 | 0.22 | 0.19 | ||
| Vertebrate |
| Sn(%) | 98.09 |
| 97.45 | 93.63 | 96.82 | 88.54 |
| Sp(%) | 85.80 |
| 91.48 | 85.80 | 71.59 | 65.34 | ||
| MCC | 0.84 |
| 0.89 | 0.79 | 0.70 | 0.55 | ||
| OA(%) | 84.68 |
| 84.38 | 79.88 | 76.28 | 65.47 | ||
The bold values show the best results.
The prediction results for vertebrate subfamilies based on 2-peptide composition of 13 reduced amino acids (N = 2, S = 13).
| Subfamily | Alpha-type | Beta-type | Theta-type | Sn(%) | Sp(%) | MCC |
|---|---|---|---|---|---|---|
| Alpha-type | 69 | 3 | 0 | 95.83 | 100 | 0.97 |
| Beta-type | 0 | 172 | 0 | 100 | 94.81 | 0.96 |
| Theta-type | 0 | 1 | 4 | 80 | 100 | 0.89 |
| Overall accuracy(%) | 98.39 | |||||
Fig 4Comparing the performance of the proposed method with our previous methods.
A: indicated the prediction results of defensin family; B: indicated the prediction results of vertebrate defensin subfamily.
Fig 5A semi-screenshot to show the top page of the iDPF-PseRAAAC web-server.