| Literature DB >> 25388148 |
Abdullah M Khamis1, Magbubah Essack1, Xin Gao1, Vladimir B Bajic1.
Abstract
MOTIVATION: The increased prevalence of multi-drug resistant (MDR) pathogens heightens the need to design new antimicrobial agents. Antimicrobial peptides (AMPs) exhibit broad-spectrum potent activity against MDR pathogens and kills rapidly, thus giving rise to AMPs being recognized as a potential substitute for conventional antibiotics. Designing new AMPs using current in-silico approaches is, however, challenging due to the absence of suitable models, large number of design parameters, testing cycles, production time and cost. To date, AMPs have merely been categorized into families according to their primary sequences, structures and functions. The ability to computationally determine the properties that discriminate AMP families from each other could help in exploring the key characteristics of these families and facilitate the in-silico design of synthetic AMPs.Entities:
Mesh:
Substances:
Year: 2014 PMID: 25388148 PMCID: PMC4380027 DOI: 10.1093/bioinformatics/btu738
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
The number of features characterizing different AMP sequence regions as selected using GA-based optimization of unsupervised k-means clustering
| AMP family/sub-family | NP | NF | NSF | NC | NCF | NPF | ||
|---|---|---|---|---|---|---|---|---|
| Alpha-defensin | 10 | 10 | 34 | 299 | 14 | 14 | 12 | 2 |
| Bacteriocin | 14 | 10 | 24 | 225 | 9 | 12 | 7 | 2 |
| Beta-defensin | 10 | 10 | 41 | 261 | 36 | 14 | 29 | 7 |
| Bombinin | 16 | 10 | 31 | 1095 | 13 | 9 | 4 | 9 |
| Cathelicidin | 16 | 8 | 27 | 521 | 36 | 11 | 17 | 19 |
| Cecropin | 12 | 8 | 30 | 835 | 33 | 11 | 8 | 25 |
| Cyclotide (Bracelet) | 12 | 10 | 12 | 350 | 7 | 14 | 2 | 5 |
| DEFL | 12 | 8 | 37 | 252 | 26 | 14 | 20 | 6 |
| FSAP (Brevinin sub-family) | 14 | 10 | 143 | 1945 | 118 | 12 | 18 | 100 |
| FSAP (Caerin sub-family) | 14 | 10 | 11 | 1946 | 28 | 14 | 1 | 27 |
| FSAP (Dermaseptin) | 16 | 10 | 30 | 327 | 25 | 15 | 16 | 9 |
| Invertebrate def. (Type 1) | 10 | 10 | 21 | 402 | 14 | 14 | 8 | 6 |
| Invertebrate def. (Type 2) | 16 | 8 | 13 | 510 | 9 | 15 | 4 | 5 |
| Type A lantibiotic | 16 | 10 | 11 | 194 | 26 | 14 | 26 | 0 |
| 9162 | 394 | 172 | 222 |
Notes: Annotations of columns are as follows: N-terminal length (dn), C-terminal length (dc), number of peptides (NP), original number of features (NF), number of selected features (NSF), number of clusters (NC), number of compositional features (NCF) and number of physicochemical features (NPF).
The performance of the k-means clustering of 14 target AMP families using features selected by GA
| Target AMP family | Number of features | Accuracy (%) | Sensitivity (%) | Specificity (%) | Precision (%) | |
|---|---|---|---|---|---|---|
| Alpha-defensin | 14 | 99.73 | 94.12 | 100.00 | 100.00 | 96.97 |
| Bacteriocin | 9 | 99.87 | 95.83 | 100.00 | 100.00 | 97.87 |
| Beta-defensin | 36 | 99.60 | 95.12 | 99.86 | 97.50 | 96.30 |
| Bombinin | 13 | 100.00 | 100.00 | 100.00 | 100.00 | 100.00 |
| Cathelicidin | 36 | 98.80 | 88.89 | 99.17 | 80.00 | 84.21 |
| Cecropin | 33 | 100.00 | 100.00 | 100.00 | 100.00 | 100.00 |
| Cyclotide (Bracelet) | 7 | 100.00 | 100.00 | 100.00 | 100.00 | 100.00 |
| DEFL | 26 | 99.47 | 89.19 | 100.00 | 100.00 | 94.29 |
| FSAP (Brevinin) | 118 | 95.88 | 79.02 | 99.84 | 99.12 | 87.94 |
| FSAP (Caerin) | 28 | 100.00 | 100.00 | 100.00 | 100.00 | 100.00 |
| FSAP (Dermaseptin) | 25 | 99.60 | 96.67 | 99.72 | 93.55 | 95.08 |
| Invertebrate def. (Type 1) | 14 | 100.00 | 100.00 | 100.00 | 100.00 | 100.00 |
| Invertebrate def. (Type 2) | 9 | 99.34 | 61.54 | 100.00 | 100.00 | 76.19 |
| Type A lantibiotic | 26 | 99.47 | 100.00 | 99.46 | 73.33 | 84.62 |
| Average | 99.41 | 92.88 | 99.86 | 95.96 | 93.82 |
Note: The values of other measures (Jaccard index, entropy and purity) are shown in the detailed table available in Supplementary Table S2.
Fig. 1.Bar plots of the precision obtained from four different representations of AMPs