| Literature DB >> 24244550 |
Anastasia Niarchou1, Anastasia Alexandridou, Emmanouil Athanasiadis, George Spyrou.
Abstract
BACKGROUND: Antimicrobial peptides are a promising alternative to conventional antibiotics. Plants are an important source of such peptides; their pharmacological properties are known since antiquity. Access to relevant information, however, is not straightforward, as there are practically no major repositories of experimentally validated and/or predicted plant antimicrobial peptides. PhytAMP is the only database dedicated to plant peptides with confirmed antimicrobial action, holding 273 entries. Data on such peptides can be otherwise retrieved from generic repositories. DESCRIPTION: We present C-PAmP, a database of computationally predicted plant antimicrobial peptides. C-PAmP contains 15,174,905 peptides, 5-100 amino acids long, derived from 33,877 proteins of 2,112 plant species in UniProtKB/Swiss-Prot. Its web interface allows queries based on peptide/protein sequence, protein accession number and species. Users can view the corresponding predicted peptides along with their probability score, their classification according to the Collection of Anti-Microbial Peptides (CAMP), and their PhytAMP id where applicable. Moreover, users can visualise protein regions with a high concentration of predicted antimicrobial peptides. In order to identify potential antimicrobial peptides we used a classification algorithm, based on a modified version of the pseudo amino acid concept. The classifier tested all subsequences ranging from 5 to 100 amino acids of the plant proteins in UniProtKB/Swiss-Prot and stored those classified as antimicrobial with a high probability score (>90%). Its performance measures across a 10-fold cross-validation are more than satisfactory (accuracy: 0.91, sensitivity: 0.93, specificity: 0.90) and it succeeded in classifying 99.5% of the PhytAMP peptides correctly.Entities:
Mesh:
Substances:
Year: 2013 PMID: 24244550 PMCID: PMC3823563 DOI: 10.1371/journal.pone.0079728
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Comparative overview of other antimicrobial peptide studies.
| Study | Method | Accuracy | Features | Positive set | Negative set | Validation set |
| CAMP | SVMRandom Forests | 91.50%93.2% | 64 (after recursive feature elimination on initial set of 257) physicochemical properties (composition), dipeptide & tripeptide frequencies, distribution & transition of some features along sequences | 2578 experimentally validated CAMP peptides | 4011 random proteins from UniProt, synthesized sequences using random numbers, experimentally verified non-antimicrobial peptides (25) | 30% of positive & negative sets |
| Fjell et al | Quantitative structure-activity relationships (QSAR) | 80.00% | 44 QSAR descriptors | 1433 synthesized peptides, 9 amino-acids long(antibacterial acitivity measured experimentally) | ∼100000 synthesized peptides | |
| Torrent et al | ANNSVM | 90%75% | 8 physicochemical & structural properties (50 hidden neurons) | 1157 CAMP antimicrobial peptides | 991 randomly selected UniProt protein fragments | 290 antimicrobial peptides from CAMEL and RANDOM databases |
| Porto et al | SVM | 83.02% | 4 physicochemical properties | 199 peptides from APD | 199 proteins predicted to be transmembrane | 106 sequences from positive & negative training sets |
| Wang et al | BLASTP & Nearest-Neighbour Algorithm (NNA) | 93.31% | 25 composition & pseudo-amino acid composition features from initial set of 270 (for NNA) | 870 peptides from CAMP (including some predicted) | 8661 protein fragments randomly selected from UniProt | 1136 predicted peptides from CAMP |
Maximum, minimum and average values of Accuracy, Sensitivity, Specificity and Matthews Correlation Coefficient (MCC) for a 10-fold cross-validation.
| Values | Accuracy | Sensitivity | Specificity | MCC |
| Max | 0.94 | 0.96 | 0.94 | 0.87 |
| Min | 0.89 | 0.92 | 0.87 | 0.78 |
| Average | 0.91 | 0.93 | 0.90 | 0.82 |
Figure 1Distribution of predicted probabilities for antimicrobial (a) and non-antimicrobial (b) samples.
Figure 2Snapshots from the C-PAmP Database.
Figure 3Snapshots from the web interface of C-PAmP Database.
Figure 4C-PAmP predictions for 6 antimicrobial regions in protein O24006 of Impatiens balsamina (Balsam) in comparison with the corresponding annotations in UniProtKB/Swiss-Prot.
Figure 5Statistics of the predicted plant antimicrobial peptides.