| Literature DB >> 35884206 |
Diana Lin1, Darcy Sutherland1,2,3, Sambina Islam Aninta1, Nathan Louie1, Ka Ming Nip1,4, Chenkai Li1,4, Anat Yanai1, Lauren Coombe1, René L Warren1, Caren C Helbing5, Linda M N Hoang2,3, Inanc Birol1,2,3.
Abstract
Antibiotic resistance is a global health crisis increasing in prevalence every day. To combat this crisis, alternative antimicrobial therapeutics are urgently needed. Antimicrobial peptides (AMPs), a family of short defense proteins, are produced naturally by all organisms and hold great potential as effective alternatives to small molecule antibiotics. Here, we present rAMPage, a scalable bioinformatics discovery platform for identifying AMP sequences from RNA sequencing (RNA-seq) datasets. In our study, we demonstrate the utility and scalability of rAMPage, running it on 84 publicly available RNA-seq datasets from 75 amphibian and insect species-species known to have rich AMP repertoires. Across these datasets, we identified 1137 putative AMPs, 1024 of which were deemed novel by a homology search in cataloged AMPs in public databases. We selected 21 peptide sequences from this set for antimicrobial susceptibility testing against Escherichia coli and Staphylococcus aureus and observed that seven of them have high antimicrobial activity. Our study illustrates how in silico methods such as rAMPage can enable the fast and efficient discovery of novel antimicrobial peptides as an effective first step in the strenuous process of antimicrobial drug development.Entities:
Keywords: AMP discovery; antimicrobial peptide; antimicrobial resistance; genome mining
Year: 2022 PMID: 35884206 PMCID: PMC9312091 DOI: 10.3390/antibiotics11070952
Source DB: PubMed Journal: Antibiotics (Basel) ISSN: 2079-6382
Figure 1Statistics and attrition as the sequencing data are processed by the rAMPage AMP discovery pipeline. rAMPage processes RNA-seq datasets from raw reads to transcripts to putative AMPs. In this case, a putative AMP is defined as a sequence with an AMPlify score 10 for amphibians or 7 for insects, a length 30 AA, and a charge 2. Datasets with a reference transcriptome used during assembly are indicated with an asterisk. The total number of putative AMPs (n = 1478, including 341 duplicates) are shown in purple, discovered from a total of ~53 million assembled transcripts.
Figure 2Antimicrobial susceptibility and hemolysis test results of seven moderately and highly active putative AMPs. AMPs were tested for their bioactivity against E. coli and S. aureus to determine minimum inhibitory and bactericidal concentrations (MIC and MBC, respectively). AMPs were also tested for their hemolytic activity using pig red blood cells to determine hemolytic concentration (HC50) values. Moderate activity (MIC and MBC in the range of 8–16 μg/mL) and high activity (4 μg/mL) thresholds indicated by the dashed lines. AMPs are ordered by increasing MIC values against E. coli ATCC 25922.
Characteristics of putative AMP sequences with moderate to high in vitro bioactivity against E. coli or S. aureus. Each sequence is separated into the prepro sequence and the predicted mature peptide sequence. Conserved proteolytic cleavage sites are underlined in the prepro sequences.
| Prepro-Sequence | Putative Mature Peptide | ||||||
|---|---|---|---|---|---|---|---|
| Sequence | Length | Charge | AMPlify Score | MIC | Peptide ID | ||
| E. coli † |
| ||||||
| MFTMKKSLLVLFFLGIVSLSLCEEERNADEDDGEMTEEVKR | GILDTLKQLGKAAVQGLLSKAACKLAKTC | 29 | 4 | 80.0 | 2–4 | 4–8 | AmMa1 |
| LGIVSLSLCQEERSADDEEGEVIEEEVKR | GFMDTAKNVAKNVAVTLLYNLKCKITKAC | 29 | 4 | 69.2 | 4 | 64 | OdMa12 |
| MFTMKKSLLFFFLGTIALSLCEEERGADEEENGGEITDEEVKR | GLLLDTVKGAAKNVAGILLNKLKCKVTGDC | 30 | 3 | 61.8 | 8 | 16–32 | PeNi10 |
| MFTMKKSLLLVFFLGTIALSLCEEERGADDDNGGEITDEEIKR | GILTDTLKGAAKNVAGVLLDKLKCKITGGC | 30 | 3 | 61.8 | 8–16 | 32–128 | PeNi11 |
| MFTLRKSLLLLFFLGMVSLSLCEQERDADEDEGEVTEEVKR | GLWTTIKEGVKNFSVGVLDKIRCKITGGC | 29 | 3 | 67.5 | 4–8 | 16–64 | PeNi14 |
| MKLLALVLVLSCVVAYTTARKRGQYWPTNTKIFTTPYRFRREADQGSIVANLKNTPQLPFDDNENLRLVLFDNDPTVDLGEDDKEIPGPQSQPNALSNNLHLIDENDYFSSYTSQPGTYRSFPRNFGTSGRYRWRREAGGHVEPRLRFDAETQRGNSFFTDFADLQRRANGRGIEPTVSATAGIRFRQEADQINPLAVRRERR | SWLSKSVKKLVNKKNYTRLEKLAKKKLFNE | 30 | 8 | 25.5 | 1–2 | >128 | TeRu4 |
| IFLVGCKLFGNFILQRMQLLLALADAVA | KIKIPWGKVKDFLVGGMKAVGKK | 23 | 6 | 45.0 | 1–4 | 2–8 | TeBi1 |
* MIC: Minimum inhibitory concentration. † Escherichia coli ATCC 25922; Staphylococcus aureus ATCC 29213.
Comparison of sequence identities (%) of the discovered AMPs with their best-known AMP blastp matches to the NCBI non-redundant (nr) protein database over the entire sequence (precursor), prepro or mature sequences.
| Peptide ID | Source Organism | Highest Scoring Blastp Match | Sequence Identity (%) | ||
|---|---|---|---|---|---|
| Precursor | Prepro | Mature | |||
| AmMa1 |
| Palustrin-2GN3 (ADM34231.1) | 97 | 100 | 93 |
| OdMa12 |
| Odorranain-F2 (ABG76517.1) | 98 | 100 | 97 |
| PeNi10 |
| Pelophylaxin-1 (Q2WCN8.1) | 82 | 86 | 77 |
| PeNi11 |
| Pelophylaxin-1 (Q2WCN8.1) | 100 | 100 | 100 |
| PeNi14 |
| Palustrin-2HB1 (AIU998997.1) | 90 | 93 | 86 |
| TeRu4 |
| Uncharacterized protein (XP_024884948.1) | 94 | 93 | 97 |
| TeBi1 |
| M-myrmicitoxin(01)-Tb1a (W8GNV3.1) | 100 | - | 100 |
* Highest scoring blastp match when query sequence consists of only the mature sequence instead of the whole precursor. -: no significant alignment.
Figure 3rAMPage workflow. The rAMPage pipeline and downstream selection of putative AMPs for validation.