| Literature DB >> 33013838 |
Nannette Y Yount1,2,3, David C Weaver4, Jaime de Anda5,6, Ernest Y Lee5,6, Michelle W Lee5,6, Gerard C L Wong5,6, Michael R Yeaman1,2,3,7.
Abstract
Antimicrobial compounds first arose in prokaryotes by necessity for competitive self-defense. In this light, prokaryotes invented the first host defense peptides. Among the most well-characterized of these peptides are class II bacteriocins, ribosomally-synthesized polypeptides produced chiefly by Gram-positive bacteria. In the current study, a tensor search protocol-the BACIIα algorithm-was created to identify and classify bacteriocin sequences with high fidelity. The BACIIα algorithm integrates a consensus signature sequence, physicochemical and genomic pattern elements within a high-dimensional query tool to select for bacteriocin-like peptides. It accurately retrieved and distinguished virtually all families of known class II bacteriocins, with an 86% specificity. Further, the algorithm retrieved a large set of unforeseen, putative bacteriocin peptide sequences. A recently-developed machine-learning classifier predicted the vast majority of retrieved sequences to induce negative Gaussian curvature in target membranes, a hallmark of antimicrobial activity. Prototypic bacteriocin candidate sequences were synthesized and demonstrated potent antimicrobial efficacy in vitro against a broad spectrum of human pathogens. Therefore, the BACIIα algorithm expands the scope of prokaryotic host defense bacteriocins and enables an innovative bioinformatics discovery strategy. Understanding how prokaryotes have protected themselves against microbial threats over eons of time holds promise to discover novel anti-infective strategies to meet the challenge of modern antibiotic resistance.Entities:
Keywords: anti-infecive agents; antimicrobial; bacteriocin; computational biology; host-defence
Mesh:
Substances:
Year: 2020 PMID: 33013838 PMCID: PMC7494827 DOI: 10.3389/fimmu.2020.01873
Source DB: PubMed Journal: Front Immunol ISSN: 1664-3224 Impact factor: 7.561
Figure 1Components and Process of the BACIIα Algorithm.
Retrieved sequences using BACIIα algorithm by stage of study.
| Bacteriocins | 376 | 53 | 308 | 82 |
| Competence enhancing peptides | 129 | 18 | 6 | 2 |
| Pheromones | 7 | 1 | 1 | 0.3 |
| Autoinducing peptides | 12 | 2 | 8 | 2 |
| Other | 182 | 26 | 52 | 14 |
| Total characterized Sequences | 706 | – | 375 | – |
Bacteriocin peptides retrieved by multi-component formula search.
| IIa | Acidocin | 8912, LF221B, M | |
| IIb | Amylovorin | L alpha, L beta, L471 | |
| IId | Lactococcin | A, A1, G beta, Q beta | |
Source organisms of retrieved dataset proteins.
| Gram-Positive | ||||
| 74 | ||||
| 50 | ||||
| Other bacilli | 14 | |||
| 10 | ||||
| Other | 2 | |||
| 2 | ||||
| Gram-Negative | 24 | |||
| 13 | ||||
| 9 | ||||
| 2 | ||||
| <1.0 | ||||
| <1.0 | ||||
| Non-gram staining | <1.0 | |||
| <1.0 | ||||
| Unclassified | <1.0 | <1.0 |
Biophysical properties of retrieved dataset proteins.
| Known bacteriocins | 308 | 22 | 0.33 (±0.2) | 1.1 (±1.5) | 0.71 (±0.3) | 0.46 (±0.1) | 6.8 (±2.3) |
| Bacteriocin-related | 15 | 1 | 0.51 (±0.1) | 1.9 (±1.9) | 0.85 (±0.9) | 0.37 (±0.2) | 8.5 (±2.3) |
| Non-bacteriocin | 52 | 4 | 0.40 (±0.1) | 0.4 (±1.5) | 0.42 (±0.4) | 0.43 (±0.2) | 7.1 (±2.4) |
| Uncharacterized | 1038 | 73 | 0.39 (±0.2) | 0.1 (±1.5) | 0.68 (±0.4) | 0.41 (±0.4) | 6.4 (±2.1) |
Includes pheromones, competence-inducing peptides and others.
μH, hydrophobic moment; Q, charge; NK/NK+NR relative percentage of lysine vs. arginine; H, hydrophobicity; PI, isoelectric point. Values are presented ± standard deviation.
Figure 2NK/NK+NR Ratio and Mean Hydrophobicity in Study Molecules. Percentage of lysine (NK) relative to arginine (NR) expressed as (NK/NK+NR) vs. hydrophobicity (H) in study αHDPs Preference of lysine as compared to arginine is reflected in an increased value of H for peptides capable of generating NGC in membranes as predicted by the saddle-splay rule.
Figure 3Positional and Spatial Amphipathic Residue Frequency. (A) Relative amino acid percentages are displayed for bacteriocins. (B) Percentages of individual residues associated with either the polar or non-polar search term group are represented as various color blocks. Residues above the x-axis are associated with the polar residue group and residues below the axis are found on the.
Quartile analysis of dataset protein properties vs. SVM scoring.
| Category | Total | μH*Q > 1.0 | SVM | ||||||
| Known bacteriocins | 308 | 43 | 14 | 0.52 | 3.2 | 0.68 | 0.38 | 8.68 | 0.90 |
| Bacteriocin-related | 15 | 9 | 60 | 0.56 | 3.4 | 0.91 | 0.29 | 9.98 | 0.64 |
| Non-bacteriocin | 52 | 10 | 19 | 0.52 | 2.7 | 0.27 | 0.34 | 8.18 | 0.72 |
| Uncharacterized | 1,038 | 85 | 8 | 0.53 | 3.2 | 0.56 | 0.33 | 8.67 | 0.91 |
| Category | Total | μH*Q > 0.50 | SVM | ||||||
| Known bacteriocins | 308 | 79 | 26 | 0.46 | 2.6 | 0.69 | 0.42 | 7.9 | 0.80 |
| Bacteriocin-related | 15 | 10 | 66 | 0.57 | 3.2 | 0.92 | 0.32 | 9.6 | 0.63 |
| Non-bacteriocin | 52 | 15 | 29 | 0.50 | 2.4 | 0.38 | 0.35 | 8.1 | 0.65 |
| Uncharacterized | 1,038 | 208 | 20 | 0.49 | 2.3 | 0.63 | 0.36 | 7.6 | 0.80 |
| Category | Total | μH*Q > 0.25 | SVM | ||||||
| Bacteriocins | 308 | 161 | 52 | 0.36 | 2.1 | 0.75 | 0.42 | 7.2 | 0.58 |
| Bacteriocin-related | 15 | 12 | 80 | 0.52 | 2.8 | 0.76 | 0.35 | 9.1 | 0.54 |
| Non-bacteriocin | 52 | 16 | 31 | 0.50 | 2.3 | 0.4 | 0.35 | 7.9 | 0.67 |
| Uncharacterized | 1,038 | 319 | 31 | 0.44 | 1.9 | 0.65 | 0.38 | 7.2 | 0.65 |
The μH*Q values represent different percentile cutoffs for peptide groups (dark orange, >1.0; middle orange; >0.50; and light orange, >0.25). Definition legend: μH–hydrophobic moment; Q–charge; N.
Figure 4Spearman Correlations Multi-Component BACIIα Formula and ML Classifier. Correlations were carried out to assess the predictive accuracy and monotonic ranking between the BACIα algorithm and the SVM classifier scored peptide sequences. Plots compare HMQ (BACIIα predictive) vs. sigma (classifier probability) scores for study peptides in the top 25th (HMQ25) and 50th (HMQ50) percentiles. The bacteriocin groups (A,B) display scores for identified bacteriocins. The uncharacterized groups (C,D) reflect those peptides which are also predicted to be membrane permeabilizing by the two protocols. All comparisons were found to be significant given a cutoff value of P ≤ 0.05. Correlations were carried out using Mathematica (Wolfram).
Figure 5Genomic Environment Surrounding Putative Bacteriocins. Analysis of 20 kb region surrounding putative bacteriocin genes. Red—putative bacteriocin; gray—hypothetical proteins; dark blue—C39 bacteriocin processing peptidase; medium blue—exinuclease ABC subunit; light blue—ABC transporter, ATP binding protein; green other enzyme; purple—polymerase related protein.
Figure 6(A) Sequence Analysis and Antimicrobial Activity of Putative Bacteriocins. Putative bacteriocins synthesized for assessment of antimicrobial activity. Arrows indicate hydrophobic moment and direction. (A) Peptide 1: A0RKV8 (+4.5), PI−10.7; Bacillus thuringiensis (G+); FKVIVTDAGHYPREWGKQLGKWIGSKIK (24); (B) Peptide 2: D6E338 (+4), PI 10.3; Eubacterium rectale; KRNYSIEKYVKNYlDFIKKAIDIFRPMPI (25); (C); Peptide 3: B3ZXE9 (+6), PI−10.9; Bacillus cereus; KTIATNATYYPNKWAKSAGKWIASKIK (26). (D) Peptide 4: R2S6C2 (+4), PI−10.5; Enterococcus pallens, QYDKTGYKIGKTVGTIVRKGFEIWSIFK (24).
Figure 7Microbicidal activity of study test peptides vs. a panel of prototypic gram-positive (S. aureus), gram-negative (S. typhimurium, P. aeruginosa, A. baumannii) and fungal (C. albicans) pathogens at two pH representing: (A)–bloodstream (pH 7.5); or (B)–phagolysosomal/abscess (pH 5.5). Data represent experiments independently performed a minimum of n = 3 times. Error bars represent the standard error of the mean. All study peptides were found to have statistically significantly greater activity (P < 0.01) than the dilution vehicle (ddH20) in at least one pH condition. Note the differential pH dependent efficacy of Peptide 3 against S. aureus. The relative efficacies of study peptides against representative organisms at pH 7.5 or pH 5.5 are shown in the cluster analyses in panels (C,D), respectively (red, relatively greater efficacy; blue, relatively lesser efficacy).