| Literature DB >> 30200662 |
Rawana N Alkhalili1, Björn Canbäck2.
Abstract
Lanthipeptides are ribosomally synthesized and post-translationally modified polycyclic peptides. Lanthipeptides that have antimicrobial activity are known as lantibiotics. Accordingly, the discovery of novel lantibiotics constitutes a possible solution for the problem of antibiotic resistance. We utilized the publicly available genome sequences and the bioinformatic tools tailored for the detection of lanthipeptides. We designed our strategy for screening of 252 firmicute genomes and detecting class-I lanthipeptide-coding gene clusters. The designed strategy resulted in identifying 69 class-I lanthipeptide sequences, of which more than 10% were putative novel. The identified putative novel lanthipeptides have not been annotated on the original or the RefSeq genomes, or have been annotated merely as coding for hypothetical proteins. Additionally, we identified bacterial strains that have not been previously recognized as lanthipeptide-producers. Moreover, we suggest corrections for certain firmicute genome annotations, and recommend lanthipeptide records for enriching the bacteriocin genome mining tool (BAGEL) databases. Furthermore, we propose Z-geobacillin, a putative class-I lanthipeptide coded on the genome of the thermophilic strain Geobacillus sp. ZGt-1. We provide lists of putative novel lanthipeptide sequences and of the previously unrecognized lanthipeptide-producing bacterial strains, so they can be prioritized for experimental investigation. Our results are expected to benefit researchers interested in the in vitro production of lanthipeptides.Entities:
Keywords: BAGEL; Geobacillus; Z-geobacillin; antiSMASH; antimicrobial; bacteriocins; firmicutes; lanthipeptides; lantibiotics; lantipeptides
Mesh:
Substances:
Year: 2018 PMID: 30200662 PMCID: PMC6165006 DOI: 10.3390/ijms19092650
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
Bacterial genomes where a total of eight putative novel class-I lanthipeptides were identified. Annotations of the lanthipeptide-coding gene in the original genome and the RefSeq records are presented.
| Bacterial Species, Strain (Lanthipeptide Reference Number) | RefSeq Genome Accession Number | Annotation of the Lanthipeptide-Coding Gene | |
|---|---|---|---|
| Original Genome Record | RefSeq Genome Record | ||
| NC_017200 | ‘Hypothetical protein’ | Unannotated | |
| LDPD01000000 1 | Lanthipeptide | Unavailable in RefSeq | |
| NC_017542 | Unannotated | ‘Hypothetical protein’ | |
| NC_017542 | Unannotated | ‘Hypothetical protein’ | |
| NC_014622 | Partly inaccurately annotated | ‘Hypothetical protein’ | |
| NC_014622 | Incorrectly annotated as coding for subtilin | ‘Hypothetical protein’ | |
| NC_007795 | Unannotated | Unannotated | |
| NC_022246 | Unannotated | ‘Hypothetical protein’ | |
1 The presented genome sequence accession number for Geobacillus sp. ZGt-1 belongs to the original genome record; the whole genome shotgun (WGS) project. 2 Only the core sequence of this lanthipeptide was briefly mentioned in another study, but the lanthipeptide was not clearly classified as class-I (details are given in the text). 3 S. intermedius has not been reported in literature as a class-I lanthipeptide-producer.
Figure 1Workflow chart summarizing the analyses steps carried out in this study.
Bacterial strains with predicted lanthipeptide production potential, representing possible subjects for future experimental investigations. Strains typed in bold belong to a firmicute species that has not been reported in literature as a producer of class-I lanthipeptides, but the lanthipeptide production potential of its presented strains has been addressed in the current study.
| Bacterial Strain | RefSeq Genome Accession Number | Total Number of Harbored Class-I Lanthipeptides | Lanthipeptide Reference Number 1 | Identity to Experimentally Verified Lanthipeptide 2 | Reference 3 |
|---|---|---|---|---|---|
| NC_006582 | 1 | - 4 | 56% clausin | [ | |
| NC_014023 | 2 | I and II | 56% gallidermin | [ | |
| NC_014976 | 1 | - 4 | 100% subtilomycin | [ | |
| NC_014479 | 1 | - 4 | 100% subtilin | This study | |
| NC_017200 | 1 | - 4 | No hits | This study | |
| NC_020394 | 5 | I and II | 100% thuricin 4A-4 | [ | |
| NC_022873 | 2 | I | 53% gallidermin | This study | |
| NC_006510 | 2 | I | 91% geobacillin I | [ | |
| LDPD01000000 | 1 | - 4 | 91% geobacillin I | This study | |
| NC_016593 | 1 | - 4 | 79% geobacillin I | This study | |
| NC_017486 | 1 | - 4 | 100% nisin A | [ | |
| NC_020450 | 1 | - 4 | 100% nisin Z | This study | |
| NC_023037 | 1 | - 4 | 94% paenilan | [ | |
| NC_017542 | 2 | I | 64% paenilan | [ | |
| NC_014622 | 2 | I | 64% paenilan | [ | |
| NC_017351 | 2 | I | 100% BsaA2 | This study | |
| NC_021670 | 2 | I | 100% BsaA2 | This study | |
| NC_002951 | 2 | I | 100% BsaA2 | [ | |
| NC_017337 | 2 | I | 100% BacCH91 | [ | |
| NC_017349 | 2 | I | 81% BsaA2 | This study | |
| NC_021059 | 2 | I | 100% BsaA2 | This study | |
| NC_002953 | 2 | I | 100% BsaA2 | [ | |
| NC_003923 | 2 | I | 100% BsaA2 | [ | |
| NC_007795 | 2 | I | 100% BsaA2 | [ | |
| NC_009641 | 2 | I | 100% BsaA2 | [ | |
| NC_017347 | 2 | I | 100% BsaA2 | This study | |
| NC_017331 | 2 | I | 100% BsaA2 | This study | |
| NC_007793 | 2 | I | 100% BsaA2 | [ | |
| NC_010079 | 2 | I | 100% BsaA2 | [ | |
| NC_016912 | 2 | I | 100% BsaA2 | This study | |
| NC_022604 | 2 | I | 100% BsaA2 | This study | |
|
| NC_022246 | 1 | - 4 | No hits | This study |
|
| NC_022237 | 1 | - 4 | 81% nisin F | This study |
| NC_015600 | 1 | - 4 | 91% nisin U | [ | |
| NC_007296 | 1 | - 4 | 100% streptin | [ | |
| NC_008021 | 1 | - 4 | 100% streptin | This study | |
| NC_008022 | 1 | - 4 | 100% streptin | [ | |
| NC_008024 | 1 | - 4 | 98% streptin | [ | |
| NC_017618 | 1 | - 4 | 100% suicin 90-1330 | This study | |
| NC_020526 | 1 | - 4 | 100% suicin 90-1330 | This study |
The amino acid (aa) sequences of the lanthipeptides are presented in Table S1. 2 Identity (%) to the experimentally verified lanthipeptides is based on the bacteriocin genome mining tool (BAGEL) BLAST, the literature, or both. 3 The cited references represent studies where the respective strain was mentioned as a potential lanthipeptide producer based on an in silico analysis. In some of these studies, the lanthipeptide aa sequences have not been determined. 4 (-) Indicates the lack of a lanthipeptide reference number because the strain has only one class-I lanthipeptide.
Figure 2Alignment of all identified class-I LanA precursors with the exception of the lanthipeptide of Streptococcus intermedius strain B196 which aligned poorly. Leader peptides are in lower case and are reported as shown in the results of antiSMASH. No manual editing of the alignment was performed. For the full name of bacterial species, see Table S1.
Figure 3Class-I lanthipeptide clusters of three Geobacillus strains, G. kaustophilus HTA426, Geobacillus sp. ZGt-1, and G. thermodenitrificans NG80-2. The illustrated clusters were drawn after the lanthipeptide biosynthesis gene clusters generated by antiSMASH 3, which was run by analyzing the genome FASTA files of the strains [24]. The locus-tags of the genes and accession numbers of the coded proteins are presented in Tables S3–S5 for strains Geobacillus sp. ZGt-1, G. kaustophilus HTA426, and G. thermodenitrificans NG80-2, respectively.
Figure 4Amino acid sequence alignment of the class-I lanthipeptides of G. kaustophilus HTA426 (lanthipeptide I), Geobacillus sp. ZGt-1 (Z-geobacillin), and G. thermodenitrificans NG80-2 (geobacillin I). The leader peptides are typed in bold as opposed to the core peptides. (*) denotes a conserved aa in the three strains. The red arrow indicates the (antiSMASH-predicted) proteolytic cleavage site for the removal of the leader peptide. The alignment was performed using BioEdit [58].
Figure 5Schematic illustration of the putative biosynthesis pathway of Z-geobacillin. Amino acid sequences of the in silico-predicted proteins of strain ZGt-1 were annotated using the pathway mapping tool available from KEGG Automatic Annotation Server (KAAS) [61]. Predicted pathway indicated that the strain has all the genes required for the lanthipeptide biosynthesis, regulation and self-immunity, except for a dedicated protease-coding gene (represented as a white oval shape with a crossing red line). Details of the biosynthesis pathway are given in the text. The scheme was re-drawn after the KAAS-generated lanthipeptide biosynthesis pathway map of strain ZGt-1. Identical colors reflect correlated biological functions.