| Literature DB >> 24586765 |
Kai Blin1, Daniyal Kazempour2, Wolfgang Wohlleben1, Tilmann Weber1.
Abstract
Lanthipeptides are a class of ribosomally synthesised and post-translationally modified peptide (RiPP) natural products from the bacterial secondary metabolism. Their name is derived from the characteristic lanthionine or methyl-lanthionine residues contained in the processed peptide. Lanthipeptides that possess an antibacterial activity are called lantibiotics. Whereas multiple tools exist to identify lanthipeptide gene clusters from genomic data, no programs are available to predict the post-translational modifications of lanthipeptides, such as the proteolytic cleavage of the leader peptide part or tailoring modifications based on the analysis of the gene cluster sequence. antiSMASH is a software pipeline for the identification of secondary metabolite biosynthetic clusters from genomic input and the prediction of products produced by the identified clusters. Here we present a novel antiSMASH module using a rule-based approach to combine signature motifs for biosynthetic enzymes and lanthipeptide-specific cleavage site motifs to identify lanthipeptide clusters in genomic data, assign the specific lanthipeptide class, predict prepeptide cleavage, tailoring reactions, and the processed molecular weight of the mature peptide products.Entities:
Mesh:
Substances:
Year: 2014 PMID: 24586765 PMCID: PMC3930743 DOI: 10.1371/journal.pone.0089420
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Lanthipeptide-related HMM profiles and scores.
| Name | Description | Cutoff | File |
| LANC_like | LanC-like lantibiotics biosynthesis protein | 17 | LANC_like.hmm |
| DUF4135 | Lantibiotic-associated domain | 150 | PF13575.hmm |
| Lant_dehyd_N | Lantibiotic dehydratase, N-terminus | 20 | Lant_dehyd_N.hmm |
| Lant_dehyd_C | Lantibiotic dehydratase, C-terminus | 20 | Lant_dehyd_C.hmm |
| Flavoprotein | Lantibiotic aminovinyl flavoprotein | 20 | PF0241.hmm |
| Trp_halogenase | Tryptophan halogenase | 20 | PF04820.hmm |
| p450 | P450 oxygenase | 60 | PF00067.hmm |
| Pkinase | Protein kinase domain | 30 | PF00069.hmm |
| adh_short | Short-chain dehydrogenase | 100 | PF00106.hmm |
| adh_short_C2 | Short-chain dehydrogenase, C-terminus | 100 | PF13561.hmm |
| Antimicr18 | Lantibiotic antimicrobial peptide 18 | 20 | Antimicrobial18.hmm |
| Gallidermin | Gallidermin | 20 | Gallidermin.hmm |
| L_biotic_A | Lantibiotic, type A | 20 | L_biotic_typeA.hmm |
| TIGR03731 | Lantibiotic, gallidermin/nisin family | 18 | TIGR03731.hmm |
| leader_d | Lantibiotic leader lacticin 481 group | 20 | LE-LAC481.hmm |
| leader_eh | Lantibiotic leader mersacidin cinnamycin group | 20 | LE-MER+2PEP.hmm |
| leader_abc | Lantibiotic leader LanBC modified | 20 | LE-LanBC.hmm |
| mature_d | Lantibiotic peptide lacticin 481 group | 20 | MA-LAC481.hmm |
| mature_ab | Lantibiotic peptide nisin epidermin group | 20 | MA-NIS+EPI.hmm |
| mature_a | Lantibiotic peptide nisin group | 20 | MA-NIS.hmm |
| mature_b | Lantibiotic peptide epidermin group | 20 | MA-EPI.hmm |
| mature_ha | Lantibiotic peptide two component alpha | 20 | MA-2PEPA.hmm |
| mature_h_beta | Lantibiotic peptide two component beta | 20 | MA-2PEPB.hmm |
| lacticin_l | lantibiotic leader lacticin 481 group (Dufour et al) | 20 | LE-DUF.hmm |
| lacticin_mat | lantibiotic peptide lacticin 481 group (Dufour et al) | 20 | MA-DUF.hmm |
| LD_lanti_pre | FxLD family lanthipeptide | 20 | TIGR04363.hmm |
| strep_PEQAXS |
| 20 | strep_PEQAXS.hmm |
Class I cleavage site motif sequences.
| Name | Position | Sequence |
|
| 22…41 | FAFDTTDTTIVASNDDPDTR |
|
| 22…41 | FTFDTTDTIVAESNDDPDTR |
|
| 6…23 | FNLDLVEVSK–SNTGASAR |
|
| 6…24 | FNLDLIKISK-ENNSGASPR |
|
| 6…23 | FNLDLVSVSKK–DSGASPR |
|
| 6…23 | FNLDLLSVSKK–DSGASPR |
|
| 6…23 | FNLDLVSVSKT–DSGASTR |
|
| 11…30 | FDLDVKVNAKESNDSGAEPR |
|
| 11…30 | FNLDVKVNAKESNDSGAEPR |
|
| 7…24 | FDLDVVKVSKQ–DSKITPQ |
|
| 8…26 | FDLEIKKETSQNTD-ELEPQ |
|
| 8…26 | FDLEIKKDNME-NNNELEPQ |
|
| 6…24 | FDLNLNKGVETQK-SDLSPQ |
|
| 7…23 | FDLDIVVK-KQ–DDVVQPN |
|
| 8…23 | FDLDLKTNKK–D-TATPY |
|
| 17…33 | LDLDLSIGVEE–ITAGPA |
Class II cleavage site motif sequences.
| Name | Position | Sequence |
|
| 19…52 | EL-TILGG |
|
| 15…47 | ELDAILGG |
|
| 22…51 | ELMEVAGG |
|
| 16…48 | ELEQILGG |
|
| 20…51 | ELDNLLGG |
|
| 30…74 | EQHSIAGG |
|
| 27…72 | ELKALVGG |
|
| 18…66 | ELEMLIGG |
|
| 16…59 | FDEDVFGA |
|
| 26…65 | EGDESHGG |
|
| 23…57 | ELNEVLGA |
|
| 18…51 | ELDQIIGA |
|
| 24…56 | ELDNVLGA |
|
| 33…69 | ILAGVNGA |
|
| 28…65 | ELSSLAGS |
|
| 12…68 | EMEAIQGS |
|
| 24…59 | NLLNVNGA |
|
| 52…59 | IAATEAFA |
|
| 41…48 | QMDKLVGA |
|
| 35…64 | EDRTIYAA |
|
| 36…66 | RRVVSPYM |
Stability of the prediction motifs.
| Class | # of Sequences | # Found | % Found | # Correct | % Correct |
|
| 16 | 16 | 1.00 | 16 | 1.00 |
|
| 21 | 21 | 1.00 | 17 | 0.81 |
Figure 1Example lanthipeptide output antiSMASH 2.2 output for the microbisporicin [5] gene cluster, showing the predicted leader/core peptide split and the predicted tailoring reactions and weights in the sidebar.
Benchmark results.
| Substance | Class | Predicted Mass (Da) | Actual Mass (Da) | Number of bridges | Source |
|
| I | 3466.7 | 3467.5 | 4 |
|
|
| I | 3029.6 | 3029.0 | 5 |
|
|
| I | 3353.9 | 3354.5 | 5 |
|
|
| I | 3330.9 | 3331.5 | 5 |
|
|
| I | 3326.9 | 3327.3 | 5 |
|
|
| I | 2164.0 | 2164.0 | 4 |
|
|
| I | 2164.0 | 2164.0 | 4 |
|
|
| I | 3346.7 | 3346.0 | 5 |
|
|
| I | 3487.1 | 3488.0 | 3 |
|
|
| I | 3135.6 | 3135.0 | 3 |
|
|
| I | 3261.5 | 3265.0 | 7 |
|
|
| I | 2441.9 | 2442.0 | 3 |
|
|
| I | 2232.4 | 2232.0 | 5 |
|
|
| II | 3243.5 | 3244.0 | 3 |
|
|
| II | 2366.6 | 2368.0 | 3 |
|
|
| II | 2796.1 | 2795.0 | 3 |
|
|
| II | 3250.7 | 3251.0 | 4 |
|
|
| II | 3632.8 | 3021.0 | 4 |
|
|
| II | 3395.9 | 3428.0 | 2 |
|
|
| II | 3322.6 | 3322.3 | 4 |
|
|
| II | 2843.2 | 2847.5 | 3 |
|
|
| II | 2733.1 | 2740.0 | 3 |
|
|
| II | 2043.2 | 2041.0 | 3 |
|
|
| II | 2399.0 | 1826.3 | 3 |
|
|
| II | 1856.2 | 1860.5 | 4 |
|
|
| II | 2145.5 | 2145.0 | 4 |
|
|
| I | 2193.3 | 2194.0 | 5 |
|
|
| I | 3171.8 | 3171.7 | 3 |
|
N-terminal removal of six amino acids not predicted.
Contains a disulphite bridge.
* Not shown experimentally.