| Literature DB >> 25893081 |
Gajender Aleti1, Angela Sessitsch1, Günter Brader1.
Abstract
Bacillus and related genera in the Bacillales within the Firmicutes harbor a variety of secondary metabolite gene clusters encoding polyketide synthases and non-ribosomal peptide synthetases responsible for remarkable diverse number of polyketides (PKs) and lipopeptides (LPs). These compounds may be utilized for medical and agricultural applications. Here, we summarize the knowledge on structural diversity and underlying gene clusters of LPs and PKs in the Bacillales. Moreover, we evaluate by using published prediction tools the potential metabolic capacity of these bacteria to produce type I PKs or LPs. The huge sequence repository of bacterial genomes and metagenomes provides the basis for such genome-mining to reveal the potential for novel structurally diverse secondary metabolites. The otherwise cumbersome task to isolate often unstable PKs and deduce their structure can be streamlined. Using web based prediction tools, we identified here several novel clusters of PKs and LPs from genomes deposited in the database. Our analysis suggests that a substantial fraction of predicted LPs and type I PKs are uncharacterized, and their functions remain to be studied. Known and predicted LPs and PKs occurred in the majority of the plant associated genera, predominantly in Bacillus and Paenibacillus. Surprisingly, many genera from other environments contain no or few of such compounds indicating the role of these secondary metabolites in plant-associated niches.Entities:
Keywords: Genome mining; Lipopeptides; Non-ribosomal protein synthetase; Paenibacillus; Polyketides; Structure prediction
Year: 2015 PMID: 25893081 PMCID: PMC4397504 DOI: 10.1016/j.csbj.2015.03.003
Source DB: PubMed Journal: Comput Struct Biotechnol J ISSN: 2001-0370 Impact factor: 7.271
Fig. 2Chemical structures of polyketides of Bacillus and Paenibacillus.
(A) Polyketides from B. amyloliquefaciens FZB42 (a, b, c) and Bacillus sp. AH159-1 (c): (a) difficidins, (b) bacillaenes and (c) macrolactins. Stereochemistry not shown.
(B) Polyketides from Paenibacillus: (a) Paenimacrolidin from Paenibacillus sp. F6-B70. Stereochemistry unknown. (b) Paenilamicin from P. larvae DSM25430.
Fig. 1Architectures of type I polyketide synthases (PKS) showing similarities and dissimilarities in known and predicted PKs. Iterative domains: ACP, acyl carrier protein; PCP, peptidyl carrier protein; A, adenylation; KS, ketosynthase; DH, dehydratase; MT, methyl transferase; KR, ketoreductase; TE, thioesterase. Further details of domains are described in Table 1. Modules and recruited amino acids indicated below, gene names indicated above each illustration.
(A) Gene clusters of the three types of well-known PKS from B. amyloliquefaciens FZB42: (a) difficidin, (b) macrolactin, (c) bacillaene.
Modular regions of predicted PKS: (a) bacillaene variant from P. pini 16418; number and order of the domains differ from B.amyloliquefaciens FZB42 bacillaene, (b) novel PKS from P. polymyxa E681; an adenylation domain specifies glycine, (c) paenimacrolidine like PKS from P. durus DSM 1735, (d) novel PKS form Brevibacillus brevis NBRC 100599; adenylation domains specify ala and ser, also contains the methylation domains- oMT and nMT. These predicted PKS machinery in Paenibacillus may work without thioesterase.
Fig. 3Organization of the non-ribosomal peptide synthetases (NRPS) encoding lipopeptides in Paenibacillus and Bacillus. Iterative domains: A, adenylation; T, thiolation; E, epimerization; MCT, malonyl-CoA transacylase; ACL, acyl-coA ligase; AMT, aminotransferase; dab, 2,4-diaminobutyric acid; orn, ornithine; KS, keto synthetase; TE, thioesterase. Further details of domains are described in Table 1. Modules and recruited amino acids indicated below, gene names indicated above each illustration.
(A) Organization of the known NRPS (a) polymyxin A in P. polymyxa E681, (b) fusaricidin in P. polymyxa E681 and (c) tridecaptin A in P. terrae NRRL B-30644.
(B) Organization of the predicted novel NRPS encoding (a) a heptapeptide in P. polymyxa E681; modular architecture is similar to the known Iturin but predicted amino acid composition is completely different and (b) organization of the known mycosubtilin operon [69], an iturin member from B. subtilis for comparison.
Fig. 4Chemical structures of lipopeptides from Bacillus and Paenibacillus.
(A) Lipopeptides from B. amyloliquefaciens FZB42 (a,b,c): (a) surfactin, (b) bacillomycin (an iturin member), (c) plipastatin (a fengycin member) and (d) kurstakin from B. thuringiensis kurstaki HD-1.
(B) Lipopeptides from Paenibacillus: (a) polymyxin A from P. polymyxa E681, (b) fusaricidin C from P. polymyxa E681, (c) paenibacterin from Paenibacillus sp. OSY-SE (d) tridecaptin from P. terrae NRRL B-30644.
Predicted lipopeptides and type I polyketides from selected members of Bacillales.
| GenBank ID | Organism | Lipopeptide* | Type I polyketide* |
|---|---|---|---|
| CP000154.1 | Polymyxin A, structure and biosynthetic gene cluster confirmed [SKChoi 2009, Catch JR 1949] | Novel polyketide | |
| ARIL00000000.1 | Polymyxin A variant | Novel polyketide (same as above) - | |
| ARIL00000000.1 | Novel fusaricidin variant L-thr-D-val-L-ile-D-ser-D-asn-L-ala; 49% identity to fusaricidin of | No clusters found | |
| CP006941.1 | Predicted heptapeptide variant (pk-nrp) + (thr-ser-ala) + (phe-gln-glu) | Incomplete PKS predicted | |
| CP003235.1 | Predicted heptapeptide variant | Incomplete PKS predicted | |
| CP009288.1 | Incomplete NRPS predicted | Paenimacrolidine | |
| BAVZ00000000.1 | Incomplete NRPS predicted | Bacillaene variant, gly; ala (KS = 14, | |
| ANAT00000000.1 | Bacillomycin D, surfactin, plipastatin; similar to | Bacillaene, macrolactin, difficidin; | |
| AULE00000000.1 | Paenibacterin variant | Incomplete PKS predicted | |
| ARMT00000000.1 | Unknown heptapeptide-architecture similar to Iturin family | Incomplete PKS predicted | |
| CP003355.1 | IturinA | Paenilamicins: A1, B1, A2, B2, -a | |
| CP003763.1 | Kurstakin, structure confirmed [Hathout et al. 2000] | No clusters found | |
| CP004069.1 | Kurstakin variant | No clusters found | |
| CP000560.1 | SurfactinA [Peypoux F 1994, Koumoutsi A 2004] | Bacillaene gly; ala (KS = 14, DH = 8, | |
| JOKF00000000.1 | SurfactinA-similar to FZB42, Plipastatin B (similar to FZB42 but Glu | Macrolactin variant (KS = 11, DH = 3, | |
| NC_014639.1 | SurfactinC | Bacillaene variant, similar to FZB42 | |
| CM000488.1 | SurfactinA; plipastatin B; similar to FZB42 | bacillaene similar to FZB42 | |
| AP008955.1 | Incomplete NRPS predicted | Novel polyketide | |
| AEWH00000000.1 | Incomplete NRPS predicted | Macrolactin like polyketide | |
| APIS00000000.1 | Surfactin, plipastatin B; similar to | Macrolactin like polyketide (KS = 12, DH = 5, KR = 6, ACP = 14) 45% idenity to bacillaene of |
* Sequence prediction using antiSMASH, NaPDos and NRPS/PKS substrate predictor tools, peptides in bold are predicted novel peptides, monomers in both bold and underline differ from described metabolites in that position (in case of polyketides they differ in number and maybe in the order of domains); monomers in underline are known variants, previously described. B. subtilis 3610 and B. amyloliquefaciens FZB42 are reported to produce similar bacillaene [Rebecca A. Butcher 2006, Chen 2009]. However, they differ in number of domains predicted.
Abbreviations: mal, malonyl-CoA; pk, polyketide; dab, 2,4-diaminobutyric acid; KS, ketosynthase; DH, dehydratase; MT, methyl transferase; KR, ketoreductase; orn, ornithine, nrp, unassigned non ribosomal peptide, mdap, N-methyl-diaminopropionic acid, NRPS, non-ribosomal peptide synthetase, PKS, type 1 polyketide synthase.