| Literature DB >> 33739403 |
Abstract
The cyclic lipopeptide antibiotics structurally related to daptomycin were first reported in the 1950s. Several have common lipopeptide initiation, elongation, and termination mechanisms. Initiation requires the use of a fatty acyl-AMP ligase (FAAL), a free-standing acyl carrier protein (ACP), and a specialized condensation (CIII) domain on the first NRPS elongation module to couple the long chain fatty acid to the first amino acid. Termination is carried out by a dimodular NRPS that contains a terminal thioesterase (Te) domain (CAT-CATTe). Lipopeptide BGCs also encode ABC transporters, apparently for export and resistance. The use of this mechanism of initiation, elongation, and termination, coupled with molecular target-agnostic resistance, has provided a unique basis for robust natural and experimental combinatorial biosynthesis to generate a large variety of structurally related compounds, some with altered or different antibacterial mechanisms of action. The FAAL, ACP, and dimodular NRPS genes were used as molecular beacons to identify phylogenetically related BGCs by BLASTp analysis of finished and draft genome sequences. These and other molecular beacons have identified: (i) known, but previously unsequenced lipopeptide BGCs in draft genomes; (ii) a new daptomycin family BGC in a draft genome of Streptomyces sedi; and (iii) novel lipopeptide BGCs in the finished genome of Streptomyces ambofaciens and the draft genome of Streptomyces zhaozhouensis.Entities:
Keywords: zzm321990 Actinoplaneszzm321990 ; zzm321990 Saccharomonosporazzm321990 ; zzm321990 Streptomyceszzm321990 ; A54145; Actinomycete; Amphomycin; CDA; Combinatorial biosynthesis; Daptomycin; Friulimicin; Genome mining; Glycinocin; Malacidin; Parvuline; Taromycin; Telomycin
Mesh:
Substances:
Year: 2021 PMID: 33739403 PMCID: PMC9113097 DOI: 10.1093/jimb/kuab020
Source DB: PubMed Journal: J Ind Microbiol Biotechnol ISSN: 1367-5435 Impact factor: 4.258
Genome and Lipopeptide BGC Sequence Status for Actinomycetes and Uncultured Bacteria
| Microorganism | Lipopeptide (predicted) | BGC status[ | Genome status | Reference |
|---|---|---|---|---|
| Daptomycin | Finished | Draft | Baltz ( | |
|
| Taromycin | Finished | Draft | Reynolds et al. ( |
|
| (desCl-Taromycin) | Finished | Finished | Baltz ( |
|
| A54145 | Finished | ND[ | Miao, Brost, et al. ( |
|
| (A54145) | Fragmented | Draft | Baltz ( |
|
| (A54145) | Fragmented | Draft | Baltz ( |
|
| (A54145) | Fragmented | Draft | Baltz ( |
|
| (A54145) | Fragmented | Draft | This report |
| CDA | Finished | Finished | Hojati et al. ( | |
|
| CDA | Finished | Finished | Ho et al. ( |
|
| (CDA) | Fragmented | Draft | Baltz ( |
|
| (CDA) | Fragmented | Draft | Baltz ( |
|
| Friulimicin | Finished | Finished | Müller et al. ( |
|
| Friulimicin | Finished | NA[ | Kim et al. ( |
|
| Laspartomycin[ | Finished | Draft | Wang et al. ( |
|
| (Glycinocin)[ | Draft (AS)[ | Finished | Kim et al. ( |
|
| (Glycinocin) | Draft (AS) | Finished | This report |
| (Glycinocin) | Fragmented | Draft | This report | |
| (Glycinocin) | Draft (AS) | Draft | Komaki et al. ( | |
| Malacidin | Finished | NA | Hover et al. ( | |
|
| (Malacidin) | Finished? | NA | Owen et al. ( |
|
| Telomycin | Finished | Draft | Fu et al. ( |
|
| Telomycin | Finished | Draft | Johnstone et al. ( |
|
| (Telomycin) | Fragmented | Draft | Zhang et al. ( |
|
| (Telomycin) | Draft (AS) | Finished | Holmes et al. ( |
|
| Enduracidin | Finished | ND | Yin & Zabriski ( |
|
| Amphomycin[ | Fragmented | Draft | Baltz et al. ( |
|
| (Parvuline)[ | Fragmented | Draft | Baltz et al. ( |
|
| (Lipotridecapeptide) | Draft (AS) | Finished | Aigle et al. ( |
|
| (Unknown) | Fragmented | Draft | He et al. ( |
|
| (Unknown) | Fragmented | Draft | Li et al. ( |
aNCBI Genome (https://www.ncbi.nlm.nih.gov/genome/).
bND, not done.
cNA, not assembled.
dLaspartomycin (Las) is a member of the glycinocin (Gly) family of lipopeptides that share a common peptide (Baltz et al., 2005).
eAS, draft BGC from antiSMASH 5.0 (Blin et al., 2019).
fSame as DSM 40017.
gParvuline is a member of the amphomycin family of lipopeptides that also includes A-1437, tsushimycin, and aspartocin (Baltz et al., 2005). The amphomycin family differs from friulimicins at amino acid position 1 (Asp1 in amphomycin, Asn1 in friiulimicin).
Fig. 1Structures of daptomycin and A54145 B1.
Fig. 2Organization of genes encoding lipopeptide initiation, elongation, and termination in actinomycetes. (a) Locations of genes encoding FAAL, ACP (small arrow), and NRPS subunits showing module and domain structures. ACAD family fatty acid dehydrogenases are located in the gaps between FAAL and ACP genes for taromycin, friulimicin, laspartomycin, and malacidin. Trans ATTe tri-domains interact with CT modules at position 2 for friulimicin, laspartomycin, and malacidin. (b) Fatty acid and amino acid specificities of FAAL enzymes and NRPS modules. Note that friulimicin and laspartomycin have a single double bond, whereas taromycin and malacidin have two double bonds in long chain length fatty acids.
DptE Homolog BLASTp Scores for Fatty Acyl-AMP Ligase (FAAL) Enzymes from Actinomycetes and Uncultured Bacteria
| Query protein[ | |||||||||
|---|---|---|---|---|---|---|---|---|---|
| Actinomycete | FAAL (predicted protein) | DptE | Tar4 | LptEF[ | LipA( | LipA ( | LipA (M56) | Tem18 | MlcG |
| DptE |
|
| 52 | 48 | 47 | 50 | 47 | 48 | |
| Tar4 | 46 |
| 48 | 43 | 45 | 44 | 46 | 46 | |
| (Tar4) | 47 |
| 47 | 41 | 46 | 46 | 48 | 45 | |
| LptEF | 52 | 48 |
| 48 | 47 | 48 | 48 | 48 | |
| (LptE) | 51 | 49 |
| 48 | 50 | 49 | 49 | 49 | |
| (LptE) | 51 | 48 |
| 48 | 49 | 49 | 49 | 49 | |
| (LptE) | 51 | 48 |
| 47 | 49 | 49 | 47 | 50 | |
| (LptE) | 52 | 49 |
| 48 | 48 | 49 | 47 | 50 | |
| – | – | – | – | – | – | – | – | – | |
| – | – | – | – | – | – | – | – | – | |
| – | – | – | – | – | – | – | – | – | |
| – | – | – | – | – | – | – | – | – | |
| LipA-fri | 48 | 43 | 48 |
| 59 | 61 | 53 | 52 | |
| (LipA-fri) | 46 | 43 | 48 |
| 59 | 59 | 53 | 52 | |
| (LipA-las) | 47 | 45 | 47 | 59 |
|
| 54 | 54 | |
| (LipA-gly) | 50 | 45 | 49 | 61 |
|
| 54 | 54 | |
| (LipA-gly) | 50 | 44 | 48 | 61 |
|
| 54 | 54 | |
| (LipA-gly) | 50 | 45 | 49 | 61 |
|
| 54 | 54 | |
| (LipA-gly) | 49 | 44 | 48 | 62 |
|
| 56 | 54 | |
| Tem18 | 47 | 46 | 48 | 53 | 54 | 54 |
| 55 | |
| Tlo18 = Tem18 | 47 | 46 | 48 | 53 | 54 | 54 |
| 55 | |
| (Tem18) | 47 | 46 | 48 | 53 | 54 | 55 |
| 53 | |
| Tem18 | 46 | 45 | 46 | 52 | 53 | 54 |
| 53 | |
| MlcG | 48 | 46 | 48 | 52 | 54 | 54 | 55 |
| |
| (MlcG) | 48 | 45 | 48 | 53 | 53 | 54 | 55 |
| |
| ABD65965 (ORF45}[ | 42 | 44 | 42 | 41 | 49 | 39 | 40 | 39 | |
| KUN68831[ | 45 | 45 | 48 | 62 | 77 | 76 | 57 | 55 | |
| WP_114531151[ | 48 | 45 | 47 | 61 | 74 | 75 | 57 | 54 | |
| AKZ58686 | 49 | 46 | 50 | 53 | 56 | 57 | 57 | 56 | |
| SOD64433 | 45 | 43 | 47 | 51 | 48 | 48 | 52 | 50 | |
| WP_139649592[ | 56 | 52 | 52 | 48 | 48 | 52 | 48 | 51 | |
aPossible orthologs are shown in bold.
bThese proteins share 82% sequence identities.
cOrf45 has a fused FAAL-ACAD (Yin & Zabriski, 2006).
DptF (ACP) Homolog BLASTp Scores in Actinomycetes and Uncultured Bacteria
| Query ACP protein[ | ||||||||
|---|---|---|---|---|---|---|---|---|
| Microorganism | ACP (predicted) | DptF | Tar7 | LptF[ | LipD-fri | LipD-las | Tem19 | MlcJ |
| DptF |
| 38 | 50 | 34 | 39 | 32 | 41 | |
| Tar7 | 38 |
| 34 | 41 | 38 | 28 | 42 | |
| (Tar7) | 38 |
| 34 | 41 | 39 | 27 | 45 | |
| LptEF | 50 | 34 |
| 39 | 38 | 33 | 36 | |
| (LptF) | 40 | 29 |
| 43 | 39 | 33 | 38 | |
| (LptF) | 39 | 30 |
| 46 | 42 | 28 | 39 | |
| (LptF) | 39 | 30 |
| 42 | 39 | 28 | 41 | |
| (LptF) | 41 | 31 |
| 42 | 39 | 32 | 41 | |
| LipD-fri | 34 | 41 | 39 |
| 70 | 32 | 50 | |
| (LipD-fri) | 35 | 41 | 42 |
| 71 | 35 | 51 | |
| LipD-las | 39 | 38 | 38 | 70 |
| 40 | 47 | |
| (LipD-las) | 33 | 32 | 38 | 65 |
| 37 | 44 | |
| (LipD-las) | 33 | 32 | 38 | 65 |
| 37 | 44 | |
| (LipD-las) | 33 | 32 | 38 | 65 |
| 37 | 44 | |
| (LipD-las) | 31 | 37 | 35 | 61 |
| 35 | 45 | |
| Tem19 | 32 | 28 | 29 | 33 | 41 |
| 36 | |
| Tlo19 = Tem19 | 32 | 31 | 29 | 33 | 39 |
| 36 | |
| (Tem19) | 33 | 28 | 27 | 34 | 39 |
| 36 | |
| (Tem19?) | 27 | 26 | 33 | 36 | 36 | 57 | 37 | |
| MlcI | 41 | 45 | 37 | 50 | 47 | 33 |
| |
| MlcI | 45 | 45 | 36 | 50 | 51 | 34 |
| |
|
| ABD65965 | 37 | 42 | 37 | 47 | 44 | 43 | 39 |
| KUN68828[ | 33 | 35 | 38 | 66 | 68 | 38 | 44 | |
| WP_114531148[ | 36 | 36 | 33 | 63 | 68 | 36 | 48 | |
| WP_053138555 | 34 | 36 | 30 | 49 | 55 | 37 | 49 | |
| SOD64425 | 34 | 32 | 33 | 43 | 43 | 52 | 40 | |
| WP_139649588 | 50 | 47 | 51 | 34 | 37 | 35 | 48 | |
Abbreviations: LipD-fri, ACP from friulimicin BGC; LipD-las, ACP from laspartomycin/glycinocin BGC.
aPossible orthologs are shown in bold.
bLptF, amino acids 651–732 of LptEF.
cThese proteins share 78% sequence identities.
DptF (ACP) Multiprobe Codes
| Microorganism | Lipopeptide (predicted) | DptF ACP homolog | ACP code |
|---|---|---|---|
|
| |||
| Daptomycin | AAX31556 | ||
| Taromycin | WP_024877508 | 2- | |
| (desCl-Taromycin) | WP_082002416 | 2- | |
| A54145 | AAZ23074 | 3-22- | |
| (A54145) | WP_037635382 | 3-23- | |
| (A54145) | WP_051751218 | 3-22- | |
| (A54145) | WP_093851742 | 3-33- | |
| (A54145) | WP_101254957 | 3-33- | |
| Friulimicin | WP_023362358 | 3-33-33333- | |
| Friulimicin | ADK54908 | 3-33-33333- | |
| Laspartomycin[ | AEF16024 | 3-33-33333- | |
| (Glycinocin)[ | WP_099016109 | 3-33-33333- | |
| (Glycinocin) | WP_097235610 | 2-33-22222- | |
| Malacidin | ARU08072 | 3-33-33333- | |
| (Malacidin) | AGS49326 | 3-33-33333- | |
| Telomycin | AKQ13294 | 3-12-12111-22-333-33- | |
| Telomycin | WP_059298593 | 3-12-12111-22-333-33- | |
| (Telomycin) | WP_062930001 | 3-12-12111-22-333-33- | |
| (Telomycin?) | WP_098240746 | 3-12-23322-33-333-33- | |
|
| |||
| Enduracidin | ABD65955 | 3-33-23333-33-333-33-3333 | |
| Glycinocin | AUA09014 | 3-33-33333- | |
| Amphomycin | KUN68828 | 3-33-33333- | |
| Parvuline? | RDD86143 | 3-33-33333- | |
| (Lipotridecapeptide) | WP_053138555 | 3-33-23333-3 | |
| (Unknown) | SOD64425 | 3-22-12222-33-333-33-3333 | |
| (Unknown) | WP_139649588.1 | 3- | |
aLaspartomycin is a member of the glycinocin family.
Acyl CoA Dehydrogenase Superfamily (ACAD) BLASTp Scores for Actinomycetes and Uncultured Bacteria
| Query protein[ | |||||||
|---|---|---|---|---|---|---|---|
| Actinomycete or uncultured bacterium | Fatty acid dehydrogenase | Tar5 | MlcH | Fri (LipB) | Las (Orf22) | Tar6 | MlcI |
| Tar5 |
| 48 | 45 | 46 | 32 | 30 | |
| (Tar5) |
| 50 | 46 | 48 | 31 | 30 | |
| MlcH | 48 |
| 53 | 50 | 34 | 30 | |
| MlcH | 50 |
| 53 | 52 | 33 | 30 | |
| LipA-Fri | 46 | 53 |
| 60 | 31 | 29 | |
| LipA-Fri | 45 | 54 |
| 60 | 32 | 28 | |
| LipA-Las | 46 | 51 | 60 |
| 32 | 27 | |
| (LipA-Las/Gly) | 46 | 51 | 59 |
| 30 | 29 | |
| (LipA-Las/Gly) | 46 | 51 | 59 |
| 30 | 29 | |
| (LipA-Las/Gly) | 44 | 49 | 57 | 65 | 29 | 28 | |
| WP_059206888[ | 49 | 51 | 65 | 68 | 35 | 31 | |
| WP_114531150[ | 48 | 53 | 64 | 67 | 35 | 31 | |
| AKZ58687 | 51 | 54 | 50 | 51 | 35 | 29 | |
| WP_139649592 | 54 | 50 | 48 | 46 | 35 | - | |
| Tar6 | 32 | 34 | 31 | 30 |
| 48 | |
| (Tar6) | 33 | 33 | 34 | 31 |
| 48 | |
| MlcI | 30 | 30 | 29 | 27 | 48 |
| |
| MlcI | 29 | 29 | 28 | 27 | 46 |
| |
| AKZ58688 | 31 | 35 | 31 | 31 | 50 | 56 | |
| WP_139649590 | 33 | 36 | 29 | 34 | 52 | 48 | |
aPossible orthologs are shown in bold.
bThese proteins share 78% sequence identities.
DptD NRPS (CAT-CATTe) BLASTp Scores in Select Actinomycetes and Uncultured Bacteria
| Query protein[ | |||||||||
|---|---|---|---|---|---|---|---|---|---|
| Actinomycete or uncultured bacterium | DptD homolog (predicted) | DptD | Tar10 | LptD | PSIII | PstD | LpmD | Tem22 | MlcM |
| DptD |
| 55 | 53 | 52 | 49 | 48 | 51 | 46 | |
| Tar10 | 56 |
| 50 | 50 | 47 | 47 | 49 | 45 | |
| (DptD) | 56 |
| 50 | 51 | 48 | 48 | 47 | 46 | |
| LptD | 53 | 50 |
| 50 | 49 | 49 | 46 | 46 | |
| (LptD) | 53 | 50 |
| 50 | 49 | 49 | 46 | 46 | |
| (LptD) | 53 | 50 |
| 50 | 49 | 49 | 46 | 46 | |
| (LptD) | 52 | 50 |
| 50 | 48 | 48 | 47 | 46 | |
| (LptD) | 52 | 50 |
| 49 | 48 | 49 | 46 | 46 | |
| PSIII | 52 | 50 | 50 |
| 49 | 50 | 49 | 48 | |
| PSIII | 52 | 50 | 50 |
| 49 | 50 | 48 | 48 | |
| (PSIII) | 52 | 50 | 50 |
| 49 | 50 | 48 | 48 | |
| (PSIII) | 49 | 46 | 48 |
| 49 | 50 | 49 | 46 | |
| PstD | 49 | 47 | 49 | 49 |
| 64 | 54 | 51 | |
| PstD | 48 | 47 | 49 | 49 |
| 64 | 53 | 51 | |
| LpmD | 49 | 47 | 49 | 50 | 64 |
| 54 | 51 | |
| (LpmD} | 50 | 49 | 49 | 51 | 65 |
| 55 | 51 | |
| (LpmD) | 50 | 49 | 49 | 51 | 65 |
| 55 | 51 | |
| (LpmD) | 50 | 49 | 49 | 51 | 65 |
| 55 | 51 | |
| (LpmD) | 49 | 47 | 49 | 51 | 65 |
| 54 | 52 | |
| Tem22 | 48 | 49 | 48 | 48 | 54 | 54 |
| 48 | |
| Tlo22 = Tem22 | 48 | 48 | 48 | 48 | 54 | 54 |
| 48 | |
| (Tem22} | 48 | 47 | 46 | 48 | 53 | 53 |
| 49 | |
| (Tem22) | 48 | 46 | 45 | 48 | 52 | 52 |
| 48 | |
| MlcM | 47 | 45 | 46 | 48 | 51 | 51 | 48 |
| |
| (MlcM) | 47 | 45 | 46 | 47 | 51 | 51 | 49 |
| |
| WP_059209689[ | 51 | 48 | 50 | 52 | 67 | 74 | 54 | 53 | |
| WP_114529432[ | 49 | 49 | 50 | 52 | 67 | 72 | 55 | 53 | |
| AKZ258685 | 47 | 47 | 46 | 48 | 47 | 47 | 47 | 45 | |
| SOD64429 | 53 | 51 | 48 | 51 | 50 | 50 | 48 | 46 | |
| WP_139647437 | 65 | 57 | 54 | 53 | 49 | 49 | 48 | 47 | |
aPossible orthologs are shown in bold.
bThese proteins share 86% sequence identities.
DptD Homolog Binding Pockets in Select Actinomycetes and Uncultured Bacteria
| Actinomycete or uncultured bacterium | DptD homolog (predicted) | Binding pocket amino acid 1 | Specificity amino acid 1 | Binding pocket amino acid 2 | Specificity amino acid 2 |
|---|---|---|---|---|---|
| DptD | DLGKTGVINK | 3mGlu | DAWTTTGVGK | Kyn | |
| Tar10 | DLGKTGVVNK | 3mGlu | DAWTTTGVAK | Kyn | |
| (Tar10) | DLGKTGVVNK | 3mGlu | DAWTTTGVGK | Kyn | |
| LptD | DLGKTGVVNK | 3mGlu | DGLFVGIAVK | Ile/Val | |
| (LptD) | DLGKTGVVNK | 3mGlu | DGLFVGIAVK | Ile/Val | |
| (LptD) | DLGKTGVVNK | 3mGlu | DGLFVGIAVK | Ile/Val | |
| (LptD) | DLGKTGVVNK | 3mGlu | DGLFVGIAVK | Ile/Val | |
| (LptD) | DLGKTGVVNK | 3mGlu | DGLFVGIAVK | Ile/Val | |
| PSIII | DQGKTGVGHK | 3mGlu | DGWAVASVCK | Trp | |
| PSIII | DQGKTGVGHK | 3mGlu | DGWAVASVCK | Trp | |
| (PSIII) | DQGKTGVGHK | 3mGlu | DGWAVASVCK | Trp | |
| (PSIII) | DQGKTGVGHK | 3mGlu | DGWAVASVCK | Trp | |
| PstD | DAYWWGGTFK | Val | DVQYVAHVVK | Pro | |
| PstD | DAYWWGGTFK | Val | DVQYVAHVVK | Pro | |
| LpmD | DAYFWGVCFK | Val | DVQYIAHVVK | Pro | |
| (LpmD} | DAYFWGVTFK | Val | DVQYVAHVVK | Pro | |
| (LpmD) | DAYFWGVTFK | Val | DVQYVAHVVK | Pro | |
| (LpmD) | DAYFWGVTFK | Val | DVQYVAHVVK | Pro | |
| (LpmD) | DAYWWGGTFK | Val | DVQYVAHVVK | Pro | |
| Tem22 | DAYLTGLMNK | Ile | DVQFVSQVMK | Pro | |
| Tlo22 = Tem22 | DAYLTGLMNK | Ile | DVQFVSQVMK | Pro | |
| (Tem22} | DAYLTGLMNK | Ile | DVQFVSQVMK | Pro | |
| (Tem22) | DAYLTGLMNK | Ile | DVQFVAQVVK | Pro | |
| MlcM | DAFWLGGTFK | Val | DVQYVGHAIK | Pro | |
| (MlcM) | DAFWLGGTFK | Val | DVQYVGHAIK | Pro | |
| WP_059209689 | DAYWWGGTKF | Val | DVQYVAHVVK | Pro | |
| WP_114529432 | DAYWWGGTFK | Val | DVQYVAHVVK | Pro | |
| AKZ258685 | DFWNVGMVHK | Thr | DIYHLGLLCK | Hpg | |
| SOD64429 | DHTKIGAITKINK | Asp | DAWTTTGVAK | Kyn | |
| WP_139647437 | DLGKTGVINK | 3mGlu | DAWTTTGVGK | Kyn |
DptI Homolog BLASTp Scores in Actinomycetes
| Query protein[ | ||||||
|---|---|---|---|---|---|---|
| Actinomycete | DptI homolog | DptI | Tar13 | (DptI-ss) | LptI | GlmT |
|
| DptI |
|
|
| 38 | 37 |
| Tar13 |
|
|
| 38 | 38 | |
| (DptI-sv)[ |
|
|
| 39 | 40 | |
|
| (DptI-ss) |
|
|
| 42 | 38 |
| LptI | 38 | 38 | 42 |
| 32 | |
|
| (LptI) | 38 | 38 | 40 |
| 34 |
|
| (LptI) | 37 | 40 | 40 |
| 35 |
|
| (LptI) | 41 | 41 | 41 |
| 39 |
|
| (LptI) | 46 | 44 | 44 |
| 43 |
|
| GlmT | 37 | 39 | 38 | 32 |
|
|
| GlmT | 37 | 39 | 38 | 32 |
|
| (GlmT) | 37 | 40 | 36 | 36 |
| |
| (GlmT) | 38 | 40 | 37 | 35 |
| |
aProteins most highly related to each other are shown in bold.
b( ), assigned bioinformatically. All others assigned functionally, and supported by chemical structures.
Fig. 3Organization of lipopeptide BGC genes involved in initiation, elongation, and termination of a cryptic lipopeptide encoded by S. ambofaciens (). The gene organization and NRPS subunit structures are distantly related to those of daptomycin, taromycin, and A54145, all of which encode tridecapeptides. The amino acid binding specificities differ substantially, however. The Ca2+-binding tetrapeptides of daptomycin, taromycin, A54145, laspartomycin are shown in bold italics. The amino acid regions that could be converted to canonical Ca2+-binding sites by insertion of a single Asp or Gly module for Sambo-LP or malacidin, respectively, are shown in bold italics in parentheses.