| Literature DB >> 25635820 |
Abstract
Streptomyces spp. are robust producers of medicinally-, industrially- and agriculturally-important small molecules. Increased resistance to antibacterial agents and the lack of new antibiotics in the pipeline have led to a renaissance in natural product discovery. This endeavor has benefited from inexpensive high quality DNA sequencing technology, which has generated more than 140 genome sequences for taxonomic type strains and environmental Streptomyces spp. isolates. Many of the sequenced streptomycetes belong to the same species. For instance, Streptomyces albus has been isolated from diverse environmental niches and seven strains have been sequenced, consequently this species has been sequenced more than any other streptomycete, allowing valuable analyses of strain-level diversity in secondary metabolism. Bioinformatics analyses identified a total of 48 unique biosynthetic gene clusters harboured by Streptomyces albus strains. Eighteen of these gene clusters specify the core secondary metabolome of the species. Fourteen of the gene clusters are contained by one or more strain and are considered auxiliary, while 16 of the gene clusters encode the production of putative strain-specific secondary metabolites. Analysis of Streptomyces albus strains suggests that each strain of a Streptomyces species likely harbours at least one strain-specific biosynthetic gene cluster. Importantly, this implies that deep sequencing of a species will not exhaust gene cluster diversity and will continue to yield novelty.Entities:
Mesh:
Year: 2015 PMID: 25635820 PMCID: PMC4312078 DOI: 10.1371/journal.pone.0116457
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1An approximately maximum likelihood phylogenetic tree of sequenced Streptomyces species.
A phylogeny was inferred for Mycobacterium tuberculosis and 120 sequenced streptomycetes based on concatenated partial sequences of aptD-gyrB-recA-rpoB. FastTree local support values (expressed as a whole number) are indicated at each node. Streptomyces albus strains are highlighted in red. The scale bar indicates 5% estimated sequence divergence.
Accessions and genomic features of Streptomyces albus strains.
|
|
|
|
|
|
|
|---|---|---|---|---|---|
|
| NC_020990 | 6.83 | Closed | Unknown | [ |
|
| ASHE00000000 | 7.10 | 20 | Nordic fjords | [ |
|
| ASHF00000000 | 7.22 | 34 | Nordic fjords | [ |
|
| AEWS00000000 | 6.46 | 7,074 | Baltic Sea sediment | [ |
|
| AMPN00000000 | 7.15 | 513 | North Sea sponges | [ |
|
| ARDM00000000 | 7.00 | 36 | Plant-ants | [ |
|
| CADY00000000 | 7.61 | 269 | Leaf-cutting ants | [ |
Pairwise comparison of gene clusters encoding putative secondary metabolites from Streptomyces albus strains.
|
|
|
|
|
|
|
| |
|---|---|---|---|---|---|---|---|
|
| 25 (100%) | 20 (80%) | 19 (76%) | 20 (80%) | 23 (92%) | 21 (84%) | 2 (8%) |
|
| 30 (100%) | 28 (93%) | 20 (66%) | 21 (70%) | 21 (70%) | 1 (3%) | |
|
| 29 (100%) | 19 (65%) | 20 (68%) | 20 (68%) | 1 (3%) | ||
|
| 26 (100%) | 21 (80%) | 21 (80%) | 3 (11%) | |||
|
| 27 (100%) | 22 (81%) | 3 (11%) | ||||
|
| 28 (100%) | 6 (21%) |
The percentage in braces reflects the total number of gene clusters conserved in the pairwise comparison with respect to the strains listed vertically.
The core secondary metabolome of Streptomyces albus.
|
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|
| Hopene / Squalene synthase | Hopanoids | 308626..335220 | 647179..673767 | 608447..635035 | Multiple contigs | NZ_KB890705.1 522286..548874 | CADY01000122.1 571..33079 |
| NRPS-independent siderophore synthase | Desferrioxamine | 4740450..4752270 | 4783002..4794822 | 4778566..4790386 | NZ_AMPN01000107.1 27451..38705 | NZ_KB890704.1 497713..509533 | CADY01000052.1 71864..83684 |
| NRPS-independent siderophore synthase | Aerobactin-like | 1268164..1283196 | 1461319..1476318 | 1419952..1434951 | Multiple contigs | NZ_KB890708.199818..114849 | CADY01000147.1 1..13852 |
| Ectoine synthase | Ectoine | 5635346..5645744 | 5646629..5657027 | 5645248..5655646 | NZ_AMPN01000262.1 1748..15421 | NZ_KB890727.1 65721..76119 | CADY01000033.1 3246..13644 |
| Phytoene / polyprenyl synthetase | Carotenoids | 6401161..6430221 | 6435309..6461258 | 6426764..6452713 | NZ_AMPN01000036.1 9070..37323 | NZ_KB890733.1 196543..229868 | CADY01000098.1 59202..91885 |
| Terpene synthase | Geosmin | 1531759..1554059 | 1713992..1736292 | 1672644..1694944 | NZ_AMPN01000357.1 6256..26511 | NZ_KB890732.1 367671..389971 | CADY01000157.1 6106..28406 |
| Terpene | Albaflavenone | 1865241..1887220 | 2122786..2144075 | 2082541..2103830 | NZ_AMPN01000386.1 17700..38989 | NZ_KB890732.1 22700..43989 | CADY01000162.1 47721..69010 |
| Type III PKS | Tetrahydroxynapthalene | 6520374..6561471 | 6555445..6596542 | 6545260..6586357 | NZ_AMPN01000015.1 1..12125 | NZ_KB890733.1 61315..102412 | CADY01000095.1 47527..88624 |
| Type I PKS | Candicidin | 6566408..6721648 | 6586760..6741995 | 6576564..6731817 | Multiple contigs | Multiple contigs | Multiple contigs |
| Type I PKS | Unknown | 6776680..6838639 | 6797026..6858963 | 6786847..6849240 | Multiple contigs | NZ_KB890710.1 1..60636 | Multiple contigs |
| Hybrid NRPS / PKS | Antimycin | 6730563..6755198 | 6750919..6775544 | 6740741..6765365 | NZ_AMPN01000430.1 1.7480 | Mutiple contigs | CADY01000091.1 16873..41495 |
| Hybrid NRPS / PKS | Alteramide | 224752..275005 | 560646..610065 | 522971..572390 | NZ_AMPN01000189.1 1..20256 | NZ_KB890705.1 438436..488689 | CADY01000120.1 1..40866 |
| Hybrid NRPS / PKS | Unknown | 6755272..6776675 | 6775619..6797021 | 6765440..6786842 | Multiple contigs | NZ_KB890710.1 60641..82030 | CADY01000091.1 1..16795 |
| NRPS | Gramicidin-like | 3877105..3982798 | 3970595..4076299 | 3946534..4052238 | Multiple contigs | Multiple contigs | CADY01000205.1 1..90613 |
| NRPS | Unknown | 4469477..4514441 | 4546088..4590455 | 4542814..4587181 | NZ_AMPN01000006.1 1..22395 | NZ_KB890704.1 738253..783217 | CADY01000068.1 1..34880 |
| Bacteriocin | Unknown | 415649..425903 | 740656..750895 | 701944..712183 | NZ_AMPN01000269.1 2352..8553 | NZ_KB890705.1 614326..624580 | CADY01000124.1 50376..61254 |
| Bacteriocin | Unknown | 879961..891289 | 1167101..1178414 | 1125708..1137021 | NZ_AMPN01000026.1 11838..21948 | NZ_KB890709.1 5759..17087 | CADY01000132.1 5648..13695 |
| Lantipeptide | SapB | 2713288..2735999 | 2912898..2935531 | 2877890..2900523 | NZ_AMPN01000379.1 313..18317 | NZ_KB890715.1 87381..116434 | CADY01000176.1 1574..29018 |
*denotes genomic coordinates were edited manually
** denotes a gene cluster which was not annotated by antiSMASH 2.0
***denotes a gene cluster which is spread over multiple contigs (refer to S2 Table).
Auxiliary secondary metabolites produced by Streptomyces albus.
|
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|
| Hybrid NRPS / PKS | Unknown | 3011..61711 | 288401..349562 | — | NZ_AMPN01000108.1 1..17827 | NZ_KB890705.1 211618..272818 | Multiple contigs |
| NRPS | Unknown | 1136316..1199422 | — | — | — | Multiple contigs | — |
| Lantipeptide | Unknown | 2376688..2409159 | 2603848..2636308 | 2566778..2599136 | — | NZ_KB890736.1 70394..102866 | CADY01000169.1 12631..53654 |
| NRPS | Unknown | 3553726..3604015 | — | — | Multiple contigs | NZ_KB890716.1 353451..403740 | Multiple contigs |
| NRPS | Indigoidin | 6336942..6381213 | — | — | — | NZ_KB890733.1 244920..289191 | — |
| Terpene | Unknown | — | 274831..297419 | 284406..306988 | NZ_AMPN01000169.1 36623..53502 | NZ_KB890705.1 195282..217864 | CADY01000116.1 145638..168220 |
| Hybrid NRPS / PKS | Kijanimycin-like | — | 397572..502620 | 366365..471413 | — | — | — |
| Type I PKS | Unknown | — | 771534..818373 | 731491..778330 | — | — | — |
| NRPS | Unknown | — | 3152743..3211020 | 3129401..3187816 | — | — | — |
| NRPS | Unknown | — | 3750249..3812487 | 3726395..3788633 | — | — | — |
| NRPS | Unknown | — | 4898187..4960375 | 4891468..4953657 | — | — | — |
| NRPS | Unknown | 1..49621 (6957897..7007517) | 2794..56173 (6969866..7023245) | ||||
| Type I PKS-butyrolactone | Unknown | 30737..134245 (6888273..6976781) | 37289..140797 (6900242..6988750) | ||||
| Type II PKS | Enterocin | — | 6259612..6302316 | 6251573..6294277 | — | — | — |
* Denotes a gene cluster harboured in duplicate; genomic coordinates for the additional copy are provide in braces.
**Denotes a gene cluster spread over multiple contigs, which are presented in S2 Table.
Strain-specific gene clusters encoded by Streptomyces albus.
|
|
|
|
|
|---|---|---|---|
|
| |||
| NRPS / Oligosaccharide | Paulomycin | 684407..718548 |
|
| Bacteriocin | Unknown | 2560714..2571226 |
|
|
| |||
| Lantipeptide | Unknown | 1862908..1885125 |
|
|
| |||
| Other | Unknown | CM002272.1 609..41190 |
|
|
| |||
| Type II PKS | Unknown | NZ_AMPN01000020.1 1..23214 |
|
| Butyrolactone | Unknown | NZ_AMPN01000075.1 188..10692 |
|
| Bacteriocin | Unknown | NZ_AMPN01000145.1 80715..91067 |
|
|
| |||
| NRPS | Unknown | NZ_KB890705.1 132033..188388 |
|
| NRPS | Unknown | NZ_KB890711.1 46489..98964 |
|
| Type I PKS | Unknown | NZ_KB890711.1 107544..132946 NZ_KB890725.1 1..5945 |
|
|
| |||
| Other | Unknown | CADY01000053.1 1..9842 |
|
| Type I PKS / Type III PKS | Kendomycin | CADY01000062.1 1..35064 |
|
| NRPS | Mannopeptimycin-like | CADY01000178.1 54079..109040 |
|
| Butyrolactone | Unknown | CADY01000186.1 1..29569 |
|
| Bacteriocin | Unknown | CADY01000195.1 1..2452 | — |
| Type II PKS | Fredericamycin | CADY01000200.1 57212..86518 |
|
*Denotes a gene cluster encoded on a plasmid
**Denotes a gene cluster composed of partial antiSMASH gene clusters that likely represent a single cluster according to NaPDoS analysis
***AntiSMASH did not report a cumulative MultiGeneBlast score, because the gene cluster is spread over multiple contigs
Figure 2Diagrammatic workflow of the NUCmer approach used to piece together a biosynthetic gene clusters spread over more than one contig.
NUCmer is part of the MUMmer [55] and can be downloaded from http://sourceforge.net/projects/mummer/. NUCmer will align contigs from draft genomes to an intact gene cluster with high shared nucleotide identity. Commands used to perform an analysis of this type are given. Black arrows represent a biosynthetic gene cluster; black and red lines represent contigs in a draft genome sequence.