| Literature DB >> 27785333 |
Hisayuki Komaki1, Arisa Ishikawa2, Natsuko Ichikawa3, Akira Hosoyama3, Moriyuki Hamada1, Enjuro Harunari2, Takuya Nihira4, Watanalai Panbangred5, Yasuhiro Igarashi2.
Abstract
Streptomyces sp. MWW064 (=NBRC 110611) produces an antitumor cyclic depsipeptide rakicidin D. Here, we report the draft genome sequence of this strain together with features of the organism and generation, annotation and analysis of the genome sequence. The 7.9 Mb genome of Streptomyces sp. MWW064 encoded 7,135 putative ORFs, of which 6,044 were assigned with COG categories. The genome harbored at least three type I polyketide synthase (PKS) gene clusters, seven nonribosomal peptide synthetase (NRPS) gene clusters, and four hybrid PKS/NRPS gene clusters, from which a hybrid PKS/NRPS gene cluster responsible for rakicidin synthesis was successfully identified. We propose the biosynthetic pathway based on bioinformatic analysis, and experimentally proved that the pentadienoyl unit in rakicidins is derived from serine and malonate.Entities:
Keywords: Biosynthesis; Nonribosomal peptide synthetase; Polyketide synthase; Rakicidin; Streptomyces
Year: 2016 PMID: 27785333 PMCID: PMC5073741 DOI: 10.1186/s40793-016-0205-3
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Classification and general features of Streptomyces sp. MWW064 [13]
| MIGS ID | Property | Term | Evidence codea |
|---|---|---|---|
| Classification | Domain | TAS [ | |
| Phylum | TAS [ | ||
| Class | TAS [ | ||
| Order | TAS [ | ||
| Suborder | TAS [ | ||
| Family | TAS [ | ||
| Genus | TAS [ | ||
| Species undetermined | - | ||
| strain: MWW064 | TAS [ | ||
| Gram stain | Gram-positive | NAS | |
| Cell shape | Branched mycelia | IDA | |
| Motility | Not reported | ||
| Sporulation | Sporulating | IDA | |
| Temperature range | 15 °C to 37 °C | IDA | |
| Optimum temperature | 28 °C | IDA | |
| pH range; Optimum | 5 to 9; 7 | IDA | |
| Carbon source | D-glucose, inositol | IDA | |
| MIGS-6 | Habitat | Marine sediment | TAS [ |
| MIGS-6.3 | Salinity | 0 % to 3 % NaCl | IDA |
| MIGS-22 | Oxygen requirement | Aerobic | IDA |
| MIGS-15 | Biotic relationship | Free-living | IDA |
| MIGS-14 | Pathogenicity | Not reported | |
| MIGS-4 | Geographic location | Samut Sakhon province, Thailand | TAS [ |
| MIGS-5 | Sample collection | February 2, 2008 | NAS |
| MIGS-4.1 | Latitude | 13° 32’ 55” N | NAS |
| MIGS-4.2 | Longitude | 100° 16’ 39” E | NAS |
| MIGS-4.4 | Altitude | 8.6 m. above sea level | NAS |
a Evidence codes - IDA: Inferred from Direct Assay; TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [34]
Fig. 1Scanning electron micrograph of Streptomyces sp. MWW064 grown on 1/2 ISP 2 agar for 7 days at 28 °C. Bar, 2 μm
Fig. 2Phylogenetic tree of Streptomyces sp. MWW064 and phylogenetically close type strains, showing over 98.5 % similarity, based on 16S rRNA gene sequences. The accession numbers for 16S rRNA genes are shown in parentheses. The tree uses sequences aligned by ClustalX2 [9], and constructed by the neighbor-joining method [35]. All positions containing gaps were eliminated. The building of the tree also involves a bootstrapping process repeated 1,000 times to generate a majority consensus tree, and only bootstrap values above 50 % are shown at branching points. Streptomyces albus NBRC 13014T was used as an outgroup
Project information
| MIGS ID | Property | Term |
|---|---|---|
| MIGS 31 | Finishing quality | Improved-high-quality draft |
| MIGS-28 | Libraries used | 454 shotgun library, Illumina paired-end library |
| MIGS 29 | Sequencing platforms | 454 GS FLX+, Illumina HiSeq1000 |
| MIGS 31.2 | Fold coverage | 8.9 ×, 93.5 ×, respectively |
| MIGS 30 | Assemblers | Newbler v2.8, GenoFinisher |
| MIGS 32 | Gene calling method | Progidal |
| Locus Tag | SSP35 | |
| GenBank ID | BBUY00000000 | |
| GenBank Date of Release | February 20, 2016 | |
| GOLD ID | Not registered | |
| BIOPROJECT | PRJDB3538 | |
| MIGS 13 | Source Material Identifier | NBRC 110611 |
| Project relevance | Industrial |
Genome statistics
| Attribute | Value | % of Total |
|---|---|---|
| Genome size (bp) | 7,904,619 | 100.0 |
| DNA coding (bp) | 6,855,885 | 86.7 |
| DNA G + C (bp) | 5,597,799 | 70.8 |
| DNA scaffolds | 99 | - |
| Total genes | 7,206 | - |
| Protein coding genes | 7,135 | 99.0 |
| RNA genes | 71 | 0.99 |
| Pseudogenes | - | - |
| Genes in internal clusters | 2,610 | 36.2 |
| Genes with function prediction | 4,515 | 62.7 |
| Genes assigned to COGs | 6,044 | 83.9 |
| Genes with Pfam domains | 4,870 | 67.6 |
| Genes with signal peptides | 559 | 7.8 |
| Genes with transmembrane helices | 1,550 | 21.5 |
| CRISPR repeats | 1 | - |
Number of genes associated with general COG functional categories
| Code | Value | % age | Description |
|---|---|---|---|
| J | 279 | 4.6 | Translation, ribosomal structure and biogenesis |
| A | 4 | 0.1 | RNA processing and modification |
| K | 696 | 11.5 | Transcription |
| L | 452 | 7.5 | Replication, recombination and repair |
| B | 6 | 0.1 | Chromatin structure and dynamics |
| D | 55 | 0.9 | Cell cycle control, Cell division, chromosome partitioning |
| V | 132 | 2.2 | Defense mechanisms |
| T | 432 | 7.1 | Signal transduction mechanisms |
| M | 294 | 4.9 | Cell wall/membrane biogenesis |
| N | 33 | 0.5 | Cell motility |
| U | 95 | 1.6 | Intracellular trafficking and secretion |
| O | 223 | 3.7 | Posttranslational modification, protein turnover, chaperones |
| C | 386 | 6.4 | Energy production and conversion |
| G | 474 | 7.8 | Carbohydrate transport and metabolism |
| E | 651 | 10.8 | Amino acid transport and metabolism |
| F | 134 | 2.2 | Nucleotide transport and metabolism |
| H | 253 | 4.2 | Coenzyme transport and metabolism |
| I | 323 | 5.3 | Lipid transport and metabolism |
| P | 404 | 6.7 | Inorganic ion transport and metabolism |
| Q | 385 | 6.4 | Secondary metabolites biosynthesis, transport and catabolism |
| R | 1,032 | 17.1 | General function prediction only |
| S | 440 | 7.3 | Function unknown |
| - | 1,091 | 18.1 | Not in COGs |
The total is based on the total number of protein coding genes in the genome
Modular PKS and NRPS gene clusters in Streptomyces sp. MWW064
| Gene cluster | Encoded in | No. of modular PKS and NRPS genes | No. of modules | Backbone of predicted product |
|---|---|---|---|---|
|
| scaffold 9 | 6 | 7 | R-C3-C3-Ser-C2-Gly-X |
|
| scaffold 5 | 6 | 14 | C2-C2-C2-C2-C2-Gly-C2-C2-C2-C2-C2-C2-C2-C2 |
|
| scaffold 2 | 4 | 3 | C?-C?-X |
|
| scaffold 11 | 1 | 2 | X-C2 |
|
| scaffold 18 | 5 | 5 | C?-C3-C2-C2-C? |
|
| scaffold 23 | 1 | 1 | C? |
| other | scaffolds 11, 39, 45 | >3 | >10 | C2-C2-C3-C2, C2-C2, C2-C2-C2, C2 |
|
| scaffold 11 | 4 | 4 | X-X-Val-X |
|
| scaffold 18 | 3 | 3 | R-Val-X |
|
| scaffold 9 | 2 | 3 | R-Cys-mCys |
|
| scaffold 13 | 3 | 4 | Val-Gly-Ser-Pro |
|
| scaffold 2 | 1 | 1 | Ser |
|
| scaffold 12 | 1 | 1 | Thr |
| other | scaffolds 3, 5 | >2 | >6 | X-X-X-X-X, Cys |
anot completely sequenced. R, starter unit; C3, C3 unit derived from methylmalonyl-CoA; C2, C2 unit derived from malonyl-CoA; X, unpredictable amino acid; C?, unpredictable carbon unit derived from acyl-CoA; mCys, methylated cysteine
ORFs in the rakicidin-biosynthetic gene cluster of Streptomyces sp. MWW064
| ORF (locus tag) | Size (aa) | Deduced function | Protein homolog [origin] | Identity/similarity (%) | Accession number |
|---|---|---|---|---|---|
| SSP35_09_01970 | 243 | unknown | hypothetical protein [ | 63/73 | WP_030067339 |
| SSP35_09_01960 | 331 | transcriptional regulator | hypothetical protein DT87_01625 [ | 59/69 | KDQ65969 |
| SSP35_09_01950 | 358 | 3-oxoacyl-ACP synthase | 3-oxoacyl-ACP synthase [ | 77/87 | WP_030791445 |
| SSP35_09_01940 | 79 | ACP | phosphopantetheine-binding protein [ | 68/79 | ADI05068 |
| SSP35_09_01930 | 406 | ketosynthase | 3-oxoacyl-ACP synthase [ | 81/89 | WP_031089521 |
| SSP35_09_01920 | 146 | unknown | methylmalonyl-CoA epimerase [ | 80/87 | WP_018222873 |
| SSP35_09_01910 (RakAB) | 2,902 | PKS | hypothetical protein [ | 63/73 | WP_018385948 |
| SSP35_09_01900 (RakC) | 1,624 | PKS | non-ribosomal peptide synthetase [ | 60/70 | EWM63000 |
| SSP35_09_01890 (RakD) | 1,126 | NRPS | hypothetical protein [ | 68/78 | WP_018385946 |
| SSP35_09_01880 (RakEF) | 1,950 | PKS | hypothetical protein [ | 64/73 | WP_018385945 |
| SSP35_09_01870 (RakG) | 1,556 | NRPS | hypothetical protein, partial [ | 64/74 | WP_030498976 |
| SSP35_09_01860 (RakH) | 1,565 | NRPS | amino acid adenylation domain protein [ | 38/55 | ACC80782 |
| SSP35_09_01850 | 563 | ABC transporter | hypothetical protein [ | 62/73 | WP_030498978 |
| SSP35_09_01840 (RakL) | 251 | type-II thioesterase | hypothetical protein [ | 64/73 | WP_018385940 |
| SSP35_09_01830 (RakN) | 1,013 | NRPS | non-ribosomal peptide synthetase [ | 55/63 | EWM63010 |
| SSP35_09_01820 (RakO) | 331 | asparagine oxygenase | clavaminate synthase [ | 64/75 | WP_026235187 |
| SSP35_09_01810 | 809 | unknown | penicillin amidase [ | 63/74 | WP_026359955 |
| SSP35_09_01800 | 205 | transcriptional regulator | putative LuxR family transcriptional regulator [ | 71/81 | AIR96926 |
SSP35_09_01910, SSP35_09_01900, SSP35_09_01890, SSP35_09_01880, SSP35_09_01870, SSP35_09_01860, SSP35_09_01840, SSP35_09_01830, and SSP35_09_01820 are corresponding to RakA plus RakB (RakAB), RakC, RakD, RakE plus RakF (RakEF), RakG, RakH, RakL, RakN, and RakO, previously reported in the reference [8], respectively, and SSP35_09_01940 may possibly be corresponding to RakI
Fig. 3Genetic map of rakicidin biosynthetic gene cluster of Streptomyces sp. MWW064 and the biosynthetic mechanism of rakicidin D
Incorporation of 13C-labeled precursors into rakicidin D
| Position | δC | Relative enrichmentsa | |
|---|---|---|---|
| [2-13C]acetate | [1-13C]-L-serine | ||
| 1 | 169.2 | 0.89 | 1.58 |
| 2 | 54.9 | 1.14 | 1.19 |
| 3 | 72.5 | 1.57 | 1.07 |
| 4 | 172.7 | 1.31 | 1.95 |
| 5 | 167.6 | 0.68 |
|
| 6 | 52.5 | 0.77 | 1.15 |
| 7 | 36.5 | 1.00 | 1.00 |
| 8 | 166.0 | 0.77 | 1.43 |
| 9 | 118.8 | 3.46 | 1.54 |
| 10 | 138.4 | 1.03 |
|
| 11 | 137.9 | 0.95 | 0.67 |
| 12 | 117.1 | 1.02 | 1.33 |
| 13 | 172.5 | 1.61 | 1.26 |
| 14 | 41.7 | 1.86 | 1.36 |
| 15 | 78.1 | 1.25 | 1.21 |
| 16 | 33.9 | 1.91 | 1.39 |
| 17 | 32.8 | 1.30 | 1.55 |
| 18 | 27.0 |
| 1.26 |
| 19 | 28.9 | 1.15 | 1.58 |
| 20 | 31.3 |
| 1.43 |
| 21 | 22.1 | 1.05 | 1.28 |
| 22 | 14.0 |
| 1.61 |
| 23 | 15.4 | 1.51 | 1.07 |
| 24 | 13.3 | 1.75 | 1.32 |
a 13C signal intensity of each peak in the labeled 1 divided by that of the corresponding signal in the unlabeled 1, respectively, normalized to give an enrichment ratio of 1 for the unenriched peak of C7. The numbers in bold type indicate 13C-enriched atoms from 13C-labeled precursors
Fig. 4Incorporation of 13C-labeled precursors into rakicidin D