| Literature DB >> 26380643 |
Hisayuki Komaki1, Natsuko Ichikawa2, Akira Hosoyama2, Nobuyuki Fujita2, Yasuhiro Igarashi3.
Abstract
Streptomyces sp. TP-A0598, isolated from seawater, produces lydicamycin, structurally unique type I polyketide bearing two nitrogen-containing five-membered rings, and four congeners TPU-0037-A, -B, -C, and -D. We herein report the 8 Mb draft genome sequence of this strain, together with classification and features of the organism and generation, annotation and analysis of the genome sequence. The genome encodes 7,240 putative ORFs, of which 4,450 ORFs were assigned with COG categories. Also, 66 tRNA genes and one rRNA operon were identified. The genome contains eight gene clusters involved in the production of polyketides and nonribosomal peptides. Among them, a PKS/NRPS gene cluster was assigned to be responsible for lydicamycin biosynthesis and a plausible biosynthetic pathway was proposed on the basis of gene function prediction. This genome sequence data will facilitate to probe the potential of secondary metabolism in marine-derived Streptomyces.Entities:
Keywords: Biosynthetic gene; Lydicamycin; Polyketide synthase; Streptomyces; TPU-0037
Year: 2015 PMID: 26380643 PMCID: PMC4572681 DOI: 10.1186/s40793-015-0046-5
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Fig. 1Chemical structures of lydicamycin and its congeners produced by Streptomyces sp. TP-A0598
Fig. 2Scanning electron micrograph of Streptomyces sp. TP-A0598 grown on ten-fold diluted ISP 2 medium agar for 11 days at 28 °C. Bar, 5 μm
Classification and general features of Streptomyces sp. TP-A0598
| MIGS ID | Property | Term | Evidence codea |
|---|---|---|---|
| Classification | Domain | TAS [ | |
| Phylum | TAS [ | ||
| Class | TAS [ | ||
| Order | TAS [ | ||
| Suborder | TAS [ | ||
| Family | TAS [ | ||
| Genus | TAS [ | ||
| Species | TAS [ | ||
| Strain TP-A0598 | TAS [ | ||
| Gram stain | Not tested, likely positive | NAS | |
| Cell shape | Branched mycelia | TAS [ | |
| Motility | Not reported | ||
| Sporulation | Sporulating | TAS [ | |
| Temperature range | Grows from 15 °C to 37 °C | IDA | |
| Optimum temperature | 30 °C | IDA | |
| pH range; Optimum | 5-9; 7 | IDA | |
| Carbon source | D-glucose, sucrose, inositol, L-rhamnose, D-mannitol, D-raffinose, D-fructose, L-arabinose, D-xylose | TAS [ | |
| MIGS-6 | Habitat | Marine | TAS [ |
| MIGS-6.3 | Salinity | Grows from 0 % to 7 % NaCl | IDA |
| MIGS-22 | Oxygen requirement | Aerobic | TAS [ |
| MIGS-15 | Biotic relationship | Free-living | TAS [ |
| MIGS-14 | Pathogenicity | Not reported | |
| MIGS-4 | Geographic location | 2,600 meters off the shore at Namerikawa, Toyama, Japan | TAS [ |
| MIGS-5 | Sample collection | Not reported | |
| MIGS-4.1 | Latitude | Not reported | |
| MIGS-4.2 | Longitude | Not reported | |
| MIGS-4.4 | Attitude | −321 m | TAS [ |
aEvidence codes – IDA: Inferred from Direct Assay; TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are the Gene Ontology project [26]
Fig. 3Phylogenetic tree highlighting the position of Streptomyces sp. TP-A0598 relative to phylogenetically close type strains within the genus Streptomyces. The strains and their corresponding GenBank accession numbers for 16S rRNA genes are shown in parentheses. The tree uses sequences aligned by ClustalX2 [8], and constructed by the neighbor-joining method [27]. All positions containing gaps were eliminated. The building of the tree also involves a bootstrapping process repeated 1000 times to generate a majority consensus tree [28], and only bootstrap values above 50 % are shown at branching points. Kitasatospora setae [29] was used as an outgroup
Project information
| MIGS ID | Property | Term |
|---|---|---|
| MIGS 31 | Finishing quality | Improved-high-quality draft |
| MIGS-28 | Libraries used | 454 shotgun library, Illumina pair-end library |
| MIGS 29 | Sequencing platforms | 454 GS FLX+, Illumina HiSeq1000 |
| MIGS 31.2 | Fold coverage | 8.4 ×, 93 ×, respectively |
| MIGS 30 | Assemblers | Newbler v2.6 |
| MIGS 32 | Gene calling method | Progidal v2.6 |
| Locus Tag | TPA0598 | |
| GenBank ID | BBNO00000000 | |
| GenBank Date of Release | January 6, 2015 | |
| GOLD ID | Not registered | |
| BIOPROJECT | PRJDB3150 | |
| MIGS 13 | Source Material Identifier | NBRC 110027 |
| Project relevance | Industrial |
Genome statistics
| Attribute | Value | % of Total |
|---|---|---|
| Genome size (bp) | 8,319,549 | 100.0 |
| DNA coding (bp) | 7,149,098 | 85.9 |
| DNA G + C (bp) | 5,915,420 | 71.0 |
| DNA scaffolds | 20 | 100.0 |
| Total genes | 7,344 | 100.0 |
| Protein-coding genes | 7,240 | 98.6 |
| RNA genes | 75 | 1.0 |
| Pseudo genes | 29 | 0.4 |
| Genes in internal clusters | 761 | 10.4 |
| Genes with functional prediction | 3,207 | 43.7 |
| Genes assigned to COGs | 4,450 | 60.6 |
| Genes with Pfam domains | 4,543 | 61.9 |
| Genes with signal peptides | 653 | 8.9 |
| Genes with transmembrane helices | 1,770 | 24.1 |
| CRISPR repeats | 5 | - |
Number of genes associated with general COG functional categories
| Code | Value | % age | Description |
|---|---|---|---|
| J | 196 | 2.70 | Translation |
| A | 2 | 0.03 | RNA processing and modification |
| K | 519 | 7.17 | Transcription |
| L | 155 | 2.14 | Replication, recombination and repair |
| B | 0 | 0.00 | Chromatin structure and dynamics |
| D | 40 | 0.55 | Cell cycle control, mitosis and meiosis |
| V | 127 | 1.75 | Defense mechanisms |
| T | 210 | 2.91 | Signal transduction mechanisms |
| M | 192 | 2.65 | Cell wall/membrane biogenesis |
| N | 0 | 0.00 | Cell motility |
| U | 34 | 0.47 | Intracellular trafficking and secretion |
| O | 138 | 1.91 | Posttranslational modification, protein turnover, chaperones |
| C | 271 | 3.74 | Energy production and conversion |
| G | 318 | 4.39 | Carbohydrate transport and metabolism |
| E | 424 | 5.86 | Amino acid transport and metabolism |
| F | 105 | 1.45 | Nucleotide transport and metabolism |
| H | 161 | 2.22 | Coenzyme transport and metabolism |
| I | 187 | 2.58 | Lipid transport and metabolism |
| P | 177 | 2.44 | Inorganic ion transport and metabolism |
| Q | 141 | 1.95 | Secondary metabolites biosynthesis, transport and catabolism |
| R | 631 | 8.72 | General function prediction only |
| S | 422 | 5.83 | Function unknown |
| - | 2,790 | 38.50 | Not in COGs |
The total is based on the total number of protein coding genes in the genome
Open reading frames in the lydicamycin biosynthetic gene cluster
| orf (locus tag) | size (aa) | proposed function | BLAST search | |
|---|---|---|---|---|
| protein homolog, | %b | |||
| TPA0598_03_00650a | 473 | acyl-CoA ligase | hypothetical protein, | 94/96 |
| TPA0598_03_00660 | 929 | LuxR family transcriptional regulator | LuxR family transcriptional regulator, | 91/94 |
| TPA0598_03_00670a | 274 | unknown | hypothetical protein, | 53/64 |
| TPA0598_03_00680 | 632 | two-component system histidine kinase | hypothetical protein, | 93/95 |
| TPA0598_03_00690 | 218 | two-compornent system response regulator | LuxR family transcriptional regulator, | 99/99 |
| TPA0598_03_00700a | 336 | transacylase | ACP S-malonyltransferase, | 89/93 |
| TPA0598_03_00710a | 123 | unknown | hypothetical protein, | 88/95 |
| TPA0598_03_00720 | 64 | unknown | hypothetical protein JCGZ_17256, | 43/54 |
| TPA0598_03_00730a | 80 | unknown | putative protein-disulfide isomerase, | 56/63 |
| TPA0598_03_00740 | 3,598 | PKS | polyketide synthase, | 58/69 |
| TPA0598_03_00750 | 7,054 | PKS | Beta-ketoacyl synthase, | 57/68 |
| TPA0598_03_00760 | 3,548 | PKS | Beta-ketoacyl synthase, | 56/67 |
| TPA0598_03_00770 | 1,846 | PKS | Beta-ketoacyl synthase, | 62/73 |
| TPA0598_03_00780 | 5,648 | PKS | polyketide synthase type I, | 58/69 |
| TPA0598_03_00790 | 3,662 | PKS | hypothetical protein, | 94/96 |
| TPA0598_03_00800 | 3,265 | PKS | polyketide synthase, | 54/66 |
| TPA0598_03_00810 | 270 | unknown | hypothetical protein, | 95/96 |
| TPA0598_03_00820 | 1,031 | NRPS | hypothetical protein, | 94/96 |
| TPA0598_03_00830 | 300 | unknown | hypothetical protein, | 96/98 |
| TPA0598_03_00840 | 1,923 | PKS | hypothetical protein, | 91/94 |
| TPA0598_03_00850a | 429 | cytochrome P450 | cytochrome P450, | 92/96 |
| TPA0598_03_00860 | 260 | unknown | membrane protein, | 45/69 |
| TPA0598_03_00870 | 253 | type-II thioesterase | hypothetical protein, | 95/97 |
| TPA0598_03_00880a | 551 | amine oxidase | amine oxidase, | 96/98 |
| TPA0598_03_00890 | 344 | transcriptional regulator | hypothetical protein, | 96/97 |
| TPA0598_03_00900a | 496 | amidase | hypothetical protein, | 94/95 |
aencoded in complementary strand, bidentity/similarity
Fig. 4Genetic map of lydicamycin biosynthetic gene cluster
Fig. 5Proposed lydicamycin synthetic pathway. a starter synthesis compared with that of ECO-02301; b chain elongation; c cyclization and modification yielding final products
Proposed mechanism to produce lydicamycin congeners
| congener | substrate of m3 | m11 | CYP450 |
|---|---|---|---|
| lydicamycin | methylmalonyl-CoA | active | involved |
| TPU-0037-A | malonyl-CoA | active | involved |
| TPU-0037-B | methylmalonyl-CoA | inactive | uninvolved |
| TPU-0037-C | malonyl-CoA | active | uninvolved |
| TPU-0037-D | methylmalonyl-CoA | active | uninvolved |