| Literature DB >> 27800124 |
Hisayuki Komaki1, Natsuko Ichikawa2, Akio Oguchi2, Moriyuki Hamada1, Enjuro Harunari3, Shinya Kodani4, Nobuyuki Fujita2, Yasuhiro Igarashi3.
Abstract
Streptomyces sp. TP-A0867 (=NBRC 109436) produces structurally complex polyketides designated alchivemycins A and B. Here, we report the draft genome sequence of this strain together with features of the organism and assembly, annotation, and analysis of the genome sequence. The 9.9 Mb genome of Streptomyces sp. TP-A0867 encodes 8,385 putative ORFs, of which 7,232 were assigned with COG categories. We successfully identified a hybrid polyketide synthase (PKS)/ nonribosomal peptide synthetase (NRPS) gene cluster that could be responsible for alchivemycin biosynthesis, and propose the biosynthetic pathway. The alchivemycin biosynthetic gene cluster is also present in Streptomyces rapamycinicus NRRL 5491T, Streptomyces hygroscopicus subsp. hygroscopicus NBRC 16556, and Streptomyces ascomycinicus NBRC 13981T, which are taxonomically highly close to strain TP-A0867. This study shows a representative example that distribution of secondary metabolite genes is correlated with evolution within the genus Streptomyces.Entities:
Keywords: Alchivemycin; Biosynthetic gene cluster; Genome mining; Polyketide synthase; Streptomyces; Taxonomy
Year: 2016 PMID: 27800124 PMCID: PMC5078962 DOI: 10.1186/s40793-016-0207-1
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Fig. 1Scanning electron micrograph of Streptomyces sp. TP-A 0867 grown on double-diluted ISP 2 agar for 7 days at 28 °C. Bar, 2 μm
Classification and general features of Streptomyces sp. TP-A0867 [12]
| MIGS ID | Property | Term | Evidence codea |
|---|---|---|---|
| Classification | Domain | TAS [ | |
| Phylum | TAS [ | ||
| Class | TAS [ | ||
| Order | TAS [ | ||
| Suborder | TAS [ | ||
| Family | TAS [ | ||
| Genus | TAS [ | ||
| Species | IDA | ||
| Subspecies | IDA | ||
| Strain TP-A0867 | [ | ||
| Gram stain | Not tested, likely positive | NAS | |
| Cell shape | Branched mycelia | IDA | |
| Motility | Not observed | IDA | |
| Sporulation | Sporulating | IDA | |
| Temperature range | Grows from 15 °C to 45 °C | IDA | |
| Optimum temperature | 40 °C | IDA | |
| pH range; Optimum | 4 to 10; 7 | IDA | |
| Carbon source | Fructose, glucose, rhamnose, sucrose, xylose | IDA | |
| MIGS-6 | Habitat | Chinese chive ( | TAS [ |
| MIGS-6.3 | Salinity | Grows from 0 % to 7 % NaCl | IDA |
| MIGS-22 | Oxygen requirement | Aerobic | IDA |
| MIGS-15 | Biotic relationship | Free-living | IDA |
| MIGS-14 | Pathogenicity | Not reported | |
| MIGS-4 | Geographic location | Toyama, Japan | TAS [ |
| MIGS-5 | Sample collection | from April to June in 1998 | TAS [ |
| MIGS-4.1 | Latitude | Not reported | |
| MIGS-4.2 | Longitude | Not reported | |
| MIGS-4.4 | Altitude | Not reported |
aEvidence codes - IDA Inferred from Direct Assay, TAS Traceable Author Statement (i.e., a direct report exists in the literature), NAS Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [44]
Fig. 2Phylogenetic tree of Streptomyces strains based on 16S rRNA gene sequences. The 16S rRNA sequences were obtained from GenBank, whose accession numbers are shown in parentheses, whereas that of Streptomyces ascomycinicus NBRC 13981T was downloaded from ‘Sequence Information’ of the NBRC Culture Catalog Search (www.nbrc.nite.go.jp/NBRC2/SequencSearchServlet?ID=NBRC&CAT=00013981&DNA=2). The tree was constructed by the neighbor-joining method [45] using sequences aligned by ClustalX2 [9]. All positions containing gaps were eliminated. The building of the tree also involves a bootstrapping process repeated 1,000 times to generate a majority consensus tree, and only bootstrap values above 50 % are shown at branching points. Streptacidiphilus albus NBRC 100918T was used as an outgroup. Strains whose genome were sequenced are boldfaced. Among the genome-sequenced strains, those harboring the putative alchivemycin biosynthetic gene cluster are shadowed in gray
Project information
| MIGS ID | Property | Term |
|---|---|---|
| MIGS 31 | Finishing quality | Improved-high-quality draft |
| MIGS-28 | Libraries used | Illumina paired-end library |
| MIGS 29 | Sequencing platforms | Illumina MiSeq |
| MIGS 31.2 | Fold coverage | 98 x |
| MIGS 30 | Assemblers | Newbler v2.6, GenoFinisher, Sequencher v5.1 |
| MIGS 32 | Gene calling method | Prodigal |
| Locus tag | TPA0867 | |
| GenBank ID | BBON00000000 | |
| GenBank date of release | March 24, 2016 | |
| GOLD ID | Not registered | |
| BioProject | PRJDB3206 | |
| MIGS 13 | Source material identifier | NBRC 109436 |
| Project relevance | Industrial |
Genome statistics
| Attribute | Value | % of Total |
|---|---|---|
| Genome size (bp) | 9,889,163 | 100.0 |
| DNA coding (bp) | 8,515,958 | 85.2 |
| DNA G + C (bp) | 7,107,274 | 71.8 |
| DNA scaffolds | 259 | - |
| Total genes | 8,453 | 100.0 |
| Protein coding genes | 8,385 | 99.2 |
| RNA genes | 68 | 0.8 |
| Pseudogenes | - | - |
| Genes in internal clusters | 3,697 | 44.1 |
| Genes with function prediction | 5,588 | 66.1 |
| Genes assigned to COGs | 7,232 | 86.2 |
| Genes with Pfam domains | 6,077 | 71.9 |
| Genes with signal peptides | 625 | 7.4 |
| Genes with transmembrane helices | 1,629 | 19.3 |
| CRISPR repeats | 3 | - |
Number of genes associated with general COG functional categories
| Code | Value | % age | Description |
|---|---|---|---|
| J | 289 | 3.4 | Translation, ribosomal structure and biogenesis |
| A | 4 | 0.04 | RNA processing and modification |
| K | 1,036 | 12.3 | Transcription |
| L | 345 | 4.1 | Replication, recombination and repair |
| B | 3 | 0.03 | Chromatin structure and dynamics |
| D | 52 | 0.6 | Cell cycle control, Cell division, chromosome partitioning |
| V | 133 | 1.58 | Defense mechanisms |
| T | 506 | 6.03 | Signal transduction mechanisms |
| M | 337 | 4.02 | Cell wall/membrane biogenesis |
| N | 43 | 0.51 | Cell motility |
| U | 85 | 1.01 | Intracellular trafficking and secretion |
| O | 211 | 2.52 | Posttranslational modification, protein turnover, chaperones |
| C | 543 | 6.48 | Energy production and conversion |
| G | 751 | 8.96 | Carbohydrate transport and metabolism |
| E | 811 | 9.67 | Amino acid transport and metabolism |
| F | 134 | 1.60 | Nucleotide transport and metabolism |
| H | 286 | 3.41 | Coenzyme transport and metabolism |
| I | 431 | 5.14 | Lipid transport and metabolism |
| P | 462 | 5.51 | Inorganic ion transport and metabolism |
| Q | 529 | 6.31 | Secondary metabolites biosynthesis, transport and catabolism |
| R | 1,383 | 16.5 | General function prediction only |
| S | 471 | 5.62 | Function unknown |
| - | 1,153 | 13.8 | Not in COGs |
The total is based on the total number of protein coding genes in the genome
ORFs in the putative alchivemycin-biosynthetic gene cluster of Streptomyces sp. TP-A0867
| TPA0867_155_ (locus tag) | Length (aa) | Deduced function | Protein homolog | Identity/similarity (%) | Accession number |
|---|---|---|---|---|---|
|
|
|
| hypothetical protein M271_21675 | 96/97 | AGP55859 |
|
|
|
| hypothetical protein M271_21670 | 95/96 | AGP55858 |
| 00290 | 197 | unknown | hypothetical protein M271_21665 | 97/99 | AGP55857 |
| 00300 | 270 | unknown | hypothetical protein M271_21660 | 95/96 | AGP55856 |
|
|
|
| hypothetical protein M271_21655 | 88/89 | AGP55855 |
|
|
|
| hypothetical protein M271_21650 | 96/97 | AGP55854 |
| 00330a | 293 | oxidoreductase | hypothetical protein M271_21645 | 98/99 | AGP55853 |
|
|
|
| hypothetical protein M271_21640 | 87/90 | AGP55852 |
|
|
|
| hypothetical protein M271_21640 [ | 90/92 | AGP55852 |
|
|
|
| type I polyketide synthase AVES 4 | 54/63 | NP_822118 |
|
|
|
| hypothetical protein M271_21625, partial | 89/91 | AGP55849 |
|
|
|
| AmphC [ | 53/64 | AAK73514 |
|
|
|
| hypothetical protein M271_21600, partial | 87/90 | AGP55844 |
|
|
|
| beta-ketoacyl synthase | 54/64 | YP_004817601 |
| 00410a | 309 | phytanoyl-CoA dioxygenase | hypothetical protein M271_21580 | 93/96 | AGP55840 |
|
|
|
| hypothetical protein M271_21575 | 91/93 | AGP55839 |
| 00430 | 426 | transcriptional regulator | helix-turn-helix domain-containing protein | 93/95 | YP_004817135 |
| 00440 | 199 | unknown | hypothetical protein | 75/80 | YP_004812903 |
| 00450 | 157 | unknown | hypothetical protein M271_33560 | 56/64 | AGP58126 |
| 00460 | 295 | phosphotransferase | aminoglycoside phosphotransferase | 71/78 | YP_004817724 |
aencoded in complementary strand. Genes shown in Fig. 4 are bold-faced
Fig. 3Genetic map of the putative alchivemycin biosynthetic gene cluster (TPA0867_155_00270 to TPA0867_155_00460) of Streptomyces sp. TP-A0867
Fig. 4Proposed alchivemycin biosynthetic pathway
Fig. 5Genetic maps of putative alchivemycin biosynthetic gene clusters of S. rapamycinicus NRRL 5491 (a M271_21675 to M271_21575), S. hygroscopicus subsp. hygroscopicus NBRC 16556 (b orf10 to orf1 in scaffold14, orf3 to orf1 in scaffold64, and orf1 to orf2 in scaffold77), and S. ascomycinicus NBRC 13981T (c orf131 to orf145 of scaffold16). n (in grey circle), these parts contained many undetermined DNA sequences; t (in grey circle), scaffold terminal because b was not obtained as single scaffold. *We manually annotated the ORF, which were longer than registered in GenBank/EMBL/DDBJ