| Literature DB >> 26122045 |
Juan Pablo Gomez-Escribano1, Jean Franco Castro2,3, Valeria Razmilic4,5, Govind Chandra6, Barbara Andrews7, Juan A Asenjo8, Mervyn J Bibb9.
Abstract
BACKGROUND: Next Generation DNA Sequencing (NGS) and genome mining of actinomycetes and other microorganisms is currently one of the most promising strategies for the discovery of novel bioactive natural products, potentially revealing novel chemistry and enzymology involved in their biosynthesis. This approach also allows rapid insights into the biosynthetic potential of microorganisms isolated from unexploited habitats and ecosystems, which in many cases may prove difficult to culture and manipulate in the laboratory. Streptomyces leeuwenhoekii (formerly Streptomyces sp. strain C34) was isolated from the hyper-arid high-altitude Atacama Desert in Chile and shown to produce novel polyketide antibiotics.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26122045 PMCID: PMC4487206 DOI: 10.1186/s12864-015-1652-8
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Putative biosynthetic gene clusters for specialised metabolites
| antiSMASH Cluster No. | antiSMASH type descriptor | Position ( | Our annotation (based on Ref.) | |
|---|---|---|---|---|
| From | To | |||
| 1 | T1pks | 99264 | 143430 | |
|
|
|
| Hygromycin A [ | |
| 2 | T1pks | 191701 | 240196 | |
| 3 | T1pks-nrps | 324784 | 392261 | |
| 4 | Nrps | 379508 | 426758 | |
| 5 | T3pks | 416888 | 458084 | |
| 6 | Bacteriocin | 572464 | 582679 | |
| 7 | Terpene | 598795 | 619823 | |
|
|
|
| Lasso-peptide 2 | |
| 8 | Nrps | 714060 | 794426 | |
| 9 | Terpene | 1056004 | 1076960 | |
| 10 | T2pks-transatpks-nrps | 1075399 | 1155931 | Halogenated polyketide [Razmilic et al.] |
| 11 |
|
|
| Chaxamycin [Castro et al.] |
| 12 | T1pks | 1497127 | 1544539 | |
| 13 | Terpene | 1624097 | 1645110 | |
| 14 | T1pks-siderophore | 1776281 | 1833813 | |
| 15 | Terpene | 1972277 | 1994487 | |
| 16 | Bacteriocin | 2013690 | 2025087 | |
| 17 | Siderophore | 2293580 | 2305424 | Highly conserved |
| 18 | Nrps-t1pks | 2668194 | 2719415 | |
| 19 | T3pks | 2937137 | 2978264 | |
| 20 |
|
|
| Albaflavenone [ |
| Not identified | 3560196 | 3564842 | Lasso-peptide 1 | |
| 21 |
|
|
| Desferrioxamine E [ |
| 22 | Melanin | 5330379 | 5340933 | |
| 23 | Amglyccycl-butyrolactone | 5385171 | 5417416 | |
| 24 | Ectoine | 6176293 | 6186691 | |
| 25 | Other | 6710095 | 6751819 | |
| 26 | T3pks | 6822979 | 6864043 | |
| 27 | T1pks | 7141058 | 7240871 | Chaxalactin [11, Castro et al.] |
| 28 | T1pks | 7355977 | 7439461 | |
| 29 | Other | 7486047 | 7529121 | |
| 30 | Terpene-t2pks | 7530162 | 7588405 | |
| 31 | Terpene | 7744176 | 7768730 | |
|
|
|
| Lasso-peptide 3 | |
Fig. 1Schematic representation of the S. leeuwenhoekii chromosome, circular plasmid pSLE1, and linear plasmid pSLE2 (incomplete sequence). The chromosome is represented as an open circle, covering only the published sequence without the duplication of the terminal inverted repeat (represented as a grey band starting at position 1). From outside to inside, the concentric circles represent: nucleotide position; Protein Coding Sequences (PCSs) on the forward strand; PCSs on the reverse strand; PCSs for putative biosynthetic genes for specialised metabolites (dark red indicates the forward strand, orange the reverse strand); the orange box shown in the fifth circle indicates the chaxamycin biosynthetic gene cluster; tRNA and rRNA genes are shown in the sixth and seventh lines, respectively, in dark blue; the eighth concentric circle shows the GC-plot (GC %, window size = 10000; base step size = 200) and the inner-most circle the GC-skew ([(G − C)/(G + C)] window size = 10000; base step size = 200), both calculated using the sequence with both TIRs, a window size of 10000 and a step size of 200 (purple and olive indicate below and above average, respectively). For pSLE1 and pSLE2, PCSs are coloured red for putative regulatory genes; green, for plasmid replication and partitioning genes; the fourth circle in shows the GC-plot and the inner-most circle the GC-skew, both calculated as for the chromosome. For pSLE1, phage-related genes are shown in orange, and the type III PKS (chalcone synthase) gene is shown in brown. For pSLE2, genes with known plasmid functions are in orange; genes annotated as mobile elements and involved in transposition are in pink; the lasso-peptide biosynthetic gene cluster is shown in dark orange. Not to scale
COG functional categories. COG (Clusters of Orthologous Genes) functional categories of chromosomal protein codding sequences identified in S. leeuwenhoekii chromosome, and from S. coelicolor for comparison (as calculated by BASys [25] for both genomes)
|
|
| |||
|---|---|---|---|---|
| COG functional categories | Percentage | Number | Percentage | Number |
| Energy production and conversion | 4 | 270 | 4.1 | 317 |
| Cell division and chromosome partitioning | 0.5 | 30 | 0.4 | 31 |
| Amino acid transport and metabolism | 6 | 400 | 5.1 | 395 |
| Nucleotide transport and metabolism | 1.3 | 89 | 1.2 | 93 |
| Carbohydrate transport and metabolism | 6.5 | 433 | 6.5 | 503 |
| Coenzyme metabolism | 2.3 | 154 | 2.2 | 170 |
| Lipid metabolism | 3.7 | 250 | 3.3 | 255 |
| Translation, ribosomal structure and biogenesis | 2.7 | 182 | 2.5 | 193 |
| Transcription | 7.3 | 493 | 8.5 | 658 |
| DNA replication, recombination and repair | 2.9 | 193 | 2.9 | 224 |
| Cell envelope biogenesis, outer membrane | 3 | 204 | 2.9 | 224 |
| Cell motility | 0.1 | 5 | 0.1 | 8 |
| Posttranslational modification, protein turnover, chaperones | 1.9 | 127 | 1.8 | 139 |
| Inorganic ion transport and metabolism | 2.3 | 154 | 2.7 | 209 |
| Secondary Structure | 2.8 | 186 | 1.9 | 147 |
| General function prediction only | 6.9 | 460 | 7.4 | 572 |
| COG of unknown function | 3.6 | 243 | 3.6 | 278 |
| Signal Transduction | 3.8 | 256 | 4.2 | 325 |
| Unknown | 36.6 | 2458 | 36.7 | 2839 |
General characteristics of the S. leeuwenhoekii genome
| Assembled chromosome size | 7903895 bp |
| Estimated chromosome size | 8285171 bp |
| Estimated Terminal Inverted Repeats | 388272 bp |
| Chromosome topology | Linear |
| Chromosome G + C content | 73 % |
| rRNA operons | 6 |
| tRNA genes | 65 |
| pSLE1 circular plasmid | 86370 bp |
| pSLE1 G + C content | 69 % |
| pSLE2 linear plasmid | 132226 bp |
| pSLE2 G + C content | 70 % |
| Putative biosynthetic gene clusters for specialised metabolites | 34 (+1 in pSLE2) |
Fig. 2Sequencing and assembly pipeline. The sequencing and assembly pipeline followed in this work (data specific to this project are shown in brackets) and suggested as strategy for actinomycete genome sequencing