| Literature DB >> 22759432 |
Zhan Zhou1, Jianying Gu, Yong-Quan Li, Yufeng Wang.
Abstract
BACKGROUND: Streptomycetes are filamentous soil-dwelling bacteria. They are best known as the producers of a great variety of natural products such as antibiotics, antifungals, antiparasitics, and anticancer agents and the decomposers of organic substances for carbon recycling. They are also model organisms for the studies of gene regulatory networks, morphological differentiation, and stress response. The availability of sets of genomes from closely related Streptomyces strains makes it possible to assess the mechanisms underlying genome plasticity and systems adaptation.Entities:
Mesh:
Substances:
Year: 2012 PMID: 22759432 PMCID: PMC3388780 DOI: 10.1186/1471-2105-13-S10-S8
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1The orthologous clusters identified by OrthoMCL. The parameters include the BLASTP cutoff E-values (10, 1, 0.1, 0.01, 10-3, 10-4, 10-5, 10-6, 10-7, 10-8, 10-9, and 10-10) and the MCL Markov clustering inflation indexes (1.0, 1.5, and 2.0).
Figure 2The pan-genome of five .
The core genome components and lineage-specific expansions (LSEs) in five Streptomyces species.
| Strains | Accession ID | Genome size (Mbp) | # Genes (# ORFs) in genome | % core in genome | # LSE families | # LSE genes | % LSEs in genome | |
|---|---|---|---|---|---|---|---|---|
| Lineage-unique | Typical LSEs | |||||||
| 9.1 | 7765 (7676) | 41.87 | 38 | 112 | 312 | 4.06 | ||
| 11.9 | 10025 (10023) | 33.35 | 219 | 232 | 1127 | 11.24 | ||
| 9.1 | 8300 (8153) | 39.42 | 98 | 163 | 568 | 6.97 | ||
| 8.5 | 7224 (7136) | 44.63 | 116 | 105 | 471 | 6.60 | ||
| 10.1 | 8901 (8746) | 36.47 | 116 | 186 | 689 | 7.88 | ||
The inter-genomic search yielded a core genome comprised of 3,096 orthologous proteins.
Examples of functional categories of core genome components in five Streptomyces genomes.
| Function description | Examples of GO classes | No. families |
|---|---|---|
| Genetic information processing | GO:0006260 (DNA replication) | 27 |
| GO:0006350 (transcription) | 174 | |
| GO:0006412 (translation) | 102 | |
| GO:0006310 (DNA recombination) | 14 | |
| GO:0004803 (transposase activity) | 4 | |
| GO:0015074 (DNA integration) | 2 | |
| Regulation | GO:0003700 (sequence-specific DNA binding transcription factor activity) | 201 |
| GO:0016566 (specific transcriptional repressor activity) | 43 | |
| GO:0000155 (two-component sensor activity) | 36 | |
| GO:0000156 (two-component response regulator activity) | 49 | |
| GO:0016987 (sigma factor activity) | 31 | |
| Metabolism | GO:0005975 (carbohydrate metabolic process) | 67 |
| GO:0009058 (biosynthetic process) | 66 | |
| GO:0009116 (nucleoside metabolic process) | 13 | |
| GO:0006520 (cellular amino acid metabolic process) | 12 | |
| GO:0006631 (fatty acid metabolic process) | 7 | |
| Morphogenesis | GO:0007047 (cellular cell wall organization) | 14 |
| GO:0007059 (chromosome segregation) | 9 | |
| GO:0000902 (cell morphogenesis) | 3 | |
| GO:0000917 (barrier septum formation) | 2 | |
| Response to stimulus | GO:0006950 (response to stress) | 22 |
| GO:0046677 (response to antibiotic) | 12 | |
| GO:0009432 (SOS response) | 9 | |
| GO:0006979 (response to oxidative stress) | 3 | |
| Transport | GO:0006811 (ion transport) | 14 |
| GO:0015904 (tetracycline transport) | 9 | |
| GO:0015031 (protein transport) | 8 | |
| GO:0006817 (phosphate transport) | 6 | |
| GO:0006865 (amino acid transport) | 6 | |
| GO:0008643 (carbohydrate transport) | 6 | |
| Hydrolase | GO:0008233 (peptidase activity) | 38 |
| GO:0016810 (hydrolase activity, acting on carbon-nitrogen (but not peptide) bonds) | 22 | |
| GO:0004553 (hydrolase activity, hydrolyzing O-glycosyl compounds) | 19 | |
| GO:0016788 (hydrolase activity, acting on ester bonds) | 12 | |
| GO:0016820 (hydrolase activity, acting on acid anhydrides, catalyzing transmembrane movement of substances) | 10 | |
| GO:0016799 (hydrolase activity, hydrolyzing N-glycosyl compounds) | 7 | |
| GO:0030245 (cellulose catabolic process) | 3 | |
| GO:0004568 (chitinase activity) | 2 | |
| Ion binding | GO:0008270 (zinc ion binding) | 52 |
| GO:0051536 (iron-sulfur cluster binding) | 45 | |
| GO:0000287 (magnesium ion binding) | 29 | |
| GO:0005506 (iron ion binding) | 26 | |
Figure 3A schematic diagram showing the 5-hydroxyectoine biosynthesis gene cluster (strep3082-3085) that is conserved in the five . The chromosome location of this cluster in each respective genome is: SCO1857-1873 for S. coelicolor, SAV_6407-6388 for S. avermitilis, SBI_08140-08124 for S. bingchenggensis, SGR_5639-5623 for S. griseus, SCAB_70821-70641 for S. scabiei. Blue: strep3082 (ectA), L-2,4-diaminobutyric acid acetyltransferase; magenta: strep3083 (ectB), L-2,4-diaminobutyrate aminotransferase; green: strep3084 (ectC), ectoine synthase; orange: strep3085 (ectD), ectoine hydroxylase; black: core genome components; gray: dispensable genome components; white: species-specific genes.
Figure 4The distribution of gene families with lineage-specific expansions in five . A. Distribution of the size of LSE gene families. B. Distribution of the size of lineage-unique LSE gene families. C. Distribution of the size of typical LSE gene families.
Examples of contiguous single gene expansions in five Streptomyces genomes.
| Stains | Family | Duplications | Function description |
|---|---|---|---|
| strep1777 | SCO3466-3467 | Transposase | |
| strep2302 | SCO2906-2907 | PTS transmembrane protein | |
| strep2354 | SCO5663-5664 | Integral membrane protein | |
| strep5410 | SCO7774-7775 | Secreted protein | |
| strep6870 | SCO7791-7791a | Secreted oxidoreductase | |
| strep8395 | SCO0445-0446 | Secreted protein | |
| strep8429 | SAV576-577 | TetR family transcriptional regulator | |
| strep2207 | SAV5346-5349 | Hypothetical protein | |
| strep1002 | 14 groups | DNA integration | |
| strep1016 | SBI_09333-09334 | Transposase | |
| strep2168 | SBI_06671-06672 | Integral membrane protein | |
| strep2332 | SBI_06321-06322 | Regulatory protein | |
| strep2438 | SBI_04627-04628 | Regulatory protein | |
| strep5446 | SBI_02610-02611 | Cytochrome P450 family protein | |
| strep5448 | SBI_02692-02693 | Beta galactosidase | |
| strep8479 | SBI_01576-01577 | Transcriptional regulator, CdaR | |
| strep8554 | SBI_09566-09567 | NmrA family protein | |
| strep2491 | SGR_2433-2434, SGR_5023-5024 | Hypothetical protein | |
| strep7090 | SGR_4733-4734 | Membrane transporter | |
| strep8591 | SGR_812-813 | FAD-dependent oxidoreductase | |
| strep8821 | SGR_605-606 | Putative enediyne biosynthesis protein | |
| strep1854 | SCAB_42921-42931 | Transposase | |
| strep5499 | SCAB_23621-23631 | Oxidoreductase | |
| strep5512 | SCAB_42581-42591 | Integrin-like protein | |
| strep7062 | SCAB_69611-69621 | Extracellular substrate-binding protein | |
| strep7064 | SCAB_48881-48891 | GntR family transcriptional regulator | |
| strep8654 | SCAB_81001-81011 | IclR family transcriptional regulator | |
Note: a The number in parenthesis shows the number of LSE families with contiguous single gene expansions in each genome.
Examples of LSE block duplications in five Streptomyces genomes.
| Strains | Block size | Blocks |
|---|---|---|
| 5 | (SCO3264-3266, SCO3268-3269)(SCO3986-3984, SCO3982-3981) | |
| (13)a | 4 | (SCO0216-0219)(SCO4947-4950)(SCO6535-6532) |
| 4 | (SCO4525-4528)(SCOSCP1.168-171) | |
| 3 | (SCO1603-1605)(SCO6400-6402) | |
| 2 | (SCO0874-0875)(SCO3521-3522) | |
| 7 | (SAV_354-360)(SAV1008-1011, SAV_1013-1015) | |
| (9) | 4 | (SAV_5337-5340)(SAV7154, SAV7150-7148) |
| 3 | (SAV_4095-4097)(SAV_5434-5432) | |
| 2 | (SAV_1199-1200)(SAV_7454-7453) | |
| 2 | (SAV_7567-7568)(SAP1p22-21) | |
| 7 | (SBI_00573-00577, SBI_00580-00581)(SBI_01909-01913, SBI_01915-01916) | |
| (34) | 4 | (SBI_02525-02528)(SBI_06733-06736) |
| 4 | (SBI_04407-04410)(SBI_04411-04414) | |
| 3 | (SBI_00377-00379)(SBI_05209-05211) | |
| 3 | (SBI_00539-00541)(SBI_05182-05184) | |
| 2 | (SBI_00043-00044)(SBI_07509-07510)(SBI_07927-07928)(SBI_07929-07930) | |
| 2 | (SBI_02527-02528)(SBI_05511-05510)(SBI_07028-07029)(SBI_07426-07425) | |
| 2 | (SBI_03820,03822)(SBI_03854-03855)(SBI_07225-07226) | |
| 2 | (SBI_00058-00059)(SBI_04127-04128)(SBI_05638-05637) | |
| 2 | (SBI_04480-04481)(SBI_04484-04485) | |
| 2 | (SGR_443-444)(SGR_6579-6578) | |
| (4) | 2 | (SGR_454-455)(SGR_4567-4566) |
| 2 | (SGR_2409-2410)(SGR_2414-2415) | |
| 2 | (SGR_2951-2950)(SGR_4306-4305) | |
| 58 | (SCAB_21721-21811, 21981-22181, 22201-22281, 22331, 22351-22371, 22391-22401, 22421-22481, 22501, 22531-22561) (SCAB_32241-32331, 32561-32811, 32831-32861, 32991, 33031-33071, 33061- 33071, 33091-33141, 33161-33211) | |
| 5 | (SCAB_10401-10441)(SCAB_76871-76841) | |
| 4 | (SCAB_0501-0531)(SCAB_84341-84311) | |
| 3 | (SCAB_4431-4451)(SCAB_88631-88611) | |
| 2 | (SCAB_0371-0372)(SCAB_0691-0692)(SCAB_0991-0982)(SCAB_23721-23722) (SCAB_90821-90812) | |
| 2 | (SCAB_0831-0841)(SCAB_1341-1351)(SCAB_10021-10011)(SCAB_81311-81301) | |
| 2 | (SCAB_2791-2801)(SCAB_5601-5591)(SCAB_88151-88161) | |
| 2 | (SCAB_13971-13981)(SCAB_14051-14061) | |
Block size describes the number of genes in chromosomal or plasmid region that has been duplicated together.
Note: a The number in parenthesis shows the number of duplicated gene blocks in each genome.
Figure 5Graphic representation of a large gene block that has been duplicated in [85]. The upper and middle panels depict the two respective duplicated clusters in S. scabiei. (SCAB_21721-22561 and SCAB_32241-33211). Only a single copy of this cluster is present in S. coelicolor (SCO6924-6843 (lower). Average sequence similarities are described as the height of the bars in each LCB (locally collinear block).
Figure 6A phylogenetic tree of the two predicted lineage-unique ABC transporter families and antibiotic related ABC transporters from . The ABC transporters are divided into three classes. Type I: DrrA, S. peucetius; MtrA, S. argillaceus; KasK, S. kasugaensis; TnrB2, S. longisporoflavus; Type II: LmrC, S. lincolnensis; VarM, S. virginiae; OleB, S. antibioticus; SrmB, S. ambofaciens; CarA, S. thermotolerans; TlrC, S. fradiae; Type III: NovA, S. sphaeroides; NysG, NysH, S. noursei; StrW, StrV, S. glaucescens. The lineage-unique families are highlighted in green. The ABC transporters with proved antibiotic resistance function are highlighted in blue, and those not yet tested are highlighted in grey. Tree was inferred by the neighbor-joining method based on the amino acid sequences with Poisson corrected distance. The option of complete deletion of gaps was used for tree construction. 1,000 bootstrap replicates were used to infer the reliability of branching points. Bootstrap values of > 50% are presented. The scale bar indicates the number of amino acid substitutions per site. The Maximum Likelihood and Maximum Parsimony methods give virtually the same topology (data not shown).