| Literature DB >> 17623098 |
Karthik P Jayapal1, Wei Lian, Frank Glod, David H Sherman, Wei-Shou Hu.
Abstract
BACKGROUND: The genomes of Streptomyces coelicolor and Streptomyces lividans bear a considerable degree of synteny. While S. coelicolor is the model streptomycete for studying antibiotic synthesis and differentiation, S. lividans is almost exclusively considered as the preferred host, among actinomycetes, for cloning and expression of exogenous DNA. We used whole genome microarrays as a comparative genomics tool for identifying the subtle differences between these two chromosomes.Entities:
Mesh:
Substances:
Year: 2007 PMID: 17623098 PMCID: PMC1934918 DOI: 10.1186/1471-2164-8-229
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1Comparative analysis of S. coelicolor M145 – S. lividans TK21 chromosomes and genomic features in S. coelicolor. (Log2 hybridization ratio as a function of S. coelicolor ORFs. Low values imply absence or divergence of gene in S. lividans. Experimental data (blue circles) and smoothened data using median of a 19-gene sliding window (thick line) are shown. Large genomic islands (red), smaller islets (orange) and S. coelicolor specific chromosome end regions (green) identified in this study are marked with filled bars. The sizes of the bars approximate the extent of each island. GI and Gi numbers corresponding to those in Table 1 are indicated. Note that the smoothened curve, shown here for ease of visualization, will not show a dip in hybridization ratios for genomic islets with fewer than 10 genes. (Logarithm of codon adaptation index (CAI) as a function of S. coelicolor ORFs. Low log2 CAI implies sub-optimal codon usage. (z' value plotted as a function of distance from chromosome left end. A sharp increase in z' value indicates presence of an AT-rich region. The figure shows that there exists a good correlation between presence of genomic islands, codon usage bias and local GC content.
Characteristics of genomic islands (GIs) and islets (Gis) identified in this study
| Chromosome left end; SCO0001-0081 | 69.6 | 81 | 69.5 | 0.58 (59%) | - | - | 8 transposon related genes |
| Gi-1; SCO0090-0099 | 6.6 | 10 | 67.1 | 0.56 (67%) | - | - | 6 transposon related genes |
| 25.9 | 22 | 70.7 | 0.67 (23%) | - | - | 1 phage related integrase | |
| Gi-2; SCO2381-2384 | 6.4 | 4 | 68.4 | 0.57 (75%) | - | - | - |
| Gi-3; SCO2862-2871 | 8.6 | 10 | 65.9 | 0.54 (78%) | - | - | 1 gene similar to phage replication regulator |
| Gi-4; SCO3250-3270 | 14.0 | 21 | 66.2 | 0.54 (80%) | Arg-tRNA CCT | 43 bp perfect repeat | Mostly plasmid (pSAM2) related genes |
| 75.0 | 70 | 68.7 | 0.60 (55%) | - | 61 bp repeat (6 mismatches) | 5 transposon related genes | |
| Gi-5; SCO3929-3937 | 9.2 | 8 | 68.0 | 0.59 (50%) | - | - | pSAM2 integration site, some plasmid functions |
| Gi-6; SCO3980-3998 | 10.7 | 19 | 65.8 | 0.44 (100%) | Ser-tRNA TGA | 106 bp repeat (8 mismatches) | pSAM2 insertion element |
| Gi-7; SCO4060-4066 | 5.2 | 7 | 66.1 | 0.43 (100%) | Ser-tRNA GGA | 88 bp repeat (4 mismatches) | 2 transposon related genes |
| Gi-8; SCO4210-4213 | 3.8 | 4 | 66.8 | 0.50 (75%) | - | 19 bp perfect repeat | Some phage related genes |
| Gi-9; SCO4346-4350 | 7.0 | 5 | 66.9 | 0.55 (75%) | - | - | - |
| Gi-10; SCO4533-4537 | 4.3 | 5 | 72.1 | 0.70 (0%) | - | - | - |
| Gi-11; SCO4615-4631 | 15.9 | 17 | 68.3 | 0.62 (31%) | Tyr-tRNA GTA | 112 bp repeat (1 mismatch) | Possible SLP1 insertion |
| Gi-12; SCO5327-5350 | 21.3 | 24 | 65.3 | 0.67 (55%) | Arg-tRNA CCG | 114 bp repeat (4 mismatches) | pSAM2 integration, plasmid/phage related genes |
| Gi-13; SCO5605-5620 | 12.9 | 16 | 67.0 | 0.58 (67%) | - | - | All phage related genes |
| Gi-14; SCO5632-5644 | 10.4 | 13 | 63.8 | 0.44 (100%) | - | 52 bp repeat (5 mismatches) | Some plasmid related functions |
| Gi-15; SCO5718-5735 | 21.8 | 18 | 68.1 | 0.56 (72%) | - | - | - |
| Gi-16; SCO6120-6124 | 2.9 | 5 | 70.0 | 0.67 (25%) | - | - | - |
| Gi-17; SCO6314-6338 | 23.6 | 23 | 70.2 | 0.59 (70%) | - | - | 2 transposon related genes |
| 56.6 | 53 | 68.6 | 0.56 (61%) | - | - | 4 transposon related genes | |
| 30.4 | 18 | 68.6 | 0.56 (65%) | - | - | Few phage related genes | |
| 153.3 | 148 | 69.0 | 0.60 (53%) | Pro-tRNA GGG | 44 bp perfect repeat | 1 transposon related gene at end | |
| Gi-18; SCO7795-7802 | 4.2 | 8 | 65.2 | 0.48 (86%) | - | - | 3 transposon related genes |
| Chromosome right end; SCO7827-7845 | 20.4 | 19 | 70.1 | 0.55 (83%) | - | - | 1 transposon related gene; |
Large genomic islands (≥25 kb) are in bold face
Percentage GC content within each genomic island; compare with 72.1% average GC content of whole genome
Compare the average CAI of ORFs within each segment with 0.73, the average CAI of all genes in S. coelicolor. Percent genes with low CAI is the fraction of genes in each island with CAI values lower than global mean – standard deviation (= 0.625)
Functional classification of S. coelicolor M145 genes missing or divergent in S. lividans TK21
| Cell processes | 800 | 32 | 4 % |
| Macromolecule metabolism | 496 | 20 | 4 % |
| Amino acids biosynthesis | 123 | 3 | 2 % |
| Nucleotide biosynthesis | 30 | 0 | 0 % |
| Ribosomal constituents | 67 | 0 | 0 % |
| Biosynthesis of cofactors and carriers | 118 | 0 | 0 % |
| Central intermediary metabolisms | 111 | 4 | 4 % |
| Degradation of small molecules | 200 | 6 | 3 % |
| Energy metabolism | 189 | 1 | 1 % |
| Fatty acid and phosphatidic acid biosynthesis | 56 | 4 | 7 % |
| Secondary metabolism | 277 | 3 | 1 % |
| Periplasmic, exported or lipoproteins | 1318 | 84 | 6 % |
| Two-component systems | 165 | 7 | 4 % |
| RNA polymerase core enzyme binding | 88 | 3 | 3 % |
| Regulatory proteins | 673 | 39 | 6 % |
| Protein kinases | 39 | 1 | 3 % |
| Laterally acquired elements | 139 | 95 | 68 % |
| Not classified | 565 | 30 | 5 % |
| Hypothetical proteins | 2371 | 293 | 12 % |
protein classification scheme derived from EcoCyc database; downloaded from
Total number of proteins present in S. coelicolor chromosome in each category
Figure 2Comparison of CGH and transcriptome data. Plot of log2 [gDNA/gDNA] (x-axis) vs max(log2 [mRNA/gDNA]) (y-axis). The y-axis values correspond to the maximum gene expression levels observed considering all analyzed samples (includes both R5- and SMM). Genes belonging to GI-1 (), GI-2 (×), GI-3 (○), GI-4 (□), GI-5 (◇) and all other smaller islets (black +) are shown alongside genes conserved in both S. coelicolor and S. lividans (green +). Genes in the region left of and above the dotted lines are those that are highly expressed in S. coelicolor but absent in S. lividans. High expression in this case was defined as max(log2 [mRNA/gDNA]) ≥ 2.0 which corresponds to the top 14 percentile of gene expression values. The plot shows that a large fraction of genes absent in S. lividans have low expression levels in S. coelicolor. Nevertheless, there are some prominent examples of highly expressed genes within certain S. coelicolor genomic islands. This includes several genes from GI-2.