| Literature DB >> 27154273 |
Ofir Cohen1, Shany Doron2, Omri Wurtzel3, Daniel Dar2, Sarit Edelheit2, Iris Karunker2, Eran Mick4, Rotem Sorek5.
Abstract
Whole-transcriptome sequencing studies from recent years revealed an unexpected complexity in transcriptomes of bacteria and archaea, including abundant non-coding RNAs, cis-antisense transcription and regulatory untranslated regions (UTRs). Understanding the functional relevance of the plethora of non-coding RNAs in a given organism is challenging, especially since some of these RNAs were attributed to 'transcriptional noise'. To allow the search for conserved transcriptomic elements we produced comparative transcriptome maps for multiple species across the microbial tree of life. These transcriptome maps are detailed in annotations, comparable by gene families, and BLAST-searchable by user provided sequences. Our transcriptome collection includes 18 model organisms spanning 10 phyla/subphyla of bacteria and archaea that were sequenced using standardized RNA-seq methods. The utility of the comparative approach, as implemented in our web server, is demonstrated by highlighting genes with exceptionally long 5'UTRs across species, which correspond to many known riboswitches and further suggest novel putative regulatory elements. Our study provides a standardized reference transcriptome to major clinically and environmentally important microbial phyla. The viewer is available at http://exploration.weizmann.ac.il/TCOL, setting a framework for comparative studies of the microbial non-coding genome.Entities:
Mesh:
Substances:
Year: 2016 PMID: 27154273 PMCID: PMC4987935 DOI: 10.1093/nar/gkw394
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
The transcriptome maps in the study. A quantitative summary of major transcriptional features including various types of transcriptional start sites (TSSs), inferred small RNAs (sRNAs) and inferred operons
| Organism | Growth conditions | genes | gTSS | iTSS | aTSS | nTSS | sRNAs | Operonsa | RNA-seq and TSS data | |
|---|---|---|---|---|---|---|---|---|---|---|
| Terrific Broth, mid-log phase | 4420 | 1292 | 20 | 29 | 128 | 75 | 1242 | This study | ||
| DSMZ medium | 3466 | 1365 | 204 | 407 | 215 | 99 | 1193 | ( | ||
| mid-log phase | 4495 | 1118 | 10 | 35 | 83 | 36 | 1059 | This study | ||
| log phase 37°C; log phase 30°C; hypoxia; stationary phase 37°C; Δ | 2936 | 1388 | 26 | 19 | 109 | 36 | 1282 | ( | ||
| attack phase; growth phase | 3627 | 407 | 8 | 16 | 46 | 23 | 395 | ( | ||
| DSMZ medium | 8984 | 1996 | 0 | 96 | 197 | 118 | 1908 | ( | ||
| pH 4.5; pH 5.7 | 3777 | 866 | 5 | 1 | 112 | 26 | 839 | This study | ||
| mannitol; glycerol | 2499 | 1268 | 135 | 642 | 168 | 87 | 1113 | This study | ||
| DSMZ medium | 2681 | 627 | 16 | 16 | 49 | 16 | 601 | ( | ||
| ATCC medium | 2265 | 777 | 6 | 7 | 71 | 29 | 764 | ( | ||
| M17 medium | 2400 | 746 | 5 | 7 | 82 | 31 | 714 | ( | ||
| 28°C; 37°C (LB medium) | 5975 | 2224 | 5 | 195 | 321 | 158 | 1728 | ( | ||
| Artificial seawater medium | 2585 | 1176 | 0 | 45 | 31 | 9 | 1125 | ( | ||
| Artificial seawater medium | 2581 | 1005 | 0 | 52 | 44 | 5 | 957 | ( | ||
| DSMZ medium | 4124 | 1284 | 8 | 212 | 84 | 34 | 1229 | ( | ||
| yeast extract, stationary phase | 2330 | 1050 | 0 | 32 | 133 | 45 | 1033 | This study | ||
| glucose; cellobiose; minimal | 3034 | 1094 | 0 | 124 | 202 | 109 | 1040 | ( | ||
| mid-log phase | 2035 | 731 | 0 | 109 | 41 | 15 | 688 | This study |
gTSS = A TSS found upstream to coding gene.
iTSS = internal TSS within a gene.
aTSS = cis-antisense TSS overlapping a gene.
nTSS = non-coding, intergenic TSS.
aIncluding overlapping operons.
Figure 1.Representative transcriptomes presented on the tree of life. The compendium of transcriptomes in this study is denoted as colored lineages across the prokaryotic tree of life consisting of 18 representative species across different phyla including Gamma, Alpha and Delta Proteobacteria, Cyanobacteria, Firmicutes, Deinococcus-Thermus, Actinobacteira, Spirochaetes and the Archael Crenarchaeotes phylum. Tree was taken from (44,45), reconstructed with (45).
Figure 2.The Transcriptomes Compared across the Tree of Life (TCOL) online web-server. Interactive browser with detailed transcriptional maps of 18 representative species across the microbial tree of life. Transcriptome maps can be accessed either by (A) choosing an organism or (B) by using BLAST with a user-provided query sequence. (C) Transcriptome maps can be navigated with multiple functions and are information-rich. Some of the information appears as tooltips when hovering above the features with the cursor. (D) ‘Gene page’ (when clicking on a gene in the browser) containing information about the gene including its homologs. (E) The comparative transcriptome viewer shows aligned transcriptomes of homologous genes, centered on the queried gene. This view is accessible from the ‘Gene page’ and from the BLAST results page.
Figure 3.Comparative transcriptome views of homologous genes with long 5'UTRs. (A) Threonyl-tRNA synthetase (thrS) has a known T-box (RF00230) in Listeria, Clostridium and Bacillus. (B) The transcription termination factor rho gene (COG1158) in which regulatory structure was previously inferred in Pseudomonas (Pseudomon-Rho). (C) The cytochrome d ubiquinol oxidase subunit I (COG1271) gene (D) The cytochrome o ubiquinol oxidase subunit II (COG1622) gene. Brown horizontal arrows denote RFAM-identified cis-regulators.