| Literature DB >> 22872780 |
Rahul Raghavan1, Daniel B Sloan, Howard Ochman.
Abstract
Noncoding RNAs, including antisense RNAs (asRNAs) that originate from the complementary strand of protein-coding genes, are involved in the regulation of gene expression in all domains of life. Recent application of deep-sequencing technologies has revealed that the transcription of asRNAs occurs genome-wide in bacteria. Although the role of the vast majority of asRNAs remains unknown, it is often assumed that their presence implies important regulatory functions, similar to those of other noncoding RNAs. Alternatively, many antisense transcripts may be produced by chance transcription events from promoter-like sequences that result from the degenerate nature of bacterial transcription factor binding sites. To investigate the biological relevance of antisense transcripts, we compared genome-wide patterns of asRNA expression in closely related enteric bacteria, Escherichia coli and Salmonella enterica serovar Typhimurium, by performing strand-specific transcriptome sequencing. Although antisense transcripts are abundant in both species, less than 3% of asRNAs are expressed at high levels in both species, and only about 14% appear to be conserved among species. And unlike the promoters of protein-coding genes, asRNA promoters show no evidence of sequence conservation between, or even within, species. Our findings suggest that many or even most bacterial asRNAs are nonadaptive by-products of the cell's transcription machinery. IMPORTANCE Application of high-throughput methods has revealed the expression throughout bacterial genomes of transcripts encoded on the strand complementary to protein-coding genes. Because transcription is costly, it is usually assumed that these transcripts, termed antisense RNAs (asRNAs), serve some function; however, the role of most asRNAs is unclear, raising questions about their relevance in cellular processes. Because natural selection conserves functional elements, comparisons between related species provide a method for assessing functionality genome-wide. Applying such an approach, we assayed all transcripts in two closely related bacteria, Escherichia coli and Salmonella enterica serovar Typhimurium, and demonstrate that, although the levels of genome-wide antisense transcription are similarly high in both bacteria, only a small fraction of asRNAs are shared across species. Moreover, the promoters associated with asRNAs show no evidence of sequence conservation between, or even within, species. These findings indicate that despite the genome-wide transcription of asRNAs, many of these transcripts are likely nonfunctional.Entities:
Mesh:
Substances:
Year: 2012 PMID: 22872780 PMCID: PMC3419515 DOI: 10.1128/mBio.00156-12
Source DB: PubMed Journal: MBio Impact factor: 7.867
Sequencing reads mapped to E. coli and Salmonella genomes
| Characteristic | Value for genome | |
|---|---|---|
| Total no. of reads | 30,206,434 | 29,275,111 |
| Total mapped reads (%) | 88 | 87 |
| Protein-coding genes (%) | ||
| Sense | 30 | 27 |
| Antisense | 2 | 2 |
| Intergenic regions (%) | 54 | 55 |
| Plasmid (%) | No plasmid | 0.4 |
Of 5,313 putative ORFs in the Salmonella Typhimurium 14028S genome, only 4,477 with orthologs in Salmonella Typhimurium LT2 and/or in E. coli MG1655 were considered.
FIG 1 Location of an asRNA within the holE coding region of E. coli. Transcripts identified by RNA-seq mapped onto the E. coli genome: reads corresponding to holE mRNA transcripts are in red, and reads corresponding to the antisense transcript are in green. Locations of TSSs and the direction of transcription are shown. Numbering is according to the E. coli MG1655 genome (NC_000913.2).
FIG 2 Enrichment of promoter elements associated with antisense RNAs. Distributions show null expectations for the percentage of randomly selected antisense sites associated with a −10 promoter element in the E. coli (black/right) and Salmonella (gray/left) genomes. For each genome, arrows indicate the percentage of identified antisense TSSs associated with −10 promoter elements.
Genes with asRNAs conserved on their noncoding strands in E. coli and Salmonella
| Gene | Product | asRNA expression | Distance between TSS positions (bp) | |
|---|---|---|---|---|
| DNA-binding transcriptional repressor | 495 | 819 | 0 | |
| Aspartate-semialdehyde dehydrogenase, NAD(P) binding | 1,062 | 57 | 0 | |
| 176 | 680 | 1 | ||
| Hydroxyacylglutathione hydrolase | 110 | 1,079 | 1 | |
| Octaprenyl diphosphate synthase | 30 | 637 | 1 | |
| Multidrug efflux system protein | 447 | 143 | 0 | |
| Cystathionine gamma-synthase, PLP dependent | 502 | 40 | 0 | |
| Membrane-bound lytic murein transglycosylase C | 178 | 657 | 0 | |
| Nitrate reductase, periplasmic, large subunit | 419 | 209 | 10 | |
| ADP-ribose pyrophosphatase | 171 | 346 | 0 | |
| Pyruvate formate lyase I | 79 | 214 | 0 | |
| DNA-binding transcriptional repressor | 321 | 28 | 0 | |
| Pyridoxal phosphate (PLP) phosphatase | 270 | 29 | 0 | |
| Predicted zinc-binding hydrolase | 204 | 3,500 | 10 | |
| Predicted inner membrane permease | 90 | 243 | 1 | |
| Inosine/xanthosine triphosphatase | 1,113 | 419 | 0 | |
| Conserved protein, UPF0131 family | 432 | 550 | 0 | |
Maximum depth of mapped reads near asRNA TSS.
FIG 3 Comparisons of divergence/polymorphism in asRNA and mRNA promoter sequences. Plotted are averages from a 9-bp sliding window of the interspecific sequence divergence between either E. coli and Salmonella (A) or E. coli and E. fergusonii (B) or for the proportion of segregating sites within E. coli (C). Black arrows on y axes indicate average divergence (or polymorphism) for the entire coding sequences of ORFs containing asRNAs. Note that levels of sequence divergence (or polymorphism) in the asRNA promoter regions are statistically indistinguishable from the corresponding levels in the surrounding ORF (P > 0.5 for all three data sets; paired t tests). The 10 bp flanking the −10 and −35 regions have been highlighted. Each error bar represents ±1 standard error of the proportion.