Literature DB >> 20689751

Widespread antisense transcription in Escherichia coli.

James E Dornenburg1, Anne M Devita, Michael J Palumbo, Joseph T Wade.   

Abstract

The vast majority of annotated transcripts in bacteria are mRNAs. Here we identify ~1,000 antisense transcripts in the model bacterium Escherichia coli. We propose that these transcripts are generated by promiscuous transcription initiation within genes and that many of them regulate expression of the overlapping gene.

Entities:  

Year:  2010        PMID: 20689751      PMCID: PMC2912661          DOI: 10.1128/mBio.00024-10

Source DB:  PubMed          Journal:  MBio            Impact factor:   7.867


INTRODUCTION

Recent high-throughput sequencing analyses of RNA in eukaryotes have revealed a far more complex network of RNAs than previously appreciated, including thousands of RNAs antisense to protein-coding genes (aRNAs) (1). In contrast, relatively few aRNAs have been identified in bacteria (2). Studies of individual plasmid-encoded and chromosomally encoded aRNAs in a variety of bacterial species have demonstrated that aRNAs can regulate expression of the overlapping gene at the level of translation, mRNA stability, or transcription (3–11). Several studies have hinted at the existence of many more aRNAs, in multiple bacterial species, than those currently described (5, 8, 10, 12–18), suggesting that aRNAs have a widespread regulatory function in bacteria. We sought to identify novel aRNAs in Escherichia coli. We generated a cDNA library by extracting RNA from rapidly growing cells (wild-type strain MG1655 grown with aeration in LB to an optical density at 600 nm [OD600] of 0.7), treating the RNA with tobacco acid pyrophosphatase to convert 5′-triphosphate groups to monophosphates, ligating an RNA oligonucleotide (5′-ACACUCUUUCCCUACACGACGCUCUUCCGAUCU-3′) to the RNA 5′ ends, reverse transcribing with a primer in which the nine 3′-end proximal bases are random (5′-GTTTCCCAGTCACGATCNNNNNNNNN-3′), and amplifying by PCR. Using Solexa sequencing, we identified unique RNA 5′ ends. The mapped RNA 5′-end locations include many known transcription start sites: 24% of sequences of published transcription start sites are matched exactly by a sequence from our library, and 41% of those sequences are ≤2 bp away from a sequence from our library (19). The exact matches include the majority of known aRNAs (GadY, RyjB, RdlA, RdlD, RyeA, SokB, and SokC). The RNA 5′-end locations also include 1,005 locations that map antisense to protein-coding genes (see Table S1 in the supplemental material), suggesting the existence of many more aRNAs. These putative aRNA 5′ ends were each sequenced between 1 and 5,488 times. An additional 385 ends map antisense to known and predicted 5′ and 3′ untranslated regions (UTR) (see Table S1 in the supplemental material) (20). The housekeeping σ factor σ70 binds a bipartite DNA sequence at E. coli promoters during transcription initiation. The downstream recognition site, the −10 hexamer, has the consensus sequence TATAAT and is typically positioned 7 or 8 bp upstream of the transcription start site (21). For the set of 471 published transcription start sites (19), the −10 hexamers match the consensus, on average, 3.28 times out of 6 (−10 match score) (base distribution shown in Fig. 1A). In contrast, 1,000 randomly selected sequences antisense to genes match the consensus only 2.00 times out of 6 (control match score) (base distribution shown in Fig. 1B). This difference is highly significant (Mann-Whitney U test, P of 8.9e−70). Furthermore, 46% of the RNAs with published start sites initiate with “A,” significantly more than expected by chance (P < 1e−22) (Fig. 1A and B). The −10 hexamer sequences for the 1,005 putative aRNAs identified in this work have a −10 match score of 3.27, significantly higher than the control match score (Mann-Whitney U test, P of 8.8e−102) (base distribution shown in Fig. 1C). This holds true even for the 141 aRNA 5′ ends that were sequenced only once (score of 3.12; Mann-Whitney U test, P of 2.8e−21). The −10 match score for the 1,005 aRNAs is not significantly different from that for the set of published start sites (Mann-Whitney U test, P = 0.49). Moreover, 48% of the putative aRNAs initiate with “A,” significantly more than expected by chance (P < 1e−50) (Fig. 1B and C) but not significantly different from the set of published start sites (Fisher’s exact test, P of 0.40) (Fig. 1A and C). Thus, the promoters and transcription start sites of the 1,005 putative aRNAs have DNA sequence properties that are indistinguishable from those of characterized transcripts.
FIG 1

(A) Distribution of nucleotides at the transcription start site (+1) and positions upstream for transcripts with published start sites. Equivalent distributions are shown for 1,000 random intragenic sequences (B) and the 1,005 putative aRNAs identified in this work (C).

(A) Distribution of nucleotides at the transcription start site (+1) and positions upstream for transcripts with published start sites. Equivalent distributions are shown for 1,000 random intragenic sequences (B) and the 1,005 putative aRNAs identified in this work (C). To experimentally validate the putative aRNAs, we fused the promoter regions (up to 200 bp upstream of the putative transcription start site) of 10 aRNAs to a lacZ reporter gene and measured expression levels in a β-galactosidase assay. In 9 out of 10 cases tested, we detected lacZ expression that was significantly reduced by mutation of the −10 hexamer (Fig. 2A). We conclude that the large majority of putative aRNAs are genuine and that our transcription start site assignments are highly accurate.
FIG 2

(A) Expression of a lacZ reporter gene fused to putative aRNA promoters. Wild-type (gray, right) or mutant (orange, right; −10 hexamers replaced by GGGCCC) aRNA promoter regions (200 bp upstream to 10 bp downstream of +1) were transcriptionally fused to lacZ on a single-copy plasmid (a derivative of pBAC-BA-lacZ, Addgene plasmid 13423, in which the HindIII-NotI fragment was replaced with an E. coli rRNA transcription terminator). β-Galactosidase assays were performed using E. coli MG1655 ΔlacZ. Gene names indicate the overlapping protein-coding genes. Numbers in parentheses indicate the number of times the aRNA 5′ end was sequenced/the number of base matches to the −10 hexamer consensus. Note that one promoter tested (eutB) is located in an untranslated region between the eutB and eutC genes (transcribed within an operon), but the putative RNA overlaps the eutB gene. There is no correlation between the number of sequence reads and promoter strength. We speculate that this is due to a combination of differential aRNA stability, introduction of bias by the PCR step of library construction, and the known sequence bias of RNA ligase T4 Rnl1 (27). wt, wild type. (B) Expression of a lacZ reporter translationally fused to rplJ or yrdA, including the natural rplJ or yrdA protein-coding gene promoter, on a single-copy plasmid (described above). Expression levels were measured for wild-type (gray, right) and mutant (orange, right) aRNA −10 hexamers/+1 transcription start sites (mutations did not alter the protein-coding sequence of the mRNA and did not substantially alter the codon bias; the rplJ aRNA −10 hexamer mutated from TACAGT to GACGGT, and the +1 transcription start site mutated from A to G; the yrdA aRNA −10 hexamer mutated from CATAAT to CGTAGT, while the +1 transcription start site was unchanged [boldface shows change]). Expression of rplJ::lacZ and yrdA::lacZ was measured using MG1655 ΔlacZ and MG1655 ΔlacZ ΔyrdA, respectively.

(A) Expression of a lacZ reporter gene fused to putative aRNA promoters. Wild-type (gray, right) or mutant (orange, right; −10 hexamers replaced by GGGCCC) aRNA promoter regions (200 bp upstream to 10 bp downstream of +1) were transcriptionally fused to lacZ on a single-copy plasmid (a derivative of pBAC-BA-lacZ, Addgene plasmid 13423, in which the HindIII-NotI fragment was replaced with an E. coli rRNA transcription terminator). β-Galactosidase assays were performed using E. coli MG1655 ΔlacZ. Gene names indicate the overlapping protein-coding genes. Numbers in parentheses indicate the number of times the aRNA 5′ end was sequenced/the number of base matches to the −10 hexamer consensus. Note that one promoter tested (eutB) is located in an untranslated region between the eutB and eutC genes (transcribed within an operon), but the putative RNA overlaps the eutB gene. There is no correlation between the number of sequence reads and promoter strength. We speculate that this is due to a combination of differential aRNA stability, introduction of bias by the PCR step of library construction, and the known sequence bias of RNA ligase T4 Rnl1 (27). wt, wild type. (B) Expression of a lacZ reporter translationally fused to rplJ or yrdA, including the natural rplJ or yrdA protein-coding gene promoter, on a single-copy plasmid (described above). Expression levels were measured for wild-type (gray, right) and mutant (orange, right) aRNA −10 hexamers/+1 transcription start sites (mutations did not alter the protein-coding sequence of the mRNA and did not substantially alter the codon bias; the rplJ aRNA −10 hexamer mutated from TACAGT to GACGGT, and the +1 transcription start site mutated from A to G; the yrdA aRNA −10 hexamer mutated from CATAAT to CGTAGT, while the +1 transcription start site was unchanged [boldface shows change]). Expression of rplJ::lacZ and yrdA::lacZ was measured using MG1655 ΔlacZ and MG1655 ΔlacZ ΔyrdA, respectively. We selected two mRNAs, rplJ and yrdA, that each overlap a putative aRNA. We translationally fused the mRNAs in frame to lacZ, under control of the natural mRNA promoter, and compared the expression levels of lacZ for a wild-type construct and a construct containing a mutated −10 hexamer and +1 nucleotide for the aRNA (+1 nucleotide not mutated for yrdA). Expression of lacZ increased significantly upon mutation of the aRNA promoter for rplJ but not for yrdA (Fig. 2B). This strongly suggests that the aRNA overlapping rplJ represses expression of the mRNA. Our data demonstrate that (i) antisense transcription is widespread in E. coli and (ii) aRNAs can regulate expression of the overlapping gene. Regulation by aRNAs is likely to be widespread, since all previously characterized bacterial aRNAs regulate expression of the overlapping gene (3–11). The majority of aRNAs are likely to be noncoding due to constraints imposed by the overlapping protein-coding sequence. A small fraction of aRNAs may be mRNAs for which the 5′-end UTR is antisense to another gene; however, this is unlikely in most cases, since only 21% of aRNAs initiate ≤500 bp upstream of a known translation start site on the same strand. Since they are likely to be noncoding, aRNAs are also likely to be substrates for Rho-dependent termination, which occurs within the first few hundred nucleotides of transcription (14). We conclude that the majority of aRNAs are short (<500-nucleotide), noncoding transcripts. We speculate that most of the novel aRNAs are generated by promiscuous transcription initiation within genes, as has been suggested for eukaryotic genomes (22). This hypothesis is consistent with the presence of many transcription factor and σ binding sites within genes (15, 18, 23–26), the low information sequence requirements required to promote transcription in bacteria (21), and the absence of inhibitory chromatin structure within bacterial genes (26). aRNAs are likely to have a major impact on bacterial gene expression due to the high potential for base pairing with an mRNA and the high likelihood of transcriptional interference resulting from the overlap of aRNA and mRNA transcription units. Given that aRNAs have been identified in a wide range of bacterial species, we propose that aRNAs are important regulators of gene expression in all bacteria. NCBI short read archive accession number. Raw sequencing data are available under Accession Number SRA012168.4. Details of 1,005 putative aRNA transcription start sites. Table 1, XLS file, 0.26 MB
  27 in total

1.  Transcriptional noise and the fidelity of initiation by RNA polymerase II.

Authors:  Kevin Struhl
Journal:  Nat Struct Mol Biol       Date:  2007-02       Impact factor: 15.369

2.  Transcription termination within the iron transport-biosynthesis operon of Vibrio anguillarum requires an antisense RNA.

Authors:  Michiel Stork; Manuela Di Lorenzo; Timothy J Welch; Jorge H Crosa
Journal:  J Bacteriol       Date:  2007-03-02       Impact factor: 3.490

Review 3.  Regulatory mechanisms employed by cis-encoded antisense RNAs.

Authors:  Sabine Brantl
Journal:  Curr Opin Microbiol       Date:  2007-03-26       Impact factor: 7.934

4.  Extracytoplasmic function sigma factors regulate expression of the Bacillus subtilis yabE gene via a cis-acting antisense RNA.

Authors:  Warawan Eiamphungporn; John D Helmann
Journal:  J Bacteriol       Date:  2008-12-01       Impact factor: 3.490

Review 5.  Pervasive transcription constitutes a new level of eukaryotic genome regulation.

Authors:  Julia Berretta; Antonin Morillon
Journal:  EMBO Rep       Date:  2009-08-14       Impact factor: 8.807

6.  Studies of the distribution of Escherichia coli cAMP-receptor protein and RNA polymerase along the E. coli chromosome.

Authors:  David C Grainger; Douglas Hurd; Marcus Harrison; Jolyon Holdstock; Stephen J W Busby
Journal:  Proc Natl Acad Sci U S A       Date:  2005-11-21       Impact factor: 11.205

Review 7.  Regulatory RNAs in bacteria.

Authors:  Lauren S Waters; Gisela Storz
Journal:  Cell       Date:  2009-02-20       Impact factor: 41.582

8.  Detection of low-level promoter activity within open reading frame sequences of Escherichia coli.

Authors:  Mitsuoki Kawano; Gisela Storz; B Sridhar Rao; Judah L Rosner; Robert G Martin
Journal:  Nucleic Acids Res       Date:  2005-10-31       Impact factor: 16.971

9.  RegulonDB (version 6.0): gene regulation model of Escherichia coli K-12 beyond transcription, active (experimental) annotated promoters and Textpresso navigation.

Authors:  Socorro Gama-Castro; Verónica Jiménez-Jacinto; Martín Peralta-Gil; Alberto Santos-Zavaleta; Mónica I Peñaloza-Spinola; Bruno Contreras-Moreira; Juan Segura-Salazar; Luis Muñiz-Rascado; Irma Martínez-Flores; Heladia Salgado; César Bonavides-Martínez; Cei Abreu-Goodger; Carlos Rodríguez-Penagos; Juan Miranda-Ríos; Enrique Morett; Enrique Merino; Araceli M Huerta; Luis Treviño-Quintanilla; Julio Collado-Vides
Journal:  Nucleic Acids Res       Date:  2007-12-23       Impact factor: 16.971

10.  Deep sequencing analysis of small noncoding RNA and mRNA targets of the global post-transcriptional regulator, Hfq.

Authors:  Alexandra Sittka; Sacha Lucchini; Kai Papenfort; Cynthia M Sharma; Katarzyna Rolle; Tim T Binnewies; Jay C D Hinton; Jörg Vogel
Journal:  PLoS Genet       Date:  2008-08-22       Impact factor: 5.917

View more
  105 in total

1.  Use of recombinase-based in vivo expression technology to characterize Enterococcus faecalis gene expression during infection identifies in vivo-expressed antisense RNAs and implicates the protease Eep in pathogenesis.

Authors:  Kristi L Frank; Aaron M T Barnes; Suzanne M Grindle; Dawn A Manias; Patrick M Schlievert; Gary M Dunny
Journal:  Infect Immun       Date:  2011-12-05       Impact factor: 3.441

2.  Genome-wide antisense transcription drives mRNA processing in bacteria.

Authors:  Iñigo Lasa; Alejandro Toledo-Arana; Alexander Dobin; Maite Villanueva; Igor Ruiz de los Mozos; Marta Vergara-Irigaray; Víctor Segura; Delphine Fagegaltier; José R Penadés; Jaione Valle; Cristina Solano; Thomas R Gingeras
Journal:  Proc Natl Acad Sci U S A       Date:  2011-11-28       Impact factor: 11.205

Review 3.  RNAs: regulators of bacterial virulence.

Authors:  Jonas Gripenland; Sakura Netterling; Edmund Loh; Teresa Tiensuu; Alejandro Toledo-Arana; Jörgen Johansson
Journal:  Nat Rev Microbiol       Date:  2010-12       Impact factor: 60.633

4.  Preparation of cDNA libraries for high-throughput RNA sequencing analysis of RNA 5' ends.

Authors:  Irina O Vvedenskaya; Seth R Goldman; Bryce E Nickels
Journal:  Methods Mol Biol       Date:  2015

Review 5.  cis-antisense RNA, another level of gene regulation in bacteria.

Authors:  Jens Georg; Wolfgang R Hess
Journal:  Microbiol Mol Biol Rev       Date:  2011-06       Impact factor: 11.056

6.  Evidence-based annotation of transcripts and proteins in the sulfate-reducing bacterium Desulfovibrio vulgaris Hildenborough.

Authors:  Morgan N Price; Adam M Deutschbauer; Jennifer V Kuehl; Haichuan Liu; H Ewa Witkowska; Adam P Arkin
Journal:  J Bacteriol       Date:  2011-08-12       Impact factor: 3.490

Review 7.  Bacterial transcriptomics: what is beyond the RNA horiz-ome?

Authors:  Marc Güell; Eva Yus; Maria Lluch-Senar; Luis Serrano
Journal:  Nat Rev Microbiol       Date:  2011-08-12       Impact factor: 60.633

8.  Convergent transcription confers a bistable switch in Enterococcus faecalis conjugation.

Authors:  Anushree Chatterjee; Christopher M Johnson; Che-Chi Shu; Yiannis N Kaznessis; Doraiswami Ramkrishna; Gary M Dunny; Wei-Shou Hu
Journal:  Proc Natl Acad Sci U S A       Date:  2011-05-23       Impact factor: 11.205

9.  Global transcriptional start site mapping using differential RNA sequencing reveals novel antisense RNAs in Escherichia coli.

Authors:  Maureen K Thomason; Thorsten Bischler; Sara K Eisenbart; Konrad U Förstner; Aixia Zhang; Alexander Herbig; Kay Nieselt; Cynthia M Sharma; Gisela Storz
Journal:  J Bacteriol       Date:  2014-09-29       Impact factor: 3.490

10.  Rho-dependent transcription termination is essential to prevent excessive genome-wide R-loops in Escherichia coli.

Authors:  J Krishna Leela; Aisha H Syeda; K Anupama; J Gowrishankar
Journal:  Proc Natl Acad Sci U S A       Date:  2012-12-18       Impact factor: 11.205

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.