Agrobacterium species are capable of interkingdom gene transfer between bacteria and plants. The genome of Agrobacterium tumefaciens consists of a circular and a linear chromosome, the At-plasmid and the Ti-plasmid, which harbors bacterial virulence genes required for tumor formation in plants. Little is known about promoter sequences and the small RNA (sRNA) repertoire of this and other α-proteobacteria. We used a differential RNA sequencing (dRNA-seq) approach to map transcriptional start sites of 388 annotated genes and operons. In addition, a total number of 228 sRNAs was revealed from all four Agrobacterium replicons. Twenty-two of these were confirmed by independent RNA gel blot analysis and several sRNAs were differentially expressed in response to growth media, growth phase, temperature or pH. One sRNA from the Ti-plasmid was massively induced under virulence conditions. The presence of 76 cis-antisense sRNAs, two of them on the reverse strand of virulence genes, suggests considerable antisense transcription in Agrobacterium. The information gained from this study provides a valuable reservoir for an in-depth understanding of sRNA-mediated regulation of the complex physiology and infection process of Agrobacterium.
Agrobacterium species are capable of interkingdom gene transfer between bacteria and plants. The genome of Agrobacterium tumefaciens consists of a circular and a linear chromosome, the At-plasmid and the Ti-plasmid, which harbors bacterial virulence genes required for tumor formation in plants. Little is known about promoter sequences and the small RNA (sRNA) repertoire of this and other α-proteobacteria. We used a differential RNA sequencing (dRNA-seq) approach to map transcriptional start sites of 388 annotated genes and operons. In addition, a total number of 228 sRNAs was revealed from all four Agrobacterium replicons. Twenty-two of these were confirmed by independent RNA gel blot analysis and several sRNAs were differentially expressed in response to growth media, growth phase, temperature or pH. One sRNA from the Ti-plasmid was massively induced under virulence conditions. The presence of 76 cis-antisense sRNAs, two of them on the reverse strand of virulence genes, suggests considerable antisense transcription in Agrobacterium. The information gained from this study provides a valuable reservoir for an in-depth understanding of sRNA-mediated regulation of the complex physiology and infection process of Agrobacterium.
Agrobacterium tumefaciens, is a broad-host range plant pathogen, which induces the so-called crown gall disease. Its genome comprises four different replicons: a circular chromosome, a linear chromosome, the At-plasmid and the tumor inducing (Ti)-plasmid. The latter includes a segment of DNA (T-DNA) which is excised, transported via a type IV secretion system, and integrated into the plant chromosome. Expression of the T-DNA-encoded genes results in tumor formation by the synthesis of plant hormones and in production of untypical amino acids, either opines or nopalines, which are used by Agrobacterium as nutrients.To induce virulence (vir)-gene expression in the bacterium, the VirAG two-component system is needed. It is stimulated by diverse plant signals, including acetosyringone. The membrane-bound sensor kinase VirA phosphorylates the response regulator VirG, which activates vir-gene transcription by binding to specific 12-bp DNA sequences, called vir-boxes.- Expression of annotated genes after stimulation with acetosyringone has been analyzed in considerable detail.- In contrast, little is known about small regulatory RNAs (sRNAs) in A. tumefaciens in general and under virulence conditions in particular. We attempted to close this gap in the present study.The importance of sRNAs in regulation of bacterial gene expression is now widely appreciated., Most sRNAs, usually ranging from 50 to 500 nucleotides (nt) in length, act through base pairing with target mRNAs and modulate their translation and/or stability. They are often encoded in intergenic regions (IGRs) between two annotated genes and regulate mRNAs encoded elsewhere on the chromosome. These “trans-encoded sRNAs” usually show only limited complementarity with their targets. In contrast, “cis-encoded antisense sRNAs” are partially or entirely expressed from the reverse strand of a protein-encoding DNA and thus share extended regions of full complementarity with their target. Several sRNAs that act as key players in bacterial virulence have been reported., sRNAs are also known to control a variety of cellular processes, e.g., transcription, oligopeptide transport, plasmid replication or cell devision, just to name a few.- Moreover, there is growing evidence that sRNAs regulate multiple, functionally related target genes and thereby control large posttranscriptional regulons.Both, trans- and cis-encoded sRNAs have been identified in a great variety of organisms by a number of different strategies. Biocomputational predictions revealed a number of bacterial sRNAs. Alternatively, numerous sRNAs can be identified by experimental approaches, like cDNA cloning of small-sized RNA species or by tiling microarrays. Most recently, high-throughput sequencing of cDNA (RNA-seq) has revolutionized transcriptome analysis and led to the discovery of a vast number of sRNAs in many organisms, including several microbial pathogens.- Another benefit of RNA-seq approaches is its potential to identify 5′ transcriptional start sites (TSS) with high resolution in an annotation-independent manner.,Although genome-wide profiling for sRNAs has become very popular, our knowledge on the sRNA inventory of α-proteobacteria is still restricted. Tiling array-based transcriptome studies have been reported for Caulobacter crescentus and Rhizobium etli., Deep sequencing uncovered several photooxidative stress induced sRNAs in Rhodobacter sphaeroides. A recent 454-sequencing study with Sinorhizobium meliloti revealed that about 3% of all genes encode trans-encoded sRNAs and about 2% cis-antisense transcripts in this plant symbiont. To the best of our knowledge, the sRNA inventory of a plant pathogen has not been reported yet. The only experimentally characterized A. tumefaciens sRNAs are the Ti-plasmid encoded RepE RNA, which regulates Ti-plasmid replication, and two recently described, homologous trans-acting sRNAs, AbcR1 and AbcR2., AbcR1 plays an important role in controlling the expression of an ABC transporter involved in γ-aminobutyric acid (GABA) and proline uptake.Both latter sRNAs were derived from a biocomputational prediction which was restricted to the circular chromosome of A. tumefaciens. To detect sRNAs experimentally on a genome-wide scale, we used a previously established differential RNA-sequencing (dRNA-seq) approach, which allows for a discrimination of primary transcripts and processed RNAs. The successful identification of 228 sRNA candidates on the four replicons underlines the power of this method and the potential of sRNA-mediated gene regulation in A. tumefaciens.
Results
Sequencing of cDNA libraries from normally grown and virulence-induced Agrobacterium cells
To analyze the RNA population of A. tumefaciens, cDNA libraries of total RNA were analyzed by dRNA-seq. To examine transcriptional differences under virulence and non-virulence conditions, we grew cultures in minimal medium (pH 5.5) in the absence (-Vir) or presence (+Vir) of the vir-gene inducer acetosyringone. After RNA isolation, two different cDNA libraries were constructed for each of the two conditions. One was generated from the original, untreated total RNA, and the other following treatment with terminator 5′-phosphate-dependent exonuclease (TEX) that degrades processed RNAs with a 5′P but not primary transcripts, which carry a 5′PPP and thus leads to a relative enrichment of primary transcripts. The resulting four libraries [-Vir, -Vir (+TEX) and +Vir, +Vir (+TEX)] were then subjected to 454-pyrosequencing.This strategy yielded 422,204 cDNA sequences. 348,998 cDNA reads being longer than 18 nt were mapped to the four A. tumefaciens replicons (Table 1). Correlating with replicon sizes, more cDNAs were mapped to the circular and linear chromosomes than to the plasmids. In the RNA preparations from cells harvested under virulence conditions (+Vir libraries), the number of cDNA reads from the Ti-plasmid significantly increased, suggesting successful virulence-gene induction.
Table 1. cDNA transcripts on all four A. tumefaciens replicons
Library
Total BLAST hits
Circular chrom.
Linear chrom.
At-plasmid
Ti-plasmid
-Vir
46733
21436
22835
1687
775
-Vir (+TEX)
117641
54550
57112
4426
1553
+Vir
50458
19314
26101
1499
3544
+Vir (+TEX)
93251
41194
44935
2674
4448
All sequenced cDNAs longer than 18 nucleotoides (nt) were blasted. Total BLAST hits are listed and separated on the A. tumefaciens replicons.
All sequenced cDNAs longer than 18 nucleotoides (nt) were blasted. Total BLAST hits are listed and separated on the A. tumefaciens replicons.
Annotation of transcriptional start sites
The terminator exonuclease treatment enriches for primary transcripts characterized by a 5′ tri-phosphate end, leading to a characteristic cDNA enrichment pattern (Fig. 1AB). TSS were retrieved from regions upstream of annotated genes. At least five reads starting at exactly the same nucleotide position were required to define a TSS (see precise mapping criteria in the material and methods section). Overall, 356 potential TSS were identified upstream of annotated ORFs (Table S1), including the previously published TSS for flaA, flaB, and ccrM., Exemplarily, screenshots of the known TSS of flaA, coding for a flagella associated protein (Fig. 1A) and the newly identified TSS of ftsH, encoding a membrane-anchored protease (Fig. 1B) are demonstrated in Figure 1. 32 additional TSS belong to tRNA transcripts and 5S rRNA (Table S2). Start sites that were mapped for tRNA transcripts slightly differed from the corresponding NCBI annotations, most probably because mature tRNAs rather than the authentic unprocessed pre-tRNA transcripts had been annotated. 239 of all 5′ ends were located on the circular, 89 on the linear, 20 on the At- and 8 on the Ti-plasmid. Distances between the identified TSS and start codons of protein-coding genes ranged from 0 to 544 nt with an average distance of 87 nt (Fig. 1C). 32 mRNAs possessed 5′-untranslated regions (5′UTRs) of less than 10 nt, and can thus be classified as leaderless mRNAs. For six genes (indicated by asterisks in Table S1), a TSS within the coding sequence hinted at an incorrectly annotated start codon.
Figure 1. (A) and (B) Enrichment of primary 5′ ends by TEX treatment (in –Vir libraries). Screenshots of (A) flaA and (B) ftsH are shown. The distance between the TSS and the AUG start codon is indicated. (C) Histogram representing the 5′UTR length distribution. (D) A. tumefaciens consensus promoter sequence. Sequences 40 nt upstream and 10 nt downstream of all identified TSS were extracted and aligned. Conserved sequences were visualized with the WebLogo 3.0 program. Numbering of bases is given relative to the TSS.
Figure 1. (A) and (B) Enrichment of primary 5′ ends by TEX treatment (in –Vir libraries). Screenshots of (A) flaA and (B) ftsH are shown. The distance between the TSS and the AUG start codon is indicated. (C) Histogram representing the 5′UTR length distribution. (D) A. tumefaciens consensus promoter sequence. Sequences 40 nt upstream and 10 nt downstream of all identified TSS were extracted and aligned. Conserved sequences were visualized with the WebLogo 3.0 program. Numbering of bases is given relative to the TSS.Upstream 40 nt of each identified TSS were extracted in order to compile putative promoter sequences. The deduced Agrobacterium-specific -35 and -10 regions (CTTGNN and -10: TATNNT, respectively; Figure 1D) deviated from typical E. coli housekeeping promoters (-35: TTGACA and -10: TATAAT) at several positions. In addition, a conserved motif (CAT) from the -1 to +2 position was observed.
Validation of the dRNA-seq Data Set
To evaluate the dRNA-seq results, we compared them with a recently published microarray study conducted with cells under identical virulence and non-virulence conditions. Seventy-one genes were found to be upregulated by a factor of at least two in the array study (Table S3). 45 of these also showed increased expression in the dRNA-seq data, indicating that dRNA-seq can provide reliable quantitative estimates of transcript levels in A. tumefaciens. A consistent set of 22 genes was strongly virulence-induced in both experimental setups. The classical virulence-induced genes or operons virA, virB, virD, virE, virH and tzs were included in this group (Fig. 2A). Among the most strongly induced genes in both approaches was virB1, the first gene of an extended operon of 11 genes, coding for the major components of the T4SS (Fig. 2B).
Figure 2. Virulence-induced genes/operons. (A) List of six classical virulence-induced genes. The fold changes obtained from dRNA-seq are compared with previously published microarray data. (B) Screenshot of the virB virulence induction. cDNA reads of +Vir (top) and -Vir (bottom) libraries are shown. The virB operon and the distance of the mapped TSS to the virB1 AUG start codon are indicated.
Figure 2. Virulence-induced genes/operons. (A) List of six classical virulence-induced genes. The fold changes obtained from dRNA-seq are compared with previously published microarray data. (B) Screenshot of the virB virulence induction. cDNA reads of +Vir (top) and -Vir (bottom) libraries are shown. The virB operon and the distance of the mapped TSS to the virB1 AUG start codon are indicated.
Identification of sRNA Transcripts
For the discovery of sRNAs in A. tumefaciens, cDNA reads along the whole genome were visualized by calculating graph files and displaying them in the Integrated Genome Browser (IGB, Affymetrix). We manually analyzed the 454-sequencing results for transcripts derived from intergenic regions or for reads complementary to protein-coding genes. Transcripts were listed as putative sRNA if they were represented by five or more cDNA reads in at least one library. This procedure resulted in a total of 228 candidate sRNAs (Table S4), including cis-encoded antisense sRNAs that overlap partially or completely with annotated ORFs. sRNAs were identified on all four A. tumefaciens replicons, namely 129 on the circular chromosome, 59 on the linear chromosome, 20 on the At- and 20 on the Ti-plasmid, respectively (Fig. 3). Interestingly, more than half of the sRNAs found on the two plasmids were cis-antisense sRNAs, whereas the majority of chromosomally encoded sRNAs were trans-encoded.
Figure 3.Distribution of sRNAs on all four . The diagram depicts all identified sRNAs on the circular chromosome, the linear chromosome, the At- and the Ti-plasmid. Numbers of trans- and cis-encoded transcripts are given.
Figure 3.Distribution of sRNAs on all four . The diagram depicts all identified sRNAs on the circular chromosome, the linear chromosome, the At- and the Ti-plasmid. Numbers of trans- and cis-encoded transcripts are given.As expected, transcripts of four housekeeping sRNAs, namely the 6S RNA, the signal recognition particle (SRP) RNA 4.5S, RNase P, and tmRNA were identified. The RNase P transcript, which is encoded on the circular chromosome, was found to be located some nucleotides further upstream from its annotated site. The three recently published sRNAs RepE, AbcR1 and AbcR2 were also present in the data set.,
Expression profiling of 22 sRNAs
We used RNA gel blot analysis to confirm selected sRNAs by an independent method and to monitor their expression under various conditions (Table 2). Twenty-two of 28 candidates were detected (Figs. 4, 5, 6 and S1). They were named according to their position on the genome (e.g., C1, C2 etc. on the circular chromosome; L1, L2 etc. on the linear chromosome; see Table S4). Homologs of several of the validated chromosomal sRNAs (C2, C5, C6 and L5) have previously been reported in Rhizobium species (see Table S4 for references). All other sRNAs are initially described in this study. Blast searches revealed that C7, C10, L2, L3, L6, At2, Ti1 and Ti4 are unique to Agrobacterium, whereas sequences homologous to C1, C3, C4, C8, C9, L1, L4, At1, Ti2 and Ti3 can be found in at least one other α-proteobacterium (data not shown).
Table 2. Validated sRNAs
sRNA
Position
Flanking Genes
Antisense Gene
Size dRNA-seq
Size Northern
circular chromosome
C1
109476 – 109593
-
atu0105
117
100, 70
C2
112675 – 112536
atu0109 / atu8081
-
139, 98
140, 95
C3
317175 – 317330
atu0323 / atu0324
-
155
170, 70
C4
998103 – 998034
atu1000 / atu1001
-
69
50
C5
1275442 – 1275296
-
atu1287
146
140
C6
1288585 – 1288698
atu1298 / atu1299
-
113
100
C7
1345809 – 1345652
atu1350 / atu1351
-
157, 103
160, 105
C8
1995974 – 1995854
atu2036 / atu2038
-
120
120
C9
2087200 – 2087380
atu2122 / atu2123
-
180
180
C10
2667194 – 2667282
atu2683 / atu2684
-
88
85
sRNAs are named after their location on the genome. Position on the genome, flanking genes or antisense transcript and sizes estimated from dRNA-seq data and from RNA gel blots are listed.
Figure 4. Differential expression of 10 sRNAs from the circular chromosome. Schematic diagrams of the genomic context of each sRNA are given on the left. Northern hybridizations were performed with RNA from A. tumefaciens from various growth conditions, i.e., different media, growth-phases, temperatures and pH-value, as indicated. Eight µg of total RNA were separated on a 10% polyacrylamide gel containing 7 M urea and detected by RNA gel blot analysis using RNA probes. Sizes of the sRNAs are given on the right. Ethidiumbromide-stained tRNAs were used as loading controls.
Figure 5.Differential expression of sRNAs from the linear chromosome and both Validation of (A) two sRNAs located on the linear chromosome, (B) one on the At-plasmid and (C) one on the Ti-plasmid. Conditions were as described in Figure 4.
Figure 6. The virulence-induced transcript Ti2. (A) Genomic location of Ti2. (B) Schematic drawings of Ti2-dRNA-seq results. From top to bottom, diagrams from +Vir (+TEX), +Vir, -Vir (+TEX) and -Vir libraries are shown. (C) Predicted secondary structure of Ti2 calculated by the mfold program. (D) RNA gel blot analysis of Ti2 in the A. tumefaciens WT and VirA- and VirG deletion mutants. The three strains were grown in AB medium in the absence (-Vir) or the presence (+Vir) of the vir-gene inducer acetosyringone. The Ti2 size is given on the left. Ethidiumbromide-stained tRNAs were used as loading controls.
sRNAs are named after their location on the genome. Position on the genome, flanking genes or antisense transcript and sizes estimated from dRNA-seq data and from RNA gel blots are listed.Figure 4. Differential expression of 10 sRNAs from the circular chromosome. Schematic diagrams of the genomic context of each sRNA are given on the left. Northern hybridizations were performed with RNA from A. tumefaciens from various growth conditions, i.e., different media, growth-phases, temperatures and pH-value, as indicated. Eight µg of total RNA were separated on a 10% polyacrylamide gel containing 7 M urea and detected by RNA gel blot analysis using RNA probes. Sizes of the sRNAs are given on the right. Ethidiumbromide-stained tRNAs were used as loading controls.Figure 5.Differential expression of sRNAs from the linear chromosome and both Validation of (A) two sRNAs located on the linear chromosome, (B) one on the At-plasmid and (C) one on the Ti-plasmid. Conditions were as described in Figure 4.Figure 6. The virulence-induced transcript Ti2. (A) Genomic location of Ti2. (B) Schematic drawings of Ti2-dRNA-seq results. From top to bottom, diagrams from +Vir (+TEX), +Vir, -Vir (+TEX) and -Vir libraries are shown. (C) Predicted secondary structure of Ti2 calculated by the mfold program. (D) RNA gel blot analysis of Ti2 in the A. tumefaciens WT and VirA- and VirG deletion mutants. The three strains were grown in AB medium in the absence (-Vir) or the presence (+Vir) of the vir-gene inducer acetosyringone. The Ti2 size is given on the left. Ethidiumbromide-stained tRNAs were used as loading controls.As bacterial sRNAs are often involved in stress adaptation, their expression profiles were recorded under different growth conditions, i.e., different cultivation media, growth phases, temperatures and pH values (Figs. 4 and 5). Seven sRNAs were hardly detectable and thus were only tested under normal and virulence conditions (Fig. S1). Generally, plasmid-encoded sRNAs and cis-antisense sRNAs were more difficult to detect than chromosome-encoded or trans-acting candidates.Expression of the C1, C4, C5 and C7 RNAs was barely affected by the growth conditions (Fig. 4). Four sRNAs displayed a growth-phase dependent expression profile. C6, C10, L2 and At1 were strongest expressed at high optical densities, whereas C9 was most abundant in early stages of growth. Consistent with the remarkably high number of sequence reads (2,470), the C10 RNA showed the strongest expression in the Northern analysis (Figs. Four and 5). The expression of other sRNAs responded to temperature (L5 and Ti4) and pH alterations (C2, C3, C8 and Ti4), suggesting a biological role under these conditions.In most cases, the transcript lengths estimated from Northern hybridizations were consistent with the lengths determined in the dRNA-seq approach (Table 2). However, eight sRNAs (C1, C2, C3, C7, L1, L4, L5 and Ti1) showed more than one Northern signal, suggesting that these sRNAs are processed into two or three smaller products. In several cases (C1, C2, C7 and L5) such processing events were already apparent in the sequencing data set (data not shown).A total of nine cis-antisense transcripts were validated by RNA gel blot analysis. A fraction of them (C5, L4, Ti4) overlap only partly with the open reading frame encoded on the opposite strand. For instance, Ti4 reaches 48 nt into the 3′end of the coding sequence of the virulence-induced gene virC2. Other antisense sRNAs (C1, L1, L3, AT2, Ti1 and Ti3) show complete or nearly complete complementarity to an open reading frame on the other DNA strand. Interestingly, the virulence-gene virB9 (atu6175) and the conjugal transfer gene traB (atu6129) have an sRNA with perfect complementarity encoded on the reverse strand (Ti3 and Ti1, respectively; Fig. S1).
The virulence-induced sRNA Ti2
The Ti2 RNA, which is encoded in the intergenic region of virF-atu6155, was almost exclusively expressed under virulence conditions (Fig. 6). With 1,772 cDNA reads, the 113 nt transcript was the most abundant transcript from the Ti-plasmid. Ti2 is located upstream of the atu6155-virK-operon and is preceded by a vir-box sequence (Fig. 6A). The predicted secondary structure of Ti2 is given in Figure 6C. RNA gel blot experiments confirmed the dramatic induction in the presence of the virulence-gene inductor acetosyringone (Fig. 6D). Ti2 was not detectable at all under any other growth condition (data not shown).Induction of virulence-genes in A. tumefaciens is mediated by the VirA/VirG two-component system, which is activated by several plant signals. To check whether expression of Ti2 is dependent on these two regulatory proteins, Northern analysis was performed with RNA from virA and virG deletion mutants. Since acetosyringone-induced Ti2 expression was lost in these backgrounds, Ti2 is a likely target of the VirA/VirG system (Fig. 6D).
Discussion
TSS mapping in A. tumefaciens
Within only a few years, RNA-seq technologies have outperformed classical transcriptome analyses by microarrays, in particular when it comes to the precise mapping of 5′ends of transcripts. In the present study, we used 454-pyrosequencing to deduce an A. tumefaciens consensus housekeeping promoter from the start sites of a total of 388 primary transcripts. Knowledge on α-proteobacterial promoter sequences is valuable because it is commonly observed that genes from Agrobacterium and Rhizobium species are poorly expressed in E. coli when transcribed from their authentic promoters although their -10 and -35 regions often resemble E. coli-type sigma 70 promoters. The Agrobacterium consensus promoter extracted from our dRNA-seq data (5′-CTTGNN-N18-TATNNT) deviates slightly from the promoter (5′-CTTGAC-N17-CTATAT) compiled from nearly 100 known and predicted promoters in S. meliloti and other rhizobia. Notably, an additional conserved box with a CAT motif was identified around the TSS. Our data will thus aid in the prediction of promoters of other Agrobacterium genes of interest.A. tumefaciens genes and proteins prevalent under virulence conditions have been identified by several global studies using microarray or proteomics studies, respectively.- Consistent with these findings, we confirmed that nearly all virulence-induced genes were located on the Ti-plasmid. Our approach revealed the virulence-induced short transcript Ti2 that had never been detected in the previous studies. Virulence induction in A. tumefaciens depends on the VirA/VirG two component system. Using a vir-box consensus sequence published by Cho and Winans, we searched for typical VirG binding sites upstream newly annotated TSS from virulence-associated genes. A vir-box with only one mismatch from the given consensus sequence was found upstream of virF, virB1 and the newly identified sRNA candidate Ti2. Among the most strongly induced transcripts were Ti2 and virB1. The corresponding vir-boxes are located 61.5 nt (Ti2) and 63.5 nt (virB1) upstream of the TSS (counted from the vir-box center). For virF, which was only slightly induced under virulence conditions, a vir-box-TSS distance of 53.5 nt was observed. As proposed previously, both the sequence and the position of the vir-box relative to the TSS seem to determine the induction rate of virulence-genes in A. tumefaciens.
sRNA discovery: prediction vs. experiment
Prior to a wave of studies taking advantage of RNA-seq approaches, sRNAs have primarily been discovered through bioinformatics predictions and subsequent experimental validation. In a recently published study from our lab, we used a comparative bioinformatics approach to find sRNAs on the circular chromosome of A. tumefaciens. All IGRs longer than 50 nt on the circular chromosome were compared with different rhizobial genomes. Regions with high sequence conservation were collected as candidates for sRNA-encoding genes. A total of 231 sRNAs were predicted, four of which were validated by RNA gel blot analysis. 129 sRNAs were identified on the circular chromosome by deep sequencing. The overlapping sRNA candidates pool consists of only 26 sRNAs (Fig. 7). This may have several reasons. (1) It is quite possible that some of the computer-predicted sRNAs are not expressed at all. In fact, 9 out of 13 of the predicted sRNAs could not be identified by RNA gel blot hybridization. (2) It is also possible that suitable conditions, under which some of the in silico predicted candidates are expressed, have not been analyzed yet. It is noteworthy that many sRNAs in the present study and in other studies are differentially expressed under various stress conditions and might thus have escaped detection under the two conditions used for our pyrosequencing approach.,,- (3) Since the computer-based search was restricted to IGRs, it will have missed all cis-antisense sRNAs. (iv) Comparative bioinformatics is limited to the discovery of sRNAs that are conserved in closely related species. Only experimental approaches, like RNA-seq, will be able to discover unique sRNAs with a specific function in an organism of interest.
Figure 7. Comparison of sRNA identification by computer prediction and dRNA-seq. Venn diagram comparing the results for the circular chromosome of A. tumefaciens. Twenty-six sRNAs were found by both methods.
Figure 7. Comparison of sRNA identification by computer prediction and dRNA-seq. Venn diagram comparing the results for the circular chromosome of A. tumefaciens. Twenty-six sRNAs were found by both methods.In this context, findings on the sRNome of Listeria monocytogenes are noteworthy because most exhaustive sRNA searches using bioinformatics, tiling arrays and deep sequencing techniques have been performed in this organism.,- Despite substantially overlapping sets of sRNAs, each of the four studies reported up to 71 unique sRNAs. Such findings lend value to each of the possible strategies used for sRNA identification.
sRNAs on all four A. tumefaciens replicons
56% of the 228 identified sRNAs were located on the circular chromosome, 26% on the linear chromosome, 9% on the At-plasmid, and 9% on the Ti-plasmid. sRNAs being located on mega-plasmids have been described earlier in several bacteria, including S. meliloti, R. etli or Chlamydia trachomatis, indicating that non-essential plasmid-encoded functions can be prone to sRNA-mediated regulation.,, Except for sRNAs involved in the control of plasmid replication, the function of all of these sRNAs is currently unknown.,Homologous sRNAs of some identified candidates exist in related bacteria (Table S4), suggesting they might have conserved functions. Homologs of the 270-nt L4 sequence were found in all other rhizobia and many additional α-proteobacteria. This transcript always codes for a small protein of unknown function. A closer inspection of A. tumefaciens L4 revealed a possible AUG start codon preceded by an AG-rich sequence, which may serve as Shine-Dalgarno sequence. Thus, L4 might code for an unannotated peptide rather than a small RNA. Small proteins routinely missed by automated genome annotation and classical proteomic studies have recently received a lot of attention.,- Another interesting class of short transcripts serves dual functions as regulatory RNA and as mRNA, e.g., RNA III from Staphylococcus aureus, the SgrS RNA from E. coli or SR1 from Bacillus subtilis.-For some of the identified sRNAs more than one transcript was observed, suggesting posttranscriptional processing of a primary transcript. Many enterobacterial sRNAs are activated or inactivated by processing events.,- In many bacterial species, RNaseE is responsible for sRNA processing. ArcZ for instance is processed by RNaseE into three transcript variants in E. coli and Salmonella., Since an RNaseE homolog is present in A. tumefaciens, it might be involved in sRNA processing.In most cases, cis-antisense sRNAs modulate expression of the overlapping sense gene on transcriptional or posttranscriptional level, allowing us to speculate about some putative sRNA targets. The sRNA L3 is located antisense to tnp, encoding a transposase. Additionally, we found a second antisense sRNA overlapping with another transposase gene (antisense sRNA to atu4601, Table S4). Thus, transposon mobility might be regulated by an RNA-antisense mechanism in A. tumefaciens, as previously reported for E. coli. Likewise, a number of other studies revealed sRNAs in opposite orientation to transposase transcripts.,,Other cis sRNAs identified and verified in this study are encoded antisense to exoX (exopolysaccharide production repressor protein), traB (conjugal transfer protein), virC2 (excision of the T-DNA) and virB9 (component of the typeIV secretion system). As the latter two putative antisense sRNA targets encode important virulence factors, it is tempting to speculate that the corresponding sRNAs (Ti1 and Ti4) play regulatory roles in the A. tumefaciens plant infection process.One of the most appealing sRNA revealed in this study is Ti2. This transcript was highly induced under virulence conditions in dependence of the VirA/VirG two component system. The distance between the 3′ end of Ti2 and the downstream operon atu6155-virK is only 93 nt. Microarrays demonstrated induction of the atu6155-virK transcript., However, the function of both genes remains unclear. VirK might be involved in the wide host range of Agrobacterium, although an A. tumefaciensvirK deletion strain was not defective in tumor formation.,, As apparent from the dRNA-seq data, atu6155 is not transcribed from a separate TSS but—together with virK—derives from the promoter upstream of Ti2. The remarkable differential accumulation of Ti2 and the downstream mRNA raises the question whether elongation of most of the transcripts initiated at the VirAG-promoter is prematurely terminated at the 3′end of Ti2. Alternatively, the Ti2 RNA could be a processing product from the 5′UTR of atu6155. The ubiquitous 6S RNA and the Vibrio MicX sRNA are two examples processed from longer transcripts with their neighboring genes., Small RNA species derived from processing of 3′ or 5′ UTRs were reported for E. coli.
Cis-regulatory elements (including riboswitches), which can act in trans as non-coding RNA have been described in several organisms.,,We wondered whether the massively induced Ti2 RNA plays a role in the virulence of A. tumefaciens. Like a virK mutant, deletion of Ti2 did not significantly affect tumorigenesis of Kalanchoe daigremontiana leaves (data not shown). Assuming that the massive induction of Ti2 should have a physiological function we are going to do proteomics and/or transcriptomics studies to identify possible targets of Ti2. Similar studies will be performed with other selected sRNA mutants.Some recent studies hypothesized a secretion of sRNAs into host cells akin to effector protein translocation.,, It certainly is an attractive possibility that A. tumefaciens directly transfers sRNAs or—along with the T-DNA—sRNA genes into the host during plant transformation, which might mimic plant microRNA precursors. A recent study demonstrated that the plant microRNA pathway is essential for Agrobacterium disease development. Additionally, it is known that microRNAs are involved in defense mechanisms and in plant-microbe interactions including the response to A. tumefaciens.,The large number of Agrobacterium sRNAs, some of which are differentially expressed, together with our recent finding that a single sRNA like AbcR1 can address several targets promises many interesting findings for the numerous sRNAs in this plant pathogen.
Material and Methods
Bacterial growth conditions and stress experiments
A. tumefaciens C58 was cultivated in YEB complex medium or in AB minimal medium at 30°C. For heat and cold shock conditions, cells were grown in YEB medium to an OD600 of 0.5 and subsequently shifted to pre-warmed or pre-cooled flasks to either 42°C (heat shock) or 17°C (cold shock). Cells for RNA isolation were collected before the temperature shift and after 5 and 10 min. pH-stress was performed in AB minimal medium adjusted to different pH values (5.5 and 7). After 5 h of growth (OD600 of 0.2), cells were harvested for RNA preparation. To induce virulence-gene expression, A. tumefaciens cells were pre-cultivated in AB minimal medium to an OD600 of 0.2 at 30°C before addition of acetosyringone (Sigma-Aldrich, Munich, Germany) to a final concentration of 0.1 mM. Cells were further incubated for 16 h at 23°C. Control cultures (non-induced) were treated with solvent (DMSO and H2O) only. Virulence induction in samples used for dRNA-seq was confirmed by protein gel blotting with a VirB9-probe using standard protocols.
RNA preparation
Cells (10 ml) were harvested by centrifugation. After washing in ice cold AE-buffer (20 mM Na acetate, pH 5.5), pellets were immediately frozen in liquid nitrogen. Total RNA of cultured bacteria was isolated using the hot acid phenol method. Precipitated RNA used for 454-sequencing was treated with 100 U of DNase I (Roche, Mannheim, Germany) for 30 min at 37°C in a reaction volume of 100 µl. SUPERase InTM (100 U, Ambion, Huntingdon, UK) was added in the reaction to inhibit RNase activity. RNA was checked for integrity by agarose gel electrophoresis. The absence of genomic DNA was controlled by PCR.
cDNA library construction and sequence analysis
cDNA libraries were prepared at Vertis Biotechnology AG (Germany) and analyzed on a Roche FLX sequencer as previously described. Primary transcripts of total RNA were enriched by performing a selective degradation of RNAs containing a 5′mono-phosphate (5′P) using a 5′P-dependent terminator exonuclease (Epicenter). For each library [-Vir, -Vir (+TEX), +Vir, +Vir (+TEX)], graphs representing the number of mapped reads per nucleotides were calculated and visualized using the Integrated Genome Browser software from Affymetrix for each of the four A. tumefaciens replicons.
TSS mapping
Transcriptional start sites (TSS) were identified similar to previously published methods. Briefly, sequence reads were mapped to the A. tumefaciens reference genome (NC_003062.2, NC_003063.2, NC_003064.2, NC_003065.3) using Bowtie.,, Combined mapping data from the samples that were enriched for primary transcripts were then used to map TSS according to the following criteria with custom-made Perl scripts: Coverage at the TSS site was ≥ 5 reads and at least 5-fold the number of reads than the previous position in 5′ direction (saw-tooth pattern at the TSS). Coverage at the TSS had to be higher in the samples enriched for primary transcripts than in those not enriched, and coverage at the 18 nucleotide positions downstream of the TSS had to be at least 0.8-fold that of the TSS to prevent spurious annotation. TSS in front of annotated features were listed.
Comparison of dRNA-seq Data to Published Microarray Results
For a quantitative analysis of gene expression, Bowtie mapping data (see previous section) were used to determine the number of reads for each annotated CDS in the A. tumefaciens reference genome using custom-made Perl scripts based on BioPerl packages. Quantitative analysis of read counts was performed with the Bioconductor package DESeq that allows the analysis of experimental data even without replicates (www.bioconductor.org). Comparison of expression ratios (+Vir/-Vir) from this study (dRNA-seq) with previously published microarray data was performed with custom-made Perl scripts.
Northern analysis
For the detection of small RNAs, 8–10 µg total RNA was separated on 10% polyacrylamide gels containing 7 M urea and subsequently transferred onto nylon membranes by semi-dry electroblotting (Biometra, Göttingen, Germany). Hybridizations were performed at 37–48°C over night using digoxygenine (DIG)-labeled DNA- or RNA probes (primers used for probe generation are listed in table S5), which were produced according to the instruction manual (Roche, Mannheim, Germany). After hybridization, membranes were first washed for 5 min at room temperature with solution 1 (5 x SSC, 0.1% SDS) followed by incubation with solution 2 (1 x SSC, 0.1% SDS) and 3 (0.5 x SSC, 0.1% SDS). Detection was performed by exposing the blot to a luminescence detector, using chemiluminescence substrate (CDP-Star; Roche Molecular Biochemicals, Mannheim, Germany).
Authors: D W Wood; J C Setubal; R Kaul; D E Monks; J P Kitajima; V K Okura; Y Zhou; L Chen; G E Wood; N F Almeida; L Woo; Y Chen; I T Paulsen; J A Eisen; P D Karp; D Bovee; P Chapman; J Clendenning; G Deatherage; W Gillet; C Grant; T Kutyavin; R Levy; M J Li; E McClelland; A Palmieri; C Raymond; G Rouse; C Saenphimmachak; Z Wu; P Romero; D Gordon; S Zhang; H Yoo; Y Tao; P Biddle; M Jung; W Krespan; M Perry; B Gordon-Kamm; L Liao; S Kim; C Hendrick; Z Y Zhao; M Dolan; F Chumley; S V Tingey; J F Tomb; M P Gordon; M V Olson; E W Nester Journal: Science Date: 2001-12-14 Impact factor: 47.728
Authors: Jason E Stajich; David Block; Kris Boulez; Steven E Brenner; Stephen A Chervitz; Chris Dagdigian; Georg Fuellen; James G R Gilbert; Ian Korf; Hilmar Lapp; Heikki Lehväslaiho; Chad Matsalla; Chris J Mungall; Brian I Osborne; Matthew R Pocock; Peter Schattner; Martin Senger; Lincoln D Stein; Elia Stupka; Mark D Wilkinson; Ewan Birney Journal: Genome Res Date: 2002-10 Impact factor: 9.043
Authors: B Goodner; G Hinkle; S Gattung; N Miller; M Blanchard; B Qurollo; B S Goldman; Y Cao; M Askenazi; C Halling; L Mullin; K Houmiel; J Gordon; M Vaudin; O Iartchouk; A Epp; F Liu; C Wollam; M Allinger; D Doughty; C Scott; C Lappas; B Markelz; C Flanagan; C Crowell; J Gurson; C Lomo; C Sear; G Strub; C Cielo; S Slater Journal: Science Date: 2001-12-14 Impact factor: 47.728
Authors: Philip Möller; Philip Busch; Beate Sauerbrei; Alexander Kraus; Konrad U Förstner; Tuan-Nan Wen; Aaron Overlöper; Erh-Min Lai; Franz Narberhaus Journal: J Bacteriol Date: 2019-05-08 Impact factor: 3.490
Authors: Steven Slater; João C Setubal; Brad Goodner; Kathryn Houmiel; Jian Sun; Rajinder Kaul; Barry S Goldman; Stephen K Farrand; Nalvo Almeida; Thomas Burr; Eugene Nester; David M Rhoads; Ryosuke Kadoi; Trucian Ostheimer; Nicole Pride; Allison Sabo; Erin Henry; Erin Telepak; Lindsey Cromes; Alana Harkleroad; Louis Oliphant; Phil Pratt-Szegila; Roy Welch; Derek Wood Journal: Appl Environ Microbiol Date: 2012-12-14 Impact factor: 4.792
Authors: Alexander Kraus; Mareen Weskamp; Jennifer Zierles; Miriam Balzer; Ramona Busch; Jessica Eisfeld; Jan Lambertz; Marc M Nowaczyk; Franz Narberhaus Journal: J Bacteriol Date: 2020-10-22 Impact factor: 3.490