| Literature DB >> 18793445 |
Claudio Valverde1, Jonathan Livny, Jan-Philip Schlüter, Jan Reinkensmeier, Anke Becker, Gustavo Parisi.
Abstract
BACKGROUND: Small non-coding RNAs (sRNAs) have emerged as ubiquitous regulatory elements in bacteria and other life domains. However, few sRNAs have been identified outside several well-studied species of gamma-proteobacteria and thus relatively little is known about the role of RNA-mediated regulation in most other bacterial genera. Here we have conducted a computational prediction of putative sRNA genes in intergenic regions (IgRs) of the symbiotic alpha-proteobacterium S. meliloti 1021 and experimentally confirmed the expression of dozens of these candidate loci in the closely related strain S. meliloti 2011.Entities:
Mesh:
Substances:
Year: 2008 PMID: 18793445 PMCID: PMC2573895 DOI: 10.1186/1471-2164-9-416
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
First compilation of S. meliloti chromosomal IgRs predicted to encode sRNAs.
| 1 | Rfam | < | s70 | n | eubacteria | ~160 nt | ~155 nt | <, H, C, S, A, B, O | 6S RNA | [ | |
| 2 | OT | > | NN – s70 | y | ~80 nt | duplet < 100 nt | - | [ | |||
| 3 | SP | < | NN – s70 | y | ~150 nt | ~165 nt | <, A, C, H, S | [ | |||
| 4 | tR | < | - | y | ? | multiple ~150 $~& 80 nt mainly | - | this work | |||
| 5 | tR | > | NN – s70 | y | - | ~240 nt | no bands detected | >, S (*) | [ | ||
| 6 | OT | > | NN | y | ~100–200 nt | not done 6 | >, A, C | - | this work | ||
| 7 | OT | < | NN | y | ~390 nt | duplet, ~135 & ~200 nt | <, A, C, O, S | this work | |||
| 8 | OT | > | NN – s70 – PhoB | y | ~170 nt | not done 6 | - | this work | |||
| 9 | OT | > | s70 | y | ~180 nt | not done | - | this work | |||
| 10 | SP | > | s70 | y | ~190 nt | ~175 nt | - | [ | |||
| 11 | SP | < | s70 | y | ~100–120 nt | duplet, ~120 $~& 110 nt | - | this work | |||
| 12 | SP | > | s70 | y | ~150 nt | not done 6 | >, A, B, C, H, O, S | [ | |||
| 13 | SP | > | NN – s70 | y | ~150 nt | duplet, ~140 & ~90 nt | - | [ | |||
| 14 | SP | > | s70 | y | ~60–150 nt | multiple, ~90–100 & ~180–210 | >, B, C, O, | this work | |||
| 15 | SP | > | s70 | y | ~110 nt | ~80 nt | >, A, B, C, H, O, S | this work | |||
| 16 | SP | > | s70 | y | ~110 nt | ~100 nt | - | this work | |||
| 17 | SP | > | s70 | y | ~120 nt | not done | >, A, B, O, S | this work | |||
| 18 | OT | > | - | y | ~80 nt | not done | >, A, C, H, O, S | this work | |||
1 Rfam, RNA families database [33,67]; OT, IgR with annotated orphan terminator; tR, IgR flanking tRNAs; SP, sRNAPredictHT.
2 The transcribed strand is indicated (>, forward; <, reverse)
3 Putative promoter sequence detected by conservation in sequence alignment, search of consensus sequences or neural network promoter prediction. s70, sequence highly similar to the S. meliloti constitutive promoter consensus sequence [49]; NN, neural network promoter prediction [47]; PhoB, sequence detected with homology to the PhoB transcriptional regulator binding consensus [51].
4 Sm, Sinorhizobium medicae WSM419; Rl, Rhizobium leguminosarum biovar viciae 3841; Re, Rhizobium etli CFN42; At, Agrobacterium tumefaciens C58; Ml, Mesorhizobium loti MAFF303099; Oa, Ochrobactrum anthropi ATCC49188; Msp, Mesorhizobium sp. BNC1; Bo, Brucella ovis ATCC25840.
5 Approximate length of sRNA based on putative 5' and 3' ends.
6 PCR failed to amplify the IgR.
7 Detection on the oligonucleotide microarray Sm14kOLI. > and < denote the orientation of the detected signal (absolute M-value ≥ 2.5 represents an enrichment of small RNA transcripts). A, B, C, H, O and S denote the stress condition under which the signal was detected (A, acidic; B, basic; C, cold shock; H, heat shock; O, oxidative, S, saline). The coordinates of the oligonucleotide probes that gave positive signals are indicated in brackets. -, not detected under the conditions studied. (*), M-value ≥ 2.
8 Gene name as designated in the literature (smrC#, sra#) [26,27]or according to our predictive scheme (sm#).
Figure 1Northern blot analysis of putative sRNAs encoded in the chromosome of . Total RNA was isolated from S. meliloti 2011 cells grown at 28°C with agitation (120 rpm) in RDM minimal medium and harvested at OD600 = 0.5 (Exp) or at OD600 = 3.9 (Stat). Total RNA was also isolated from cells subjected to high salt stress (NaCl; 0.3 M NaCl in RDM, OD600 = 0.55), membrane stress (EtOH; RDM with 2% v/v ethanol, OD600 = 1.6; or SDS; RDM with 0.1% w/v SDS; OD600 = 1.0), phosphate starvation (-P; RDM with 0.1 mM phosphate and 10 mM MOPS pH 7.0, OD600 = 1.0), oxidative stress (H2O2; RDM with 0.1 mM H2O2, OD600 = 1.1), heat stress (37°; RDM grown at 37°C, OD600 = 0.95) and acid stress (pH 5.5; treatment of exponential phase cells at OD600 = 0.5 during 90 min at pH 5.5 before harvest). Northern hybridizations were done with PCR-generated digoxigenin-labeled dsDNA probes directed against the entire IgR or an internal fragment (see Figure 2 and Additional file 2 for further details). RNA molecular weight markers (with their sizes indicated in nt with small arrows at the left of each panel) were run with each set of RNA samples for estimation of transcript size. As exposure times were optimized for visualization here, the signal intensity does not indicate relative abundance of detected transcripts between different IgRs. Hybridization signals were quantified with ImageJ software, normalized to the amount of 5S RNA, 4.5S RNA and tRNA bands detected in silver stained gels present in each sample (bottom panel) and plotted in a bar graph shown below each Northern blot. The band intensity units are relative to the normalized amount present in Exp cells, which were set as 1.0.
Top 20 highest-scoring putative sRNA genes predicted by the global scoring procedure as independent transcriptional units in chromosomal IgRs of S. meliloti 1021.
| 652 | SMc01226 | < | < | < | NN, s70 | 1698732 | 1698618 | 6 | 2 | 98 | 114 | y | <, C | |
| 541 | SMc01844 | > | > | < | NN, s70 | 2371490, 2371606 | 2371852 | 1 | 1 | 82 | 256–362 | n | >, S | |
| 384 | SMc03856 | < | > | > | s70 | 3522121, 3522271 | 3522379 | 12 | 8 | 80 | 108–258 | y | >, A, B, O, S | |
| 491 | SMc01202 | < | < | < | NN, s70 | 1728153, 1728196, 1728269 | 1728021 | 1 | 1 | 62 | 120–248 | n | - | |
| 922 | SMc02051 | < | < | < | NN, s70 | 1667614 | 1667488 | 3 | 5 | 58 | 126 | n | <, S | |
| 301 | SMc04042 | < | > | < | NN, s70 | 3046713 | 3046789 | 9 | 4 | 53 | 76 | y | - | |
| 470 | SMc02080 | > | < | > | NN, s70 | 1635343, 1635411, 1635564 | 1635217 | 1 | 1 | 52 | 126–347 | n | <, B, H, O (*) | |
| 405 | SMc00057 | > | > | > | NN, s70 | 1091047, 1091106 | 1091343 | 1 | 1 | 52 | 237–296 | n | - | |
| 710 | SMc01933 | < | < | > | NN, s70 | 1398427, 1398584 | 1398279 | 4 | 3 | 47 | 148–305 | y | <, A, C, H, S | |
| 472 | SMc02850 | < | > | > | NN, s70 | 201682 | 201829 | 4 | 3 | 47 | 147 | y | - | |
| 1101 | SMc01671 | < | > | < | NN, s70 | 2475717, 2475862 | 2475961 | 5 | 2 | 47 | 99–244 | y | - | |
| 410 | SMc02139 | < | > | > | NN, PhoB | 573816, 573830 * | 574040 | 4 | 1 | 45 | 210 | y | - | |
| 829 | SMc02151 | < | > | < | NN, s70 | 560780, 560916, 561000 | 561258 | 1 | 1 | 42 | 258–478 | n | >, S (*) | |
| 902 | SMc02597 | > | < | > | NN, s70 | 1198309, 1198440, 1198508 | 1198093 | 1 | 1 | 42 | 212–416 | n | - | |
| 277 | SMc01425 | < | > | > | s70 | 2270996, 2271068 | 2271224 | 1 | 1 | 42 | 156–228 | n | - | |
| 292 | SMc01218 | < | < | > | NN | 1706863 | 1706715 | 1 | 3 | 34 | 148 | n | - | |
| 396 | SMc04289 | < | > | > | s70, PhoB | 2210185, 2210260 * | 2210322 | 1 | 2 | 33 | 62–137 | n | - | |
| 1270 | SMc04453 | < | < | > | NN | 2321447 | 2321055 | 2 | 1 | 33 | 392 | y | <, A, C, O, S | |
| 826 | SMc03014 | > | < | > | NN, s70 | 713461, 713680, 713763 | 713290 | 1 | 1 | 32 | 170–463 | n | - | |
| 644 | SMc01257 | < | > | > | NN, s70, PhoB | 1518613, 1518748 * | 1518988 | 1 | 1 | 32 | 234–375 | n | - | |
1 sRNA candidates reported in contemporary studies retain their original notation (smrC# or sra#) [26,27]. Otherwise, the designation sm# corresponds to the full list of sRNA predictions obtained in this work by the global scoring procedure (Additional file 17). Candidate IgRs are sorted according to its descending global score (GS). Bolded candidates have been verified experimentally ([26,27]; this work).
2 NN, neural network promoter prediction; s70, σ70-dependent promoter; s54, RpoN (σ54)-dependent promoter promoter; PhoB, putative PhoB binding site.
3 The coordinates of all putative promoters and/or transcription factor binding sites within an IgR are presented. The position given for NN is the 3' end of the identified sequence. For σ70 the position given is 7 bases downstream of the 3' end of the -10 hexamer. The 3' end of the predicted binding site for σ54 and PhoB is indicated with an asterisk.
4 Given is the position of the last uridine at the end of the terminator sequence.
5 Range of possible lengths based on putative 5' and 3' ends.
6 y, candidate present in sRNAPredictHT search (Additional file 4); n, candidate absent in sRNAPredictHT output.
7 As described in Table 1.
Figure 2Organization of novel . The IgRs encompassing novel putative sRNAs from our first compilation (Table 1) are drawn to scale in the portion between breaks. The chromosomal coordinates of predicted promoters and Rho-independent terminators are indicated next to the corresponding symbols. ORFs flanking each IgR are designated with their annotated codes or gene names. sRNAs are designated according to their position in the output of global scoring for the corresponding IgR (Additional file 17) or to the corresponding IgR from Table 1. Small empty arrowheads indicate the approximate position of the chromosomal target sequences for oligonucleotides used to generate probes for Northern blot. Wavy arrowheads denote the location and orientation of oligonucleotides present in Sm14kOLI microarray that detected the putative sRNA transcripts (Table 1).
Figure 3Improvement of the predictive strategy of chromosomal . From the initial list of 2920 chromosomal IgRs, we retained 778 IgRs longer than 150 bp than did not contain annotated or non-annotated repeats. Next, we introduced a global scoring criterion for each IgR to assign a numerical score taking into account the presence of putative independent transcriptional units or transcriptional signals and their relative distance to flanking ORFs, sequence conservation (BlastN analysis) and secondary structure conservation (QRNA analysis). See text for further details.
Figure 4Summary of the scoring criteria introduced to weigh the relative position of predicted transcriptional signals in IgRs. An IgR with a co-oriented putative promoter and a terminator separated from by 40–500 bp each other was scored 10. Every promoter-terminator pair matching the previous criterion within a single IgR was rated individually and summed to calculate the global score of that IgR. Orphan promoters were scored 3. Putative promoters were rated 2 if the 5'-end of the co-oriented flanking CDS was farther than 300 bp, 1 if this distance was 200–300 bp and 0, if they were closer than 200 bp. Orphan terminators were scored 3. Predicted terminators co-oriented with flanking ORFs were scored according to their relative distance to the 3'-end of the corresponding annotated gene, so that a score of 2 was assigned if the terminator was farther than 200 bp, 1 if the distance was 100–200 nt, and 0 if it was closer than 100 bp.
Other small RNAs and cis-regulatory RNA elements detected in the chromosome of S. meliloti 1021 by the global scoring procedure.
| RNAseP RNA | y | 2356367–2357249 | SMc01857< | < | SMc01856< | 168 | [ |
| FMN riboswitch | y | 2398059–2398373 | SMc01608> | > | 127 | [ | |
| 4.5S RNA (SRP) | y | 259823–260123 | SMc02904< | > | 41 | [ | |
| Glycine riboswitch | y | 1674762–1675635 | < | SMc01242> | 37 | [ | |
| n | 2290955–2291142 | SMc01449< | > | SMc01450> | 33 | [ | |
| y | 3105052–3105637 | SMc02983< | < | SMc02984> | 27 | [ | |
| Thiamine riboswitch | y | 3532759–3533026 | SMc03868< | > | SMc03869> | 19 | [ |
| n | 2276530-2276876 | SMc01431 ( | < | SMc01432> | 16 | [ | |
| Cobalamin riboswitch | y | 1999510–1999806 | SMc00166< | < | SMc00165< | 16 | [ |
| Cobalamin riboswitch | n | 2122378–2123250 | SMc04305 ( | < | SMc04306> | 14 | [ |
| Cobalamin riboswitch | y | 954440–955076 | SMc00983< | > | SMc00982> | 11 | [ |
| y | 3505880–3506125 | SMc03838> | > | SMc03839> | 3 | [ | |
| y | 2938993–2939230 | < | SMc00639< | 2 | [ | ||
| Methionine riboswitch | y | 3461713–3461838 | SMc03796> | > | - | [ | |
| Methionine riboswitch | y | 580068–580264 | < | SMc02218> | - | [ |
1 RNA gene or element present (y) or absent (n) in Rfam database.
2 The transcribed strand is indicated.
3 global score (GS) calculated for the element in our predictive procedure (Figure 3).
Putative sRNA genes predicted as independent transcriptional units in IgRs of S. meliloti 1021 megaplasmids pSymA (smA#) and pSymB (smB#).
| 719 | SMa0450 | > | > | > | s70 | 241667 | 241961 | 294 | >, C, O (*) | |
| 817 | SMa0585 | < | < | > | s54 | 313672 * | 313529 | 143–243 | >, A, B, C, H, S | |
| 1007 | SMa0922 | > | < | < | NN | 512362 | 512154 | 208–285 | <, A, C | |
| 714 | SMa0995 | < | > | > | NN | 552600 | 552980 | 127–280 | >, A, B, C, H, O, S | |
| 292 | SMa1024 | < | > | > | PhoB | 567336 * | 567465 | 129 | >, H, S | |
| 624 | SMa1245 | < | < | < | NN | 682304 | 682237 | 67 | <, O | |
| 879 | SMa1644 | > | > | < | s70 | 916928 | 917238 | 93–310 | - | |
| 1124 | SMa2165 | > | > | > | NN | 1220442 | 1220807 | 365 | >, A, H, O, S | |
| 282 | SMb20203 | > | > | > | NN | 213506 | 213617 | 111 | - | |
| 495 | SMb20366 | > | > | < | NN | 379143 | 379269 | 126 | >, H, S | |
| 750 | SMb20516 | > | < | < | NN | 541963 | 541769 | 64–194 | <, H | |
| 592 | SMb20543 | < | < | < | NN, s70 | 571374 | 571217 | 123–157 | <, A, C | |
| 1623 | SMb20548 | > | > | > | NN | 574477 | 574763 | 142–286 | >, A, sB, C, H, O, S | |
| 751 | SMb20551 | < | > | > | s70 | 577731 | 577869 | 138 | >, O, S | |
| 196 | SMb21316 | < | > | > | s70 | 983786 | 983878 | 92 | >, S (*) | |
| 767 | SMb20872 | < | < | < | NN, s70 | 1279964 | 1279836 | 128–274 | <, A, O | |
| 1014 | SMb20916 | < | > | > | s70 | 1325140 | 1325588 | 448 | >, H | |
| 506 | SMb20748 | < | < | > | NN | 1537794 | 1537673 | 121 | - | |
1 sRNA candidates verified experimentally retain their original notation [26].
2 NN, neural network promoter prediction; s70, σ70-dependent promoter; s54, RpoN (σ54)-dependent promoter promoter; PhoB, putative PhoB binding site.
3 The coordinates of all putative promoters and/or transcription factor binding sites within an IgR are presented. The position given for NN is the 3' end of the identified sequence. For σ70 the position given is 7 bases downstream of the 3' end of the -10 hexamer. The 3' end of the predicted binding site for σ54 and PhoB is indicated with an asterisk.
4 Given is the position of the last uridine at the end of the terminator sequence.
5 Range of possible lengths based on putative 5' and 3' ends.
6 As described in Table 1.