| Literature DB >> 19888332 |
Nataly Perez1, Jeanette Treviño, Zhuyun Liu, Siu Chun Michael Ho, Paul Babitzke, Paul Sumby.
Abstract
The coordinated regulation of gene expression is essential for pathogens to infect and cause disease. A recently appreciated mechanism of regulation is that afforded by small regulatory RNA (sRNA) molecules. Here, we set out to assess the prevalence of sRNAs in the human bacterial pathogen group A Streptococcus (GAS). Genome-wide identification of candidate GAS sRNAs was performed through a tiling Affymetrix microarray approach and identified 40 candidate sRNAs within the M1T1 GAS strain MGAS2221. Together with a previous bioinformatic approach this brings the number of novel candidate sRNAs in GAS to 75, a number that approximates the number of GAS transcription factors. Transcripts were confirmed by Northern blot analysis for 16 of 32 candidate sRNAs tested, and the abundance of several of these sRNAs were shown to be temporally regulated. Six sRNAs were selected for further study and the promoter, transcriptional start site, and Rho-independent terminator identified for each. Significant variation was observed between the six sRNAs with respect to their stability during growth, and with respect to their inter- and/or intra-serotype-specific levels of abundance. To start to assess the contribution of sRNAs to gene regulation in M1T1 GAS we deleted the previously described sRNA PEL from four clinical isolates. Data from genome-wide expression microarray, quantitative RT-PCR, and Western blot analyses are consistent with PEL having no regulatory function in M1T1 GAS. The finding that candidate sRNA molecules are prevalent throughout the GAS genome provides significant impetus to the study of this fundamental gene-regulatory mechanism in an important human pathogen.Entities:
Mesh:
Substances:
Year: 2009 PMID: 19888332 PMCID: PMC2765633 DOI: 10.1371/journal.pone.0007668
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Candidate small regulatory RNAs identified by bioinformatic and tiling microarray approaches.
| RNA name | Left nucleotide | Right nucleotide | Size (nt) | Orientation | Information and/or adjacent genes | Method |
| SR79100 | 79100 | 79500 | 400 | < | Adjacent to ribosmal protein L17P and M5005_Spy_0072 | M |
| SR125800 | 125800 | 125900 | 100 | < | Adjacent to a sortase and M5005_Spy_0115 | M |
| SR146132 | 146132 | 146288 | 157 | > | Adjacent to M5005_Spy_0135 and | L |
| SR188392 | 188392 | 188735 | 343 | > | Adjacent to a tRNA-specific adenosine deaminase and M5005_Spy_0180 | L |
| SR188971 | 188971 | 189044 | 73 | > | Adjacent to a tRNA-specific adenosine deaminase and M5005_Spy_0180 | L |
| SR195750 | 195750 | 195870 | 140 | < | Adjacent to putative transcription factor and a transposase | M |
| SR214350 | 214350 | 214500 | 150 | > |
| M |
| SR237399 | 237399 | 237709 | 310 | > | Adjacent to ribosomal protein S7P and | L |
| SR254481 | 254481 | 254590 | 109 | > | Adjacent to | L |
| SR257300 | 257300 | 257400 | 100 | ? | Adjacent to dacA2 and oppA | M |
| SR263982 | 263982 | 264054 | 72 | > | Adjacent to | L |
| SR271250 | 271250 | 271350 | 100 | > | Adjacent to | M |
| SR277250 | 277250 | 277350 | 100 | > | Adjacent to a putative methyltransferase and M5005_Spy_0267 | M |
| SR307231 | 307231 | 307572 | 341 | < | Adjacent to a putative transposase and inner membrane protein YIDC | L |
| SR331095 | 331095 | 331188 | 93 | < | Adjacent to ferrichrome transport ATP-binding protein | L |
| SR336250 | 336250 | 336450 | 200 | > | Adjacent to | M |
| SR358650 | 358650 | 358800 | 150 | < | Adjacent to | M |
| SR360800 | 360800 | 361300 | 500 | < | Adjacent to M5005_Spy_0354 and M5005_Spy_0355 | M |
| SR396160 | 396160 | 396487 | 327 | > | Adjacent to | L |
| SR418861 | 418861 | 419103 | 242 | > | Adjacent to M5005_Spy_0426 and | L |
| SR452230 | 452230 | 452500 | 270 | > | Adjacent to M5005_Spy_0460 and M5005_Spy_0461 | M |
| SR520921 | 520921 | 521058 | 138 | > | Adjacent to ftsX and M5005_Spy_0533 | L |
| SR540686 | 540686 | 540783 | 98 | < | Adjacent to M5005_Spy_0550 and rplS | L |
| SR541600 | 541600 | 541800 | 200 | > | Adjacent to a tRNA-Arg and M5005_Spy_0552 | L,M |
| SR559590 | 559590 | 559700 | 110 | > |
| M |
| SR622408 | 622408 | 622534 | 126 | > | Adjacent to | L |
| SR638450 | 638450 | 638600 | 150 | > | Adjacent to ribosomal protein L27P and a putative transcription factor | M |
| SR641213 | 641213 | 641345 | 132 | > | Adjacent to M5005_Spy_0638 and | L |
| SR678133 | 678133 | 678532 | 399 | > | Adjacent to M5005_Spy_0674 and M5005_Spy_0676 | L |
| SR701500 | 701500 | 702300 | 800 | > | Adjacent to | M |
| SR721150 | 721150 | 721350 | 200 | ? | Adjacent to tRNA-Arg and M5005_Spy_0716 | M |
| SR758876 | 758876 | 758976 | 100 | > | Adjacent to | L |
| SR759205 | 759205 | 759368 | 163 | < | Adjacent to | L |
| SR800747 | 800747 | 800916 | 169 | > | Adjacent to | L |
| SR801894 | 801894 | 802142 | 249 | > | Adjacent to rplL and M5005_Spy_0797 | L |
| SR843321 | 843321 | 843412 | 91 | > | Adjacent to | L |
| SR862600 | 862600 | 862900 | 300 | > | Adjacent to an ABC transporter and | M |
| SR869300 | 869300 | 869600 | 300 | > | Adjacent to a glyoxalase family protein and M5005_Spy_0877 | M |
| SR914400 | 914400 | 914600 | 200 | < | Adjacent to | L,M |
| SR933600 | 933600 | 934150 | 550 | < | Adacent to cdd and the 16S rRNA methyltransferase | M |
| SR961800 | 961800 | 962000 | 200 | < | Adjacent to M5005_Spy_0976 and | M |
| SR969000 | 969000 | 969100 | 100 | ? | Adjacent to | M |
| SR1016300 | 1016300 | 1016500 | 200 | > | Prophage-encoded | M |
| SR1018400 | 1018400 | 1018500 | 100 | > | Prophage-encoded | M |
| SR1131900 | 1131900 | 1132100 | 200 | < | Adjacent to | L,M |
| SR1173300 | 1173300 | 1173350 | 50 | > | Prophage-encoded | M |
| SR1175500 | 1175500 | 1175800 | 300 | > | Prophage-encoded | M |
| SR1175900 | 1175900 | 1176100 | 200 | > | Prophage-encoded | M |
| SR1201244 | 1201244 | 1201471 | 227 | < | Adjacent to | L |
| SR1207340 | 1207340 | 1207537 | 198 | > | Adjacent to ftsA and divIB | L |
| SR1251900 | 1251900 | 1252100 | 200 | < | Adjacent to M5005_Spy_1295 and a ribosomal protein | L,M |
| SR1291775 | 1291775 | 1291982 | 207 | < | Adjacent to M5005_Spy_1324 and ribosome associated factor Y | L |
| SR1355150 | 1355150 | 1355250 | 100 | < | Adjacent to | M |
| SR1358431 | 1358431 | 1358497 | 66 | < | Adjacent to | L |
| SR1385110 | 1385110 | 1385204 | 94 | < | Adjacent to M5005_Spy_1413 and bacteriophage gene M5005_Spy_1414 | L |
| SR1532800 | 1532800 | 1532900 | 100 | ? | Adjacent to M5005_Spy_1571 and M5005_Spy_1572 | M |
| SR1568180 | 1568180 | 1568273 | 93 | < | Adjacent to pyrG and rpoE | L |
| SR1587818 | 1587818 | 1588167 | 349 | < | Adjacent to hsdM and transcriptional regulatory protein degU | L |
| SR1604140 | 1604140 | 1604210 | 70 | < | Adjacent to | M |
| SR1605828 | 1605828 | 1606280 | 452 | < | Adjacent to an integrase pseudogene and SSU ribosomoal protein rpsI | L |
| SR1678800 | 1678800 | 1678950 | 150 | < | Adjacent to | L,M |
| SR1678950 | 1678950 | 1679050 | 100 | < | Adjacent to | M |
| SR1681917 | 1681917 | 1682067 | 150 | > | Adjacent to virulence factors | L |
| SR1698200 | 1698200 | 1698640 | 440 | ? | Adjacent to | M |
| SR1719800 | 1719800 | 1719900 | 100 | < | Adjacent to | M |
| SR1720792 | 1720792 | 1720852 | 60 | > | Adjacent to putative transcription factor M5005_Spy_1760 and groEL | L |
| SR1720816 | 1720816 | 1720922 | 106 | < | Adjacent to putative transcription factor M5005_Spy_1760 and groEL | L |
| SR1727893 | 1727893 | 1728188 | 295 | < | Adjacent to a tranposase and peroxiredoxin gene ahpC | L |
| SR1745900 | 1745900 | 1746000 | 100 | > | Adjacent to a putative transcriptional regulator and | M |
| SR1754950 | 1754950 | 1755050 | 100 | < | Adjacent to | L,M |
| SR1765900 | 1765900 | 1766000 | 100 | ? | Adjacent to | M |
| SR1789300 | 1789300 | 1789400 | 100 | ? | Adjacent to a putative transcriptional regulator and M5005_Spy_1821 | L,M |
| SR1806601 | 1806601 | 1806858 | 257 | < | Adjacent to | L |
| SR1808413 | 1808413 | 1808633 | 220 | < | Adjacent to | L |
| SR1811574 | 1811574 | 1811651 | 77 | < | Adjacent to M5005_Spy_1843 and ABC permease protein cbiQ | L |
Nucleotide coordinates and gene designations are relative to the publically available MGAS5005 genome sequence [17]. Candidate sRNAs without a clearly defined orientation are highlighted with a question mark. RNAs were identified from a previous bioinformatic analysis (L) or by the microarray-based method described here (M).
Figure 1Representative candidate small RNA molecules identified by tiling microarray.
Genes are represented by black arrows facing the direction of transcription. Red vertical lines represent signal intensities from probes (PM-MM) tiled within intergenic regions. Red lines extending upward indicate left to right transcription, downward extending lines indicate right to left transcription. Blue horizontal bars indicate RNA length with the size in nucleotides shown. (A) Validation of our custom microarray as a tool to identify GAS sRNAs. The previously described FASX sRNA is located downstream of fasA (M5005_spy_0206 from the published MGAS5005 genome) and can be visualized as a distinct peak of signal intensity. (B) A candidate sRNA located upstream, and in the same orientation as, the C5a peptidase encoding gene scpA (M5005_spy_1715). (C) A candidate sRNA located downstream, and in opposite orientation to, dipeptidase A (M5005_spy_1758). (D) A candidate sRNA located downstream of, and in opposite orientation to, the treR gene encoding a putative repressor of the trehalose operon (M5005_spy_1785). (E) A clustered, regularly interspaced short palindromic repeat (CRISPR) element in GAS is transcribed in the same orientation as CRISPR-associated genes (cas1, cas2, cas4; M5005_spy_1285-7).
Candidate riboswitches and other small RNAs identified by bioinformatic and tiling microarray approaches.
| RNA name | Left nucleotide | Right nucleotide | Size (nt) | Orientation | Information and/or adjacent genes |
| RNA190950 | 190950 | 191010 | 60 |
| Putative SRP RNA |
| RNA319780 | 319780 | 319900 | 120 |
| Putative riboswitch |
| RNA484784 | 484784 | 484936 | 152 |
| Putative vitamin B1 riboswitch |
| RNA501950 | 501950 | 502050 | 100 |
| Putative |
| RNA772970 | 772970 | 773110 | 140 |
| CRISPR - 1 |
| RNA849201 | 849201 | 849250 | 49 |
| Putative purine riboswitch |
| RNA964800 | 964800 | 964850 | 50 |
| Putative glycine riboswitch |
| RNA983400 | 983400 | 983800 | 400 |
| Putative tmRNA |
| RNA1083000 | 1083000 | 1083250 | 250 |
| Putative |
| RNA1239700 | 1239700 | 1240020 | 320 |
| CRISPR - 2 |
| RNA1320100 | 1320100 | 1320500 | 400 |
| Putative RNase P |
| RNA1439100 | 1439100 | 1439200 | 100 |
| Putative |
| RNA1660600 | 1660600 | 1660800 | 200 |
| Putative 6S RNA |
Nucleotide coordinates and gene designations are relative to the publically available MGAS5005 genome sequence [17].
Figure 2Northern blot verification of candidate sRNAs.
Northern blots were performed using RNA isolated from strain MGAS2221 at 4 growth phases and probed for the presence of candidate sRNAs. The name or genome location (in nucleotides, relative to the published MGAS5005 genome) of candidate RNAs is displayed to the left of each blot. The approximate size in nucleotides of detected transcript/s is displayed to the right of each blot. Below each blot is a graph representing the normalized signal intensity of each hybridizing band. Signal intensities were generated using the Quantity One software package version 4.6.1., and normalized to signal detected for the housekeeping RNA 5S RNA (a representative 5S RNA blot is shown in figure 3). Normalized signal intensities are plotted relative to the most highly expressed time-point.
Figure 3Northern blot verification of riboswitches and other small RNAs.
Northern blots were performed using RNA isolated from strain MGAS2221 at 4 growth phases. The name of the candidate RNA molecules are shown to the left of each Northern. To the right of each Northern is the approximate size in nucleotides of the transcript/s. The 5S RNA served as a loading control.
Figure 4Analysis of candidate sRNA transcriptional start sites, terminators, and promoter regions.
The transcriptional start sites of candidate sRNAs FASX, SR195750, SR914400, SR1251900, SR1719800, and SR1754950 were determined by 5′ RACE. The identified transcriptional start site is colored red, the deduced sRNA sequences are colored black, and the final base of the terminator hairpin is colored blue. The putative −10 and/or −35 promoter sequences are underlined and putative rho-independent (intrinsic) terminators are highlighted by inverted arrows.
Figure 5Northern blot analysis of sRNA stability.
Aliquots of mid-exponential or late stationary phase cultures of strain MGAS2221 were harvested prior to (T = 0) and following (T = 5, 10, 20, 30, 45, 60, 90 min) rifampicin treatment to inhibit new RNA synthesis. 8 µg of extracted RNA from each time-point was subjected to Northern blot analysis, probing for PEL, FASX, SR195750, SR914400, SR1251900, SR1719800, and SR1754950 transcripts. Note that as the exposure time of each Northern blot varied no comparison of band intensities between blots should be made.
Figure 6Northern blot analysis of intra- and/or inter-serotype variation in sRNA transcription.
(A) Intra-serotype variation. Transcript abundance of sRNAs PEL, FASX, SR195750, SR914400, and SR1251900 were assayed in 9 different serotype M1 GAS strains. The M1 GAS strains were isolated from several different countries over a greater than 10 year period (Table S5). Northern blots were made using RNA isolated from exponential phase cultures. Note that an air bubble, and not a lack of transcript, was responsible for the apparent lack of signal for SR914400 in the SF370 sample. The housekeeping 5S RNA was used as a loading control. (B) Inter-serotype variation. Transcript abundance of sRNAs PEL, FASX, SR195750, SR914400, SR1251900, and SR1754950 were assayed in strains representing 8 GAS serotypes. Northern blots were made using RNA isolated from both exponential and early stationary phase cultures of the serotype M1 strain MGAS2221, the serotype M2 strain MGAS10270, the serotype M3 strain MGAS315, the serotype M4 strain MGAS10750, the serotype M6 strain MGAS10394, the serotype M12 strain MGAS2096, the serotype M18 strain MGAS8232, and the serotype M28 strain MGAS6180. The housekeeping 5S RNA was used as a loading control.
Figure 7PEL has no apparent regulatory function in four M1T1 clinical GAS isolates.
(A) Plate assay showing that the hemolytic negative phenotype of mutant strain 2221ΔPEL is complemented by addition of plasmid pPELC. Plasmid pPELC is a derivative of vector pDC123 that contains wild-type PEL. (B) Fold change (log2) in gene expression between isogenic mutant strain 2221ΔPEL and parental strain MGAS2221 during the exponential phase of growth in THY broth. Corresponding P-values (T-test) are graphed on the y-axes. The two white background areas of the graph signify those genes which are differentially expressed ≥1.5-fold with p≤0.05. Data points corresponding to genes of interest are colored red and labeled. (C) Taqman quantitative RT-PCR analyses comparing the transcript levels of select genes between parental strains MGAS2221, MGAS5005, MGAS5406, MGAS9127, and their isogenic pel mutant derivatives. Note that the spd3 gene is absent in strain MGAS9127. Experiment was performed in triplicate with mean fold-transcript levels relative to the appropriate parental strain (dashed line) shown. Error bars represent ± standard deviation. (D) Western blot analyses showing a lack of regulation by PEL in the four M1T1 GAS isolates studied. Western blots were created using protein isolated from the supernatants of exponential phase THY cultures of each GAS strain.