| Literature DB >> 29515534 |
Wenfeng Liu1, Tatiana Rochat2, Claire Toffano-Nioche1, Thao Nguyen Le Lam1, Philippe Bouloc1, Claire Morvan1.
Abstract
Bacterial regulatory RNAs have been extensively studied for over a decade, and are progressively being integrated into the complex genetic regulatory network. Transcriptomic arrays, recent deep-sequencing data and bioinformatics suggest that bacterial genomes produce hundreds of regulatory RNAs. However, while some have been authenticated, the existence of the others varies according to strains and growth conditions, and their detection fluctuates with the methodologies used for data acquisition and interpretation. For example, several small RNA (sRNA) candidates are now known to be parts of UTR transcripts. Accurate annotation of regulatory RNAs is a complex task essential for molecular and functional studies. We defined bona fide sRNAs as those that (i) likely act in trans and (ii) are not expressed from the opposite strand of a coding gene. Using published data and our own RNA-seq data, we reviewed hundreds of Staphylococcus aureus putative regulatory RNAs using the DETR'PROK computational pipeline and visual inspection of expression data, addressing the question of which transcriptional signals correspond to sRNAs. We conclude that the model strain HG003, a NCTC8325 derivative commonly used for S. aureus genetic regulation studies, has only about 50 bona fide sRNAs, indicating that these RNAs are less numerous than commonly stated. Among them, about half are associated to the S. aureus sp. core genome and a quarter are possibly expressed in other Staphylococci. We hypothesize on their features and regulation using bioinformatic approaches.Entities:
Keywords: RNA-seq; Staphylococcus aureus HG003; bona fide sRNA; gene regulation; transcription factors
Year: 2018 PMID: 29515534 PMCID: PMC5826253 DOI: 10.3389/fmicb.2018.00228
Source DB: PubMed Journal: Front Microbiol ISSN: 1664-302X Impact factor: 5.640
Figure 1Defining bona fide sRNAs. (A) An ideal bona fide sRNA gene has its own promotor and transcriptional terminator. Its transcription does not overlap any antisense transcription. (B) In the first case, transcription from the second promoter leads to a bona fide sRNA while the transcription from the first promoter would likely generate a transcript with a long 3′UTR (e.g., RsaG). In the second case, a transcriptional termination read-through generates an alternative longer sRNA (e.g., RsaE-S390). (C) Three examples of non-coding RNAs not considered as bona fide sRNAs: a putative asRNA, a cis-regulatory element and a long 3′-UTR, respectively. Flag: promotor; hairpin loop: terminator; gray arrow: open reading frames; empty arrow: non-coding RNA.
Bona fide sRNAs expressed in HG003.
| srn_0335 | Includes SAOUHSCs258, S35 | 115205 | 115614 | – | + | NB | Repeated region |
| RsaG | Teg93, sRNA31, srn_0510, SAOUHSCs054, S58 | 201738 | 201962 | + | + | NB | Own promoter + read-through from SAOUHSC_00183 |
| Sau-5971 | srn_0880, SAOUHSCs073, S109 | 361904 | 362002 | – | – | NB | |
| ptsrn_0890 | sRNA71, SAOUHSCs205, SAOUHSC_A00354 | 367121 | 367211 | – | – | Part of putative ORF SAOUHSC_A00354 | |
| Teg147 | sRNA85, srn_0960, SAOUHSCs103 | 386294 | 386353 | + | – | ||
| ptsrn_1505 | SAOUHSCs189, S204 | 569615 | 569939 | + | – | NB | Own promoter and 3'UTR from SAOUHSC_00559 |
| RsaA | Teg88, Sau-64, sRNA132, srn_1510, SAOUHSCs048, S210 | 575845 | 575987 | + | + | NB | Sigma B regulation |
| ptRsaAL | 575845 | 576126 | + | + | NB | Sigma B regulation Processed into two sRNAs | |
| RsaC | Teg90, sRNA135, srn_1590, SAOUHSCs050, S234 | 623360 | 624458 | – | + | NB | Internal repeat. Own promoter and read-through from SAOUHSC_00634 |
| ptRsaD | sRNA138, srn_1640, SAOUHSCs051, S243 | 639711 | 639872 | – | + | NB | Antisense of putative SAOUHSC_00650 Antisense expression in some conditions |
| RsaH | Teg94, Sau-6059, sRNA162, srn_1910, SAOUHSCs055, S317 | 774294 | 774421 | + | + | NB | Antisense of SAOUHSC_00792 promoter |
| pttmRNA | Teg150, ssrA, SAOUHSCs006, WAN014GIY, sRNA166, S329 | 788284 | 788675 | + | + | NB | Own promoter and read-through from SAOUHSC_00804 |
| RsaE | Sau-20, Teg92, sRNA183, srn_2130, S389 | 911380 | 911481 | + | + | NB | Own promoter and read-through from SAOUHSC_00937 |
| ptRsaE-S390 | srn_2130, S389 + S390, includes RsaF | 911380 | 911739 | + | + | Poor expression; long product from RsaE terminator read-through | |
| sRNA195 | sRNA195, srn_2320, SAOUHSCs226, S414 | 990586 | 990684 | – | – | Possible antisense of SAOUHSC_01018 3'UTR | |
| RNA207 | srn_2500, SAOUHSCs229 | 1078428 | 1078718 | – | + | Internal repeat | |
| Teg106 | srn_2730, SAOUHSCs093, S540 | 1247774 | 1247925 | + | + | Poor expression | |
| ptTeg108 | sRNA222, srn_2740, SAOUHSCs094 | 1248013 | 1248138 | – | + | ||
| srn_2975 | SAOUHSCs275, S596 | 1362893 | 1363064 | + | + | NB | 5' partly antisense of SAOUHSC_01422. Longer transcript with terminator read-through antisense of SAOUHSC_01423 |
| S627 | None | 1462734 | 1462962 | – | – | Own promoter and 3'UTR from SAOUHSC_01514. Repeated region. Antisense expression in some conditions | |
| SprX2 | Ssr6, RsaOR, Teg15, srn_3820.1, SAOUHSC_A01455 | 1464058 | 1464207 | – | + | NB | Repeated region; putative ORF SAOUHSC_A01455; Possibly associated with S629 |
| 629 | None | 1464252 | 1464380 | – | – | Possibly 5'UTR of SAOUHSC_A01455 | |
| 6S RNA | Teg97, SsrS, Ssr80, WAN01CC8T, sRNA256, SAOUHSCs026, S685 | 1639003 | 1639243 | – | – | NB | Terminator read-through to SAOUHSC_01736 |
| RNA264 | srn_3320, SAOUHSCs017, S706 | 1685428 | 1685667 | – | – | Terminator read-through to SAOUHSC_01787 | |
| srn_3355 | SAOUHSCs110, included in S713 | 1707679 | 1707781 | – | – | ||
| Sau-5949 | Teg120, sRNA272, srn_3460, SAOUHSCs070 | 1771663 | 1771728 | + | – | NB | Possible antisense of SAOUHSC_01865 3'UTR |
| srn_3555 | SAOUHSCs221 | 1821336 | 1821444 | + | – | Repeated region | |
| SprB | Teg9, srn_3600, SAOUHSCs030 | 1849001 | 1849117 | – | – | NB | Not detected in Mäder et al. |
| sRNA287 | srn_9340, SAOUHSCs236, S774 | 1863800 | 1863899 | – | – | Own promoter + possible terminator read-through from SAOUHSC_T00050 | |
| srn_9345 | S808 | 1923614 | 1923879 | – | – | NB | Own promoter + 3'UTR of SAOUHSC_02016 |
| S810 | None | 1924486 | 1924611 | – | – | Own promoter inside SAOUHSC_02019. Repeated regions. | |
| ptTeg122 | srn_3770, SAOUHSCs097 | 2027317 | 2027376 | + | – | 52 pb; in proximity of a putative type I TA system. | |
| SprD | Teg14, sRNA300, srn_3800, SAOUHSCs032, S853 | 2033619 | 2033763 | – | + | NB | |
| ptTeg124 | srn_3810 | 2033838 | 2033899 | – | – | Not detected in Mäder et al. | |
| SprX1 | ssr6, RsaOR, Teg15, sRNA299, srn_3820, SAOUHSCs033, S854 | 2035228 | 2035378 | – | + | NB | Repeated regions; possible 5'UTR of SAOUHSC_02170 |
| ptRNAIII | sRNA317, srn_3910, SAOUHSCs022, S871 | 2093158 | 2093673 | – | + | NB | SAOUHSC_02260 ( |
| RsaOG | RsaI, Teg24, sRNA356, srn_4390, SAOUHSCs047, S999 | 2377317 | 2377465 | – | – | NB | Antisense expression in some conditions in Mäder et al. study. |
| Ssr42 | RsaX28, Teg27, sRNA363, srn_4470, SAOUHSCs084, S1036 | 2446923 | 2448156 | – | + | 1252 pb; high constitutive transcription; terminator read-through antisense of SAOUHSC_02663 | |
| RsaX20 | Teg128 + Teg130, srn_4520, SAOUHSCs100 (included), SAOUHSC_02702 + S1052 | 2484471 | 2484732 | + | + | Contains putative ORF SAOUHSC_02702 | |
| Sau-19 | Teg131, RsaX21, sRNA382, srn_4680, SAOUHSCs060 | 2556335 | 2556412 | + | – | NB | Not detected in Mäder et al. |
| Teg33 | sRNA400, srn_5010, S1164 | 2721121 | 2721350 | – | + | Own promoter; 3′UTR from SAOUHSC_02961; antisense of putative SAOUHSC_02960 |
sRNA names are given according to publication claim priority or most commonly used name. A list of previous analyses taken into account for this table is presented in Table .
Mäder et al., .
Carroll et al., .
Geissmann et al., .
Abu-Qatouseh et al., .
Bohn et al., .
Lioliou et al., .
Pichon and Felden, .
Eyraud et al., .
Novick et al., .
Marchais et al., .
Beaume et al., .
Bona fide sRNAs in HG003 with poor expression in the tested conditions of RNA-seq (this study) and tiling arrays (Mäder et al., 2016) datasets.
| Sau-27 | srn_2690 | 1219192 | 1219282 | + | – | NBd | No signal |
| Sau-85 | srn_2760, SAOUHSCs165 (but longer) | 1252254 | 1252305 | + | – | Poor expression | |
| RsaB | srn_3410, SAOUHSCs049, SAOUHSC_01844 | 1750160 | 1750216 | + | + | RTc 5′3′c | Poor expression; possibly 3'UTR of SAOUHSC_01844 |
| sRNA334 | srn_9480, SAOUHSCs242 | 2214760 | 2214889 | + | – | Poor expression | |
| sRNA390 | srn_9510, SAOUHSCs250 | 2629688 | 2629848 | + | – | Poor expression |
For explanations, see Table .
List of transcription factor motifs found by MAST analysis (E-value < 0.01) in the putative promoter region (−100 to +50 nts from transcription start sites) of bona fide sRNAs.
| RNAIII | AgrA | Cell density (AIP) Biotin NO, anaerobiosis | Regulation of quorum sensing Biotin metabolism Anaerobic switch | Mäder et al., |
| RsaB | Fur | Fe2+ | Iron homeostasis | This study |
| RsaD | CodY | Branched-chain amino acids | Amino acid metabolism | Mäder et al., |
| RsaE | SrrA | NO, anaerobiosis NAD | Anaerobic switch Anaerobic metabolism | Durand et al., |
| RsaOG | CcpA | HPr, phosphocarrier protein; Fructose-1,6-diphosphate | Carbon catabolism | Mäder et al., |
| RsaX20 | Zur | Zn2+ | Zinc homeostasis | Mäder et al., |
| Sau-19 | ArcR Rex | Arginine NAD | Arginine metabolism Anaerobic metabolism | This study This study |
| srn_2975 | Fur NanR | Fe2+ N-acetylmannosamine-6-P | Iron homeostasis, Sialic acid catabolism | Mäder et al., |
| sRNA207 | BirA | Biotin | Biotin metabolism | This study |
| sRNA287 | SarA | Pathogenesis regulation | Mauro et al., |
Motifs were defined as given by Regprecise (Novichkov et al., 2013) for S. aureus N315 strain. We also search for GraRS, WalKR, and SrrA putative regulations using reported motifs (Sterba et al., 2003; Hartig and Jahn, 2012; Nicolas et al., 2012; Mäder et al., 2016; Durand et al., 2017);
corresponds to proposed or validated regulations reported elsewhere (discussed in the text) corresponding to motifs that did not pass the stringent E-value chosen or that are upstream of sequences selected for analysis. No DNA binding site was found for GraRS and WalKR.
Figure 2Example of bona fide sRNAs. RsaD (Left) and RsaOG (Right). Upper: Artemis viewer window showing read log-coverages from pooled RNA samples extracted from HG003 grown in 16 growth conditions. Middle panel: screen snapshots of tiling array data from HG001 grown in different conditions (http://genome.jouy.inra.fr/cgi-bin/aeb/index.py, Mäder et al., 2016). Lower: annotations including genomic coordinates and sRNA names from Carroll et al. (yellow), Mäder et al. (light orange), and this study (mauve). Promoters (flags) and transcription terminators (hairpin loops) are placed according to Mäder et al. and/or TranstermHP software terminator predictions (Kingsford et al., 2007).
Figure 3Staphylococcus aureus sRNAs conservation across the Firmicutes phylum. Vertical axe: list of bona fide sRNAs. Horizontal axe: list of S. aureus strains and other Firmicutes (for accession numbers, see Table S2). Similarity ratios between sRNAs of indicated strains and NCTC8325 sRNAs (reference strain) are represented by the indicated color code.