| Literature DB >> 33629931 |
Pierre Boudry1,2, Emma Piattelli1, Emilie Drouineau1, Johann Peltier1,2, Anaïs Boutserin1, Maxence Lejars3, Eliane Hajnsdorf3, Marc Monot2,4, Bruno Dupuy2, Isabelle Martin-Verstraete2,5, Daniel Gautheret1, Claire Toffano-Nioche1, Olga Soutourina1,2,5.
Abstract
Noncoding RNAs (ncRNA) have emerged as important components of regulatory networks governing bacterial physiology and virulence. Previous deep-sequencing analysis identified a large diversity of ncRNAs in the human enteropathogen Clostridioides (Clostridium) difficile. Some of them are trans-encoded RNAs that could require the RNA chaperone protein Hfq for their action. Recent analysis suggested a pleiotropic role of Hfq in C. difficile with the most pronounced effect on sporulation, a key process during the infectious cycle of this pathogen. However, a global view of RNAs interacting with C. difficile Hfq is missing. In the present study, we performed RNA immunoprecipitation high-throughput sequencing (RIP-Seq) to identify Hfq-associated RNAs in C. difficile. Our work revealed a large set of Hfq-interacting mRNAs and ncRNAs, including mRNA leaders and coding regions, known and potential new ncRNAs. In addition to trans-encoded RNAs, new categories of Hfq ligands were found including cis-antisense RNAs, riboswitches and CRISPR RNAs. ncRNA-mRNA and ncRNA-ncRNA pairings were postulated through computational predictions. Investigation of one of the Hfq-associated ncRNAs, RCd1, suggests that this RNA contributes to the control of late stages of sporulation in C. difficile. Altogether, these data provide essential molecular basis for further studies of post-transcriptional regulatory network in this enteropathogen.Entities:
Keywords: CRISPR; Hfq RNA chaperone protein; cis-antisense RNA; riboswitch; small noncoding RNA; sporulation; toxin-antitoxin
Mesh:
Substances:
Year: 2021 PMID: 33629931 PMCID: PMC8583004 DOI: 10.1080/15476286.2021.1882180
Source DB: PubMed Journal: RNA Biol ISSN: 1547-6286 Impact factor: 4.652
Figure 1.Diagram of relative proportion of different RNA species in Hfq-FLAG coIP and in control sample. All sequences that mapped to the C. difficile genome are represented. The rRNA and tRNA abundant housekeeping RNAs are shown in orange, the reads mapping to CDS are shown in grey. The relative proportion of other RNA species including known previously identified regulatory RNAs (named « misc_RNA ») and IGRs for Hfq-3xFLAG coIP samples is detailed in the diagram on the right. Left panel: control coIP, right panel: Hfq 3xFLAG coIP. RIP-seq experiment has been performed with four biological replicates
Figure 2.Visualization of RIP-seq data for regulatory RNAs with IGV. Representative examples from different functional groups of RNAs are presented in A) for sRNAs, B) for TA loci, C) for other antisense RNAs, D) for CRISPR RNAs, E) for riboswitches, F) for MACS peaks without annotation for new ncRNAs in IGR or new antisense RNAs. The results for ‘3xFLAG’ Hfq-coIP sample are compared to the ‘Control’ sample, while the ‘Hfq’ represents Hfq-coIP sample compared to the ‘PI’ pre-immune serum control. The genomic regions for previously identified sRNAs are presented in a green box, newly identified transcribed region are shown in orange box. 6S RNA (an Hfq-independent RNA) serves as a negative control shown in black box. The peaks identified by MACS are shown in blue together with genes from MaGe annotation at the bottom of each panel. In the IGV visualization profiles, ‘+’ strand reads are shown in red and “-“ strand reads are shown in blue. Dynamic IGV visualization is presented with adjusted read threshold for each window to compare the Hfq-immunoprecipitated and control samples
Examples of RNAs enriched in Hfq-FLAG RIP-seq samples
| name | strand | annotation | fold_enrichment | -log10(Pval) |
|---|---|---|---|---|
| AS CD630_s0610 | + | AS Lysine riboswitch | 88.96 | 5.84172 |
| AS RCd1 | - | Antisense ncRNA | 78.17 | 6.83517 |
| AS_CD630_28900 | + | Antisense CDS | 205 | 4.12438 |
| CD630_Cdi1_3 (SQ173) | + | GEMM RNA motif | 391.6 | 6.59405 |
| CD630_Cdi1_4 | + | GEMM RNA motif | 294.48 | 5.31318 |
| CD630_Cdi1_8 | + | GEMM RNA motif | 493.76 | 5.79994 |
| CD630_Cdi1_9 (SQ1656) | + | GEMM RNA motif | 309.79 | 4.25622 |
| CD630_Cdi2_3 | - | c-di-GMP-II | 4,133.49 | 3.99184 |
| CD630_n00030 (RCd2) | - | Antisense CDS | 487.5 | 3.80984 |
| CD630_n00080 | - | Antisense CDS | 94.4 | 4.5933 |
| CD630_n00150 | - | Antisense CDS TA | 1,838.83 | 8.81823 |
| CD630_n00170 (RCd6) | - | ncRNA IGR | 17.72 | ND |
| CD630_n00240 | + | ncRNA IGR | 442.5 | 5.82709 |
| CD630_n00290 | - | ncRNA IGR | 87.13 | 3.85358 |
| CD630_n00330 | - | ncRNA IGR | 6,627.44 | 6.46464 |
| CD630_n00340 | - | Antisense CDS | 3,026.76 | 5.83478 |
| CD630_n00370 (RCd10) | - | Antisense CDS TA | 10,252.1 | 7.05302 |
| CD630_n00380 (CRISPR 3_4) | + | ncRNA IGR | 16.02 | 4.27916 |
| CD630_n00440 | + | ncRNA IGR | 558.25 | 4.85723 |
| CD630_n00460 (CRISPR 6) | + | ncRNA IGR | 105.68 | 5.55447 |
| CD630_n00470 | - | ncRNA IGR | 11.87 | 3.52267 |
| CD630_n00500 | - | Antisense CDS TA | 5018 | 9.35359 |
| CD630_n00510 (CRISPR 7) | + | ncRNA IGR | 14.33 | 3.92762 |
| CD630_n00560 (CRISPR 8) | + | ncRNA IGR | 26.72 | 5.82053 |
| CD630_n00600 (CRISPR 9) | + | ncRNA IGR | 24.99 | 5.31146 |
| CD630_n00620 | + | ncRNA IGR | 109.7 | 5.82838 |
| CD630_n00660 (RCd1) | + | ncRNA IGR | 254.73 | 4.09988 |
| CD630_n00680 (RCd5, Cdi1_6) | + | ncRNA IGR | 374.21 | 6.52781 |
| CD630_n00690 (CRISPR 10) | - | ncRNA IGR | 89.22 | 5.96526 |
| CD630_n00790 (CRISPR 11) | - | ncRNA IGR | 48.54 | 5.5121 |
| CD630_n00830 | + | Antisense CDS | 2,799.15 | 4.11757 |
| CD630_n00860 (CRISPR 12) | - | ncRNA IGR | 27.35 | 6.97567 |
| CD630_n00910 | - | Antisense CDS | 181.93 | 4.4372 |
| CD630_n00930 | - | ncRNA IGR | 73.05 | 5.7913 |
| CD630_n00980 (RCd12) | + | Antisense CDS TA | 305.03 | 5.31318 |
| CD630_n00990 (CRISPR 15_16) | - | ncRNA IGR | 16.03 | 5.45665 |
| CD630_n01000 (RCd9) | + | Antisense CDS TA | 5,892.11 | 8.96925 |
| CD630_n01010 (CRISPR 17) | - | ncRNA IGR | 23.98 | 4.59146 |
| CD630_s0190 | + | T-box | 486.61 | 4.9848 |
| CD630_s0250 | + | SAM riboswitch | 1,188.27 | 3.74616 |
| CD630_s0300 | + | T-box | 716.3 | 4.17452 |
| CD630_s0310 | + | T-box | 334.8 | 4.17452 |
| CD630_s0340 | + | Lysine riboswitch | 329.85 | 5.61064 |
| CD630_s0360 | - | FMN riboswitch | 5,577.85 | 6.65922 |
| CD630_s0450 | - | T-box (Arg) | 385.26 | 5.67265 |
| CD630_s0480 | + | T-box (Thr) | 472.62 | 4.87999 |
| CD630_s0500 | - | Purine riboswitch | 119.25 | 3.2587 |
| CD630_s0510 | - | T-box (Arg) | 369.65 | 3.37631 |
| CD630_s0590 | - | Lysine riboswitch | 3,361.69 | 8.13716 |
| CD630_SQ1641 | + | Antisense CDS TA | 3,492.75 | 8.65138 |
| CD630_SQ1642 | + | Antisense CDS TA | 193.25 | 8.65138 |
| CD630_SQ1781 (RCd8) | - | ncRNA IGR TA | 2,655 | 6.97567 |
| CD630_SQ1828 | - | ncRNA IGR | 219.9 | ND |
| CD630_SQ367 | + | ncRNA IGR | 41.4 | 4.0829 |
| CD630_SQ476 | + | Antisense CDS | 7 | 4.52273 |
| CD630_SQ808 | - | Antisense CDS TA | 1,894.75 | 6.8568 |
| IGR_CD630_06550 | + | ncRNA IGR | 65 | 3.90214 |
| CD630_01640 | + | Putative membrane protein | 35.44 | 4.55381 |
| CD630_03160 | - | ABC-type transport system, permease | 27.42 | 3.66946 |
| CD630_03180 | - | ABC-type transport system, bacitracin/multidrug-family ATP-binding protein | 24.05 | 3.73138 |
| CD630_03650 | - | ABC-type transport system, multidrug-family permease Tn916-like, CTn1-Orf10 | 30.06 | 3.93697 |
| CD630_04610 | - | Two-component sensor histidine kinase | 29.69 | 3.83025 |
| CD630_04620 | - | Two-component response regulator | 16.13 | 3.45413 |
| CD630_05760 ( | + | Sensor histidine kinase | 68.09 | 4.74085 |
| CD630_06630 ( | + | Toxin A | 23.04 | 4.44902 |
| CD630_06640 ( | - | Negative regulator of toxin gene expression | 7.49 | 3.76101 |
| CD630_09562 | + | Putative phage protein TA | 3.99 | 3.94307 |
| CD630_10360 ( | + | Putative N-acetylmuramoyl-L-alanine amidase, autolysin | 7 | 3.43363 |
| CD630_10790 | - | Transcriptional regulator, LysR family | 27.55 | 4.45284 |
| CD630_12330 ( | + | Putative cell surface protein | 16.66 | 4.44377 |
| CD630_14182 | + | Putative phage protein TA | 119.5 | 4.8236 |
| CD630_15080 | + | Putative iron-sulphur binding protein | 24.34 | 4.03033 |
| CD630_15170 ( | - | Ferrous iron transport protein B | 13.78 | 4.05364 |
| CD630_17630 | - | Putative membrane protein | 179 | 4.52408 |
| CD630_18030 ( | + | Putative cell wall-binding protein | 14.11 | 4.4661 |
| CD630_18290 ( | + | Two-component sensor histidine kinase KdpD | 16.02 | 3.75086 |
| CD630_18300 ( | + | Two-component response regulator KdpE | 10.55 | 4.2714 |
| CD630_19090 ( | + | Ethanolamine utilization protein, GTPase family | 20.15 | 3.82847 |
| CD630_19100 ( | + | Two-component response regulator, Ethanolamine specific | 10.55 | 4.55715 |
| CD630_19740 ( | - | Hfq RNA chaperone | 105.1 | 5.05609 |
| CD630_22991 | - | Putative phage protein TA | 75.25 | 6.03728 |
| CD630_23050 ( | + | Type IV pilus protein | 70.28 | 4.96316 |
| CD630_24830 | + | Putative lipoprotein | 29.26 | 4.98239 |
| CD630_24830 | + | Putative lipoprotein | 29.26 | 4.98239 |
| CD630_25180 ( | - | Cell surface protein | 31.15 | 4.40865 |
| CD630_26150 | + | Transcriptional regulator, TetR family | 132.63 | 4.06302 |
| CD630_27500 ( | - | Accessory gene regulator | 59.03 | 3.92598 |
| CD630_28520 ( | - | D-alanyl transferase DltB, MBOAT family | 22.57 | 3.81485 |
| CD630_28530 ( | - | D-alanine–poly(phosphoribitol) ligase subunit 1 | 13.99 | 3.85429 |
| CD630_28540 ( | - | D-alanine transferase | 29.97 | 4.40777 |
| CD630_28890 | - | Putative phage protein TA | 337.02 | 7.52469 |
| CD630_29750 | - | Putative CRISPR-associated Cas2 family protein | 38.37 | 4.59146 |
| CD630_29760 | - | Putative CRISPR-associated Cas1 family protein | 13.77 | 3.39713 |
| CD630_30620 | - | Transcriptional regulator, RpiR family | 57.58 | 4.15443 |
| CD630_31210 | + | Putative flavodoxin/nitric oxide synthase | 25.97 | 4.17693 |
| CD630_31450 ( | - | Surface-exposed adhesin | 20.96 | 3.0304 |
| CD630_32040 | - | Putative membrane protein | 173.22 | 4.93516 |
| CD630_32050 | + | Putative nitroreductase | 16.38 | 5.73797 |
| CD630_35040 ( | - | Type IV prepilin peptidase, A24A family | 15.15 | 4.54123 |
| CD630_35050 ( | - | Type IV pilus twitching motility protein | 8.24 | 3.24639 |
| CD630_35110 ( | - | Type IV pilus secretion protein | 7.45 | 3.5423 |
The RNAs from different functional classes referenced in the text are included with the p-value from MACS2 analysis and the fold-enrichment in Hfq-FLAG RIP-seq samples as compared to control.
Figure 3.Visualization of RIP-seq data for mRNAs with IGV. Representative examples from different functional groups of mRNAs are presented in A) host-adaptation related genes, B) for type IV pili, adhesion, surface protein-encoding genes, C) for regulatory and signalling pathways including quorum sensing, two-component systems and transcriptional regulatory genes, D) for membrane proteins and transporter genes, E) for metabolism genes. The IGV visualization is presented as in. Fig. 2
Figure 4.Diagram showing pathway clustering for mRNAs peaks detected by RIP-seq from KEGG database. The functional category of genes is shown with the corresponding gene number
Figure 6.The expression profile of RCd1 and selected potential targets during sporulation. (A) RNA samples were extracted from 630∆erm strain (630), 630∆erm strain carrying an empty vector (630/p) or overexpressing RCd1 from plasmid (630/pRCd1) grown in TY medium supplemented on not with 250 ng/ml ATc (+ATc) or in SM medium for 10 h, 12 h, 14 h, 16 h, 18 h, 20 h, 22 h, 24 h as indicated. The strain deleted for RCd1 (ΔRCd1) was used as a control for specific RCd1 detection. 5S RNA at the bottom serves as loading control. As indicated at the left, the blots were hybridized either with RCd1, spoIIID or CD1234-specific probes. The same 5S control panel is shown when reprobing of the same membrane was performed. Relative intensity of the bands ‘RI’ normalized to 5S estimated with ImageJ software is indicated underneath each lane, ‘>’ indicates the saturation of the signal. MW, the size of RNA molecular weight standards in nucleotides. (B) Effect of RCd1 overexpression on spoIIID and CD1234 genes during sporulation was monitored in SM medium during sporulation for 10 h, 12 h, 14 h, 16 h, 18 h, 20 h, 22 h, 24 h as indicated
Figure 5.The role of RCd1 in the skin element excision. A) Sporulation efficiency by numeration of spores formed by the strains overexpressing RCd1 and their control strains. Sporulation assays were performed in SM after 24 h of culture. Spores were selected by heat treatment at 65°C for 20 min and their number estimated for strains CDIP590 (630ΔermΔskin/p), CDIP592 (630ΔermΔskin/pRCd1), CDIP219 (630Δerm/p), CDIP258 (630Δerm/pRCd1). The diagram shows the sporulation rate as a proportion in % of spores for the total number of CFU, means of results from at least three independent experiments are presented. Error bars correspond to standard deviations from three biological replicates. Statistical significance for spore formation differences between strains CDIP219 and CDIP258 was evaluated by a Welch two sample t test (p = 0.02), N = 3. B) Diagram of the skin element integrated into the sigK gene and its episomal form shows the location of the oligonucleotides used as primers for PCR (IMV824/IMV825) to distinguish between integrated and episomal skin forms. C) The PCR results from three independent experiments for relative skin excision are presented. D) Western-blot detection of FLAG-tagged CD1234 protein in the cell extracts of C. difficile strains: lane 1- strain CDIP704 (CD1234-FLAG/p); lane 2- strain CDIP708 (CD1234-FLAG/pRCd1); lane 3- strain CDIP607 (630Δerm/pCD1234-FLAG); lane 4- CDIP369 (630Δerm/p). Immunoblotting with anti-FLAG antibodies detected a major polypeptide of ~11.5 kDa in whole-cell lysates of the strain carrying CD1234-3xFLAG gene on the chromosome of C. difficile 630Δerm strain but not in extracts of the wild-type strain expressing the non-tagged protein (negative control on lane 4). The representative result from three independent experiments is shown. The strains were grown in SM medium for 12 h in the presence of 250 ng/ml ATc and 7.5 µg/ml Tm. 60 µg (lanes 1–2) or 10 µg (lanes 3–4) of whole-cell lysate were loaded on the gels. The lysate of the strain overexpressing CD1234-FLAG from the plasmid serves as a positive control for detection (lane 3). Proteins were separated on 4–20% Bis-Tris polyacrylamide gels in Tris-Glycine SDS buffer. Immunoblotting with anti-SigA antibodies serves as loading control (bottom panel). The relative RCd1 abundance has been estimated by qRT-PCR, the mean values with standard deviations from three independent experiments are shown
Effect of RCd1 overexpression on the abundance of potential targets
| | | | Ratio 630p/630pRCd1 | | |
|---|---|---|---|---|---|
| Gene ID | Name | Function | TY | TY pulse | SM* |
| RNA polymerase sigma-K factor SigK | ND | ||||
| Stage III sporulation protein D | |||||
| Spore coat protein | |||||
| Spore outer coat layer protein CotA | ND | ||||
| Site-specific recombinase for | 1.1 ± 0.4 | 1.3 ± 0.6 | 0.9 ± 0.4 | ||
| Putative cell surface protein | 1.2 ± 0.7 | 0.9 ± 0.3 | 0.7 ± 0.2 | ||
| Assessory protein for |
* After 8 h of growth in TY with continuous ATc induction or ‘pulse’ ATc induction for 7 min or 12 h of growth in SM medium in the presence of ATc inducer.
The effect of RCd1 overexpression has been estimated by qRT-PCR analysis in triplicate as described in Material and Methods.
Mean values ± standard deviations from at least three biological replicates are presented. ND, not determined. The values greater than 2 are shown in green.
Figure 7.Analysis of the interaction between RCd1 and spoIIID or CD1234 in the presence or the absence of Hfq. Radioactive RCd1 transcript was synthesized in vitro and incubated alone or together with CD1234 or spoIIID transcripts with purified Hfq-His6 protein expressed as monomer forms. Brackets show Hfq-RCd1 complexes, the position of unbound RNA and RCd1-potential target complexes is indicated at the left