| Literature DB >> 28082390 |
Anaïs Le Rhun1,2,3, Anne-Laure Lécrivain1,2, Johan Reimegård4, Estelle Proux-Wéra5, Laura Broglia2,3, Cristina Della Beffa3, Emmanuelle Charpentier1,2,3,6.
Abstract
A better understanding of transcriptional and post-transcriptional regulation of gene expression in bacteria relies on studying their transcriptome. RNA sequencing methods are used not only to assess RNA abundance but also the exact boundaries of primary and processed transcripts. Here, we developed a method, called identification of specific cleavage position (ISCP), which enables the identification of direct endoribonuclease targets in vivo by comparing the 5΄ and 3΄ ends of processed transcripts between wild type and RNase deficient strains. To demonstrate the ISCP method, we used as a model the double-stranded specific RNase III in the human pathogen Streptococcus pyogenes. We mapped 92 specific cleavage positions (SCPs) among which, 48 were previously described and 44 are new, with the characteristic 2 nucleotides 3΄ overhang of RNase III. Most SCPs were located in untranslated regions of RNAs. We screened for RNase III targets using transcriptomic differential expression analysis (DEA) and compared those with the RNase III targets identified using the ISCP method. Our study shows that in S. pyogenes, under standard growth conditions, RNase III has a limited impact both on antisense transcripts and on global gene expression with the expression of most of the affected genes being downregulated in an RNase III deletion mutant.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28082390 PMCID: PMC5389636 DOI: 10.1093/nar/gkw1316
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.Identification of specific cleavage positions (ISCP). (A) Example of an RNA molecule (curved black line) which is processed by an endoribonuclease (green) generating two RNA molecules. The new 3΄ and 5΄ ends produced (labeled in green) are detected by RNA sequencing in the WT (green bars) but are absent when the RNase responsible for the cleavage is deleted (ΔRNase). ‘Total’: total read coverage; ‘5΄ ends’ and ‘3΄ ends’: number of reads starting or ending respectively at each nucleotide (black bars). (B) ISCP was done by comparing processed transcript ends in WT and ΔRNase strains. Pp libraries containing primary (5΄ PPP) and processed (5΄ P/OH) transcripts and p libraries enriched in 5΄ processed transcripts were used to identify specific cleavage positions (SCPs). (C) SCPs were defined by several criteria. Lower cut-offs were chosen for the Pp libraries compared to the p libraries since Pp libraries contain a reduced relative amount of processed transcripts. We only considered nucleotides with more than 10 reads mapped as expressed. The first parameter (‘Expression’) selects positions with a sufficient total number of read ends in both ΔRNase and the reference (WT or a different ΔRNase) strains (green bars). The second parameter (‘Noise reduction’) evaluates if the 5΄ or 3΄ end of interest (green bar) is distinct from the noise, defined as the total of 5΄ or 3΄ ends (black bars) in a window size (WS) of 20 nt. To detect only relevant positions, the third parameter (‘Cleavage ratio’) computes the amount of cleaved transcripts by comparing the count of the 5΄ or 3΄ ends of interest (green bar) with the total number of reads covering the position (pink rectangle). The fourth parameter (‘Comparison WT versus ΔRNase’) selected positions where the cleavage ratio (end count normalized to the total coverage, i.e. ratio green bar/pink rectangle) of the reference was superior to the one in the ΔRNase strain. This parameter also prevents any artefact due to a possible change in RNA level between the two analyzed strains.
Figure 2.RNase III cleavage ratios. (A) Proportion of untranslated regions (UTRs) and open reading frames (ORFs) in the genome and among RNase III specific cleavage positions (SCPs). (B) The cleavage ratio of each SCP was plotted for each SCP location (ORF or UTR). Scatter box plot of RNase III cleavage ratios show the first and third quartiles of the data range (top and the bottom of the grey boxes) and the median of the data (line inside the boxes).
Figure 3.RNase III SCPs. (A) RNase III SCPs were classified in three groups. SL: cleavages on both sides of a stem loop forming a 2 nt 3΄ overhang; AS: cleavages on both duplex strands forming a 2 nt 3΄ overhang; SA: stand-alone positions, where the opposite cleavage position could not be retrieved with the ISCP method or previous sRNA sequencing data. We hypothesized that an RNA with SA SCP could base pair with other mRNAs or sRNAs and be co-processed by RNase III but it is also possible that the second cleavage position in the same transcript could not be retrieved here. (B–D) RNA coverage visualized using the Integrative Genome Viewer (IGV) (see also Supplementary Figure S4). The genomic coordinates are indicated. ‘reads’: total read coverage; ‘5΄ ends’ and ‘3΄ ends’: number of reads starting or ending respectively at each nucleotide (black bars). The positions detected by the ISCP method are shown with green bars and green arrows both on the genomic sequence and RNA folding. The SCPs are numbered to facilitate the visualization. Foldings were generated using RNAfold. Grey rectangles: sRNA genes. UTR: untranslated region. (B) Coverage of the 5΄ UTR of pnpA. The position visualized in the sRNA sequencing data is shown with a pink bar and a pink arrow both on the genomic sequence and RNA folding. (C) Coverage of the UTR located between secY and adk. The individual mapped reads are shown. (D) Coverage of Spy_sRNA477714/Spy_sRNA477741 region. Foldings of the RNA from the positive and negative strands are shown.
Figure 4.Venn diagram of differential gene expression in the RNase III deletion strain compared to the WT. Comparison of RNase III-regulated (A) genes and (B) transcripts identified by two approaches: differential expression analysis (DEA) and identification of specific cleavage positions (ISCP). The numbers of affected transcripts are indicated with an up arrow for upregulated genes and a down arrow for downregulated genes.